LLMs Learn from Grammar Books, But There's a Catch

New research shows Large Language Models can explicitly learn languages, challenging prior assumptions.

A recent study reveals Large Language Models (LLMs) can learn new languages from grammar explanations, a process called 'explicit learning.' This capability, however, decreases with increasing linguistic complexity. Fine-tuning improves performance but struggles with generalization.

By Mark Ellison

September 19, 2025

3 min read

LLMs Learn from Grammar Books, But There's a Catch

Key Facts

LLMs possess a measurable capacity for explicit learning from grammar explanations.
This explicit learning ability diminishes as linguistic complexity increases.
Supervised fine-tuning significantly enhances LLM performance in explicit learning.
Fine-tuning struggles to generalize to typologically novel or more complex linguistic features.
The research used controlled translation experiments between English and cryptographically generated languages.

Why You Care

Ever wondered if an AI could learn a new language just like you do, by reading a grammar book? This isn’t science fiction anymore. A new study shows Large Language Models (LLMs) can indeed learn languages this way. This capability, termed “explicit learning,” could change how we develop AI for diverse languages. Why should you care? Because this impacts how accessible AI tools become for everyone, especially those speaking less common languages.

What Actually Happened

Researchers Malik Marmonier, Rachel Bawden, and Benoît Sagot recently explored a fascinating aspect of AI. They investigated an LLM’s capacity to learn new languages. This learning happened by using explanations found in a grammar book, according to the announcement. The team designed controlled translation experiments for this purpose. They translated between English and specially constructed languages. These languages were generated from Latin or French using cryptographic methods, as detailed in the blog post. This approach allowed for a rigorous assessment of the LLM’s learning ability. The study’s findings reveal a measurable capacity for explicit learning in LLMs.

Why This Matters to You

This research has significant implications for how we approach AI language creation. Imagine an LLM that can quickly pick up a new dialect from a simple guide. This could open doors for AI applications in underserved linguistic communities. The study suggests that while LLMs can learn explicitly, their ability diminishes with more complex linguistic phenomena. This is a crucial detail for developers. However, supervised fine-tuning with specific “chains of thought” significantly boosts performance, the research shows. This means targeted training can make a big difference. But what does this mean for your daily interactions with AI?

Consider this breakdown of the findings:

Learning Aspect	LLM Capability
Explicit Learning	Measurable capacity demonstrated
Complexity Impact	Ability diminishes as complexity increases
Fine-tuning	Significantly enhances performance
Generalization	Struggles with novel or more complex features

“Contrary to previous studies, our results demonstrate that LLMs do possess a measurable capacity for explicit learning,” the team revealed. This statement challenges prior assumptions about AI’s learning mechanisms. Think of it as teaching a student from a textbook versus just showing them examples. How might this change the way you interact with translation apps or voice assistants in the future?

The Surprising Finding

Here’s the twist: previous studies often suggested LLMs struggled with this kind of direct, rule-based learning. However, this new research presents a different picture. The study finds that LLMs do have a measurable capacity for explicit learning. This means they can learn from grammar explanations, not just vast amounts of text data. This finding is surprising because it challenges the idea that LLMs are purely statistical pattern matchers. It implies a deeper form of comprehension. However, the paper states that this ability “diminishes as the complexity of the linguistic phenomena to be learned increases.” This suggests a limit to their current explicit learning capabilities. For example, learning simple verb conjugations might be easy, but mastering nuanced idiomatic expressions remains a challenge.

What Happens Next

This research points to clear directions for future AI creation. The team highlights the need for more diverse training sets. They also suggest alternative fine-tuning strategies. These improvements could further enhance explicit learning by LLMs, according to the announcement. Imagine an AI that can learn a new language from a single grammar book in a matter of weeks. This could benefit low-resource languages immensely. These are languages typically described in grammar books but lack extensive digital text corpora. For example, a small indigenous language with limited online presence could gain AI support. This could happen without needing massive data collection efforts. Your future AI tools might become much more adaptable. The industry implications are vast, promising more inclusive AI technologies for global communication.

Ready to start creating?