Why You Care
Ever felt frustrated when AI tools struggle with languages beyond English? Many of us have. What if your native language, spoken by hundreds of millions, were largely ignored by the latest AI advances? That has been the reality for Urdu speakers. Now a new development promises to change it. Researchers have introduced Qalb, the largest Urdu large language model (LLM) to date, designed specifically for the language’s 230 million speakers. For you, that means more accurate translations, better content generation, and more natural interactions in Urdu.
What Actually Happened
Despite rapid progress in large language models, Urdu remains significantly underrepresented in modern natural language processing (NLP) systems. Existing multilingual models often perform poorly on Urdu-specific tasks, as detailed in the blog post: they struggle with the language’s complex morphology, its right-to-left Nastaliq script, and its rich literary traditions. Even the well-known LLaMA-3.1 8B-Instruct model showed limited ability to generate fluent or contextually appropriate Urdu text. To address this, a team of researchers introduced Qalb, an Urdu large language model developed in two stages: continued pre-training followed by supervised fine-tuning. Starting from the LLaMA 3.1 8B model, the team performed continued pre-training on a 1.97-billion-token dataset: 1.84 billion tokens of diverse Urdu text spanning news archives, classical and contemporary literature, government documents, and social media, combined with 140 million tokens of English Wikipedia data to prevent catastrophic forgetting, according to the announcement. The resulting model was then fine-tuned on the Alif Urdu-instruct dataset.
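As a quick sanity check on the pre-training mixture described above, the sketch below computes the Urdu/English token shares from the announced counts. The token figures come from the announcement; the variable names and rounded percentages are illustrative, not taken from the Qalb codebase.

```python
# Token counts for Qalb's continued pre-training corpus, as reported
# in the announcement (names here are illustrative, not from the project).
URDU_TOKENS = 1_840_000_000    # news, literature, government documents, social media
ENGLISH_TOKENS = 140_000_000   # English Wikipedia, mixed in to curb catastrophic forgetting

total = URDU_TOKENS + ENGLISH_TOKENS
urdu_share = URDU_TOKENS / total
english_share = ENGLISH_TOKENS / total

print(f"Urdu share: {urdu_share:.1%}")        # roughly 93% of the mixture
print(f"English share: {english_share:.1%}")  # the small replay slice of English text
```

The small English slice is the standard replay-style guard against catastrophic forgetting: keeping a fraction of the original training distribution in the mix helps the model retain its base-language capabilities while adapting to Urdu.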
Why This Matters to You
Qalb’s release marks a significant leap forward for Urdu speakers and anyone working with the language. Imagine an AI assistant that understands the nuances of Urdu poetry and truly grasps the cultural context of your conversations. The model shows substantial improvements across benchmarks: the research reports a weighted average score of 90.34, significantly outperforming the previous Alif-1.0-Instruct model and surpassing the base LLaMA-3.1 8B-Instruct model by a remarkable 44.64 points. In practice, that means more accurate and reliable AI applications for your daily use. How much easier would your digital life be with an AI that truly speaks your language?
Here’s a quick look at Qalb’s performance:
| Model | Weighted Average Score |
| --- | --- |
| Qalb | 90.34 |
| Alif-1.0-Instruct | 87.1 |
| LLaMA-3.1 8B-Instruct | 45.7 |
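The margins quoted in this article follow directly from the table’s scores; a minimal check (the dictionary below simply restates the table):

```python
# Weighted average benchmark scores from the table above.
scores = {
    "Qalb": 90.34,
    "Alif-1.0-Instruct": 87.1,
    "LLaMA-3.1 8B-Instruct": 45.7,
}

# Gap between Qalb and each baseline, rounded to two decimals.
margin_over_base = round(scores["Qalb"] - scores["LLaMA-3.1 8B-Instruct"], 2)
margin_over_alif = round(scores["Qalb"] - scores["Alif-1.0-Instruct"], 2)

print(margin_over_base)  # 44.64, the reported margin over LLaMA-3.1 8B-Instruct
print(margin_over_alif)  # 3.24, the gap over Alif-1.0-Instruct
```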
As Muhammad Taimoor Hassan, one of the authors, stated, “Our results demonstrate that continued pre-training on diverse, high-quality language data, combined with targeted instruction fine-tuning, effectively adapts foundation models to low-resource languages.” This approach ensures that Qalb not only understands Urdu but also respects its unique linguistic characteristics. For example, if you’re a content creator, you can expect AI tools powered by Qalb to generate more culturally relevant and grammatically correct Urdu content. This will save you time and effort.
The Surprising Finding
What truly stands out is the scale of Qalb’s improvement over existing multilingual models. It is not a small step forward: the team revealed that Qalb surpassed the base LLaMA-3.1 8B-Instruct model by an astonishing 44.64 points. This finding challenges the common assumption that a large, general-purpose multilingual model is sufficient for less-resourced languages. The study finds that such models often struggle with complex linguistic features like Urdu’s Nastaliq script and rich morphology. Systematic continued pre-training and fine-tuning on a massive, diverse Urdu dataset proved crucial, and this specialized approach clearly delivered superior results. It highlights the necessity of dedicated language-specific training, especially for languages with unique characteristics.
What Happens Next
The introduction of Qalb signals a promising future for Urdu in the AI landscape. We can expect more specialized AI applications to emerge within the next 12-18 months: developers might integrate Qalb into translation services for highly accurate Urdu-to-English and English-to-Urdu translation, and content platforms could use it to generate culturally sensitive Urdu articles or social media posts. The industry implications are vast, and the work could encourage more investment in language-specific AI development for other underrepresented languages. Our advice to you: keep an eye on new AI tools offering Urdu support, and prioritize those that explicitly mention building on models like Qalb to get the best performance. The technical report explains that this systematic approach could serve as a blueprint for adapting foundation models, leading to better AI for many more languages globally.
