SANSKRITI: Benchmarking AI's Understanding of Indian Culture

New dataset reveals how well language models grasp India's rich cultural diversity.

A new benchmark called SANSKRITI has been introduced to evaluate how effectively language models understand Indian culture. This dataset, featuring over 21,000 question-answer pairs, highlights significant gaps in AI's cultural knowledge, especially in region-specific contexts. It aims to improve AI's global effectiveness by focusing on local socio-cultural nuances.

By Mark Ellison

October 30, 2025

4 min read

SANSKRITI: Benchmarking AI's Understanding of Indian Culture

Key Facts

SANSKRITI is a new benchmark for evaluating language models' knowledge of Indian culture.
It comprises 21,853 question-answer pairs covering 28 states and 8 union territories.
The benchmark evaluates sixteen key attributes of Indian culture, including rituals, cuisine, and festivals.
Evaluations revealed significant disparities and struggles in models' ability to handle culturally nuanced queries, especially in region-specific contexts.
The dataset aims to set a new standard for assessing and improving the cultural understanding of Language Models (LMs).

Why You Care

Ever wondered if your favorite AI truly understands the world’s diverse cultures? Can it differentiate between a Diwali celebration in Punjab and a Durga Puja in Bengal? A new benchmark, SANSKRITI, shows that many language models struggle with such cultural nuances, according to the announcement. This directly impacts how useful AI can be in various global contexts, especially for your content creation or research focused on India.

What Actually Happened

Researchers have introduced SANSKRITI, a comprehensive benchmark designed to evaluate language models’ (LMs) knowledge of Indian culture, as detailed in the blog post. This benchmark addresses a essential gap: while LMs are tools, their global effectiveness relies on understanding local socio-cultural contexts. SANSKRITI is the largest dataset of its kind, comprising 21,853 meticulously curated question-answer pairs. These questions cover 28 states and 8 union territories within India, according to the paper. The dataset spans sixteen key attributes of Indian culture, offering a comprehensive representation of India’s cultural tapestry. This includes aspects like rituals, cuisine, dance, and festivals, the research shows.

Why This Matters to You

Imagine you are a content creator trying to develop a series about Indian festivals. You rely on an AI to generate scripts or research facts. If that AI lacks a deep understanding of regional variations, your content could be inaccurate or even offensive. SANSKRITI directly addresses this challenge, aiming to make AI more culturally intelligent for users like you.

For example, if you ask an AI about ‘Bihu,’ a harvest festival, SANSKRITI helps evaluate if the AI understands its significance primarily in Assam. Without this benchmark, an AI might offer generic festival information, missing the specific cultural context. The study finds that current Large Language Models (LLMs), Indic Language Models (ILMs), and Small Language Models (SLMs) show significant disparities in handling culturally nuanced queries. Many models struggle with region-specific contexts, the team revealed.

Key Cultural Attributes Evaluated by SANSKRITI:

Category	Description
Rituals & Ceremonies	Traditional practices and customs
History	Significant past events and figures
Tourism	Popular destinations and cultural sites
Cuisine	Regional food and culinary traditions
Dance & Music	Traditional and contemporary performing arts
Costume	Traditional attire and fashion
Language	Regional languages and dialects
Art	Visual arts, crafts, and architecture
Festivals	Celebrations and their cultural significance
Religion	Belief systems and spiritual practices
Medicine	Traditional and modern healthcare practices
Transport	Modes of travel and infrastructure
Sports	Traditional games and popular sports
Nightlife	Evening entertainment and social activities
Personalities	Influential figures and celebrities

How much more effective could your AI-powered research be if it truly understood the intricate cultural fabric of a region? This benchmark aims to improve AI’s ability to provide more accurate and contextually relevant information for your needs. “Language Models (LMs) are indispensable tools shaping modern workflows, but their global effectiveness depends on understanding local socio-cultural contexts,” according to the authors.

The Surprising Finding

Here’s an interesting twist: despite the capabilities of many leading language models, the research shows they still exhibit significant weaknesses in culturally specific knowledge. The study found that many models struggle particularly with region-specific contexts. This is surprising because these models are trained on vast amounts of data. However, this data often lacks the depth required for nuanced cultural understanding, especially for diverse regions like India. For example, an AI might know about ‘Indian food’ generally but fail to distinguish between the distinct culinary traditions of North and South India. This challenges the common assumption that simply having more data automatically leads to comprehensive cultural intelligence. The benchmark highlights that quantity alone is not enough; the quality and cultural specificity of training data are crucial.

What Happens Next

The introduction of SANSKRITI sets a new standard for evaluating AI’s cultural understanding, as mentioned in the release. We can expect AI developers to use this benchmark to refine their models over the next 6-12 months. This will likely lead to more culturally aware AI tools. For example, future AI assistants might offer more personalized travel recommendations based on specific regional customs. Content creators could see AI generating more authentic and contextually rich narratives about diverse cultures.

Developers should consider integrating more culturally diverse datasets into their training pipelines. Users, meanwhile, should continue to critically assess AI outputs for cultural accuracy. The industry implications are clear: AI must move beyond generic knowledge to embrace deep cultural intelligence. This will ensure AI tools are truly globally effective. The researchers hope SANSKRITI will aid in “assessing and improving the cultural understanding of LMs.”

Ready to start creating?