LLMs Bridge Science Language Gaps with 95.9% Accuracy

New research shows large language models can translate scientific papers, making global knowledge more accessible.

A recent study by Hannah Calzi Kleidermacher and James Zou demonstrates how large language models (LLMs) can effectively translate scientific articles into multiple languages. This approach, which maintains original formatting, achieved an average accuracy of 95.9% in conveying key scientific details, aiming to reduce language barriers in global research.

By Mark Ellison

September 6, 2025

4 min read

LLMs Bridge Science Language Gaps with 95.9% Accuracy

Key Facts

Large language models (LLMs) can translate scientific papers while preserving JATS XML formatting.
The translation method achieved an average performance of 95.9% accuracy in conveying key scientific details.
Translations were conducted across multiple scientific disciplines into 28 languages.
A user study found that a third of authors preferred some English technical terms to remain untranslated.
In-context learning techniques can mitigate 'overtranslation' and align with domain-specific preferences.

Why You Care

Imagine a world where language is no longer a barrier to scientific discovery. What if every notable paper, regardless of its original language, was instantly accessible to researchers worldwide? A new study reveals that large language models (LLMs) are making this vision a reality. This creation could profoundly change how you access and share scientific knowledge.

What Actually Happened

Researchers Hannah Calzi Kleidermacher and James Zou have developed a novel method for translating scientific papers using large language models. As detailed in the blog post, their approach leverages LLMs to translate published scientific articles. Crucially, it preserves their native JATS XML formatting. This creates a practical, automated approach for academic journals. The team revealed they translated articles across multiple scientific disciplines into 28 different languages.

To evaluate the translation accuracy, the study introduced a new question-and-answer (QA) benchmarking method. In this method, an LLM generates comprehension-based questions from the original text. Then, it answers these questions based on the translated text. The company reports that their benchmark results show an impressive average performance of 95.9%. This indicates that key scientific details are accurately conveyed.

Why This Matters to You

This research has significant implications for the global scientific community and for anyone interested in accessing diverse research. For example, think of a researcher in Brazil who struggles to read oncology papers published only in English. This system could provide , accurate translations, democratizing access to essential information. The study finds that authors consistently found the translations to accurately capture the original information in their articles. This suggests high reliability.

What’s more, the team revealed that a user study involved translating scientific papers for 15 researchers into their native languages. This practical application highlights the utility of the system. How might this system change your ability to consume global research?

Consider the practical benefits:

Increased Accessibility: Researchers from non-English speaking countries can access a wider range of publications.
Faster Dissemination: New findings can reach a global audience more quickly.
Enhanced Collaboration: Language barriers, a common hurdle, are significantly reduced.
Preservation of Formatting: The JATS XML format is maintained, crucial for scientific documents.

According to the announcement, “the authors consistently found the translations to accurately capture the original information in their articles.” This strong endorsement from actual users underscores the quality of the translations.

The Surprising Finding

While the overall accuracy was high, the study uncovered an interesting nuance. Interestingly, a third of the authors involved in the user study found many technical terms “overtranslated.” These authors expressed a preference to keep certain terminology more familiar in English untranslated. This finding challenges the assumption that a , literal translation is always desired. Sometimes, established English technical terms are simply understood universally. This insight suggests that future LLM-driven translation tools may need to offer more nuanced control.

For instance, a term like ‘machine learning’ might be universally . Overtanslating it could introduce confusion rather than clarity. The technical report explains how in-context learning techniques can be used to align translations with domain-specific preferences. This includes mitigating overtranslation, highlighting the adaptability and utility of LLM-driven scientific translation.

What Happens Next

The future for this system looks promising. The team revealed that the code and translated articles are already available, suggesting a potential rollout for academic publishers in the coming months. We might see initial integrations into journal submission systems by late 2025 or early 2026. This would allow authors to publish in English while providing LLM-generated translations simultaneously.

Imagine a major scientific journal offering a ‘translate’ button for every article on its system. This would open up its content to millions more readers instantly. For you, this means a future where language is less of a barrier to accessing the latest scientific breakthroughs. The research shows how in-context learning can refine translations. This suggests continuous betterment in accuracy and nuance. This adaptability makes LLM-driven scientific translation a tool for the global research community.

Ready to start creating?