Nvidia · Filed Sep 30, 2025 · Published Jun 4, 2026 · verified — real USPTO data

Nvidia Patents a Way to Bring AI Language Models to Underserved Languages

Most of the world's roughly 7,000 languages get almost nothing from AI language models — Nvidia's new patent describes a systematic way to fix that, at least for one language at a time.

Nvidia Patent: Shrinking LLMs for Low-Resource Languages — figure from US 2026/0154514 A1
FIG. 1A — rendered from the official USPTO publication PDF.
Publication number US 2026/0154514 A1
Applicant NVIDIA Corporation
Filing date Sep 30, 2025
Publication date Jun 4, 2026
Inventors Raviraj JOSHI, Kanishk SINGLA, Anusha KAMATH, Raunak KALANI, Utkarsh VAIDYA, Sanjay Singh CHAUHAN, Niranjan WARTIKAR, Eileen Margaret Peters LONG
CPC classification 704/2
Grant likelihood Medium
Examiner CENTRAL, DOCKET (Art Unit OPAP)
Status Docketed New Case - Ready for Examination (Mar 16, 2026)
Document 20 claims

How Nvidia shrinks AI models for rare languages

Imagine trying to use a voice assistant or chatbot in a language that only a few million people speak. Most modern AI models are trained overwhelmingly on English — and to a lesser extent, a handful of other major languages — which means speakers of smaller or regional languages are largely left out.

Nvidia's patent describes a method for taking a large, general-purpose AI language model and squeezing it down into a smaller, more focused model that handles both English and one specific underrepresented language. The smaller model is designed to be efficient enough to run on devices with limited computing power, not just massive data centers.

To teach the model the target language, Nvidia's approach creates extra training data by translating English content, then also converts some of that text into the Roman alphabet — a trick that helps because many speakers of regional languages type in their native tongue using English letters. The model is also tuned to reflect local values around things like politeness, privacy, and conversational tone.

Inside Nvidia's compression and transliteration pipeline

The patent outlines a multi-step pipeline for producing a bilingual Small Language Model (SLM) — an AI model compact enough to run efficiently, yet capable in both English and a chosen Low-Resource Language (LRL).

The process starts with model compression: a large multilingual LLM is pruned or distilled (think of it as editing a sprawling encyclopedia into a focused guidebook) to produce a smaller multilingual base model. That base model is then refined through continued pre-training — additional training runs — on a curated mix of four types of data:

  • Natural English content — existing English-language text from real sources
  • Natural LRL content — authentic text already written in the target language
  • Synthetic LRL content — English sources machine-translated into the target language to bulk up the training set
  • Transliterated LRL content — text written in the target language but converted into Roman script (Latin letters), capturing how many real users type regional languages on standard keyboards

After pre-training, the model goes through alignment techniques — essentially a values-tuning stage that adjusts the model's outputs to match local cultural norms around profanity, privacy, bias, and conversational style. This is similar in spirit to the RLHF (Reinforcement Learning from Human Feedback) process used to make models like ChatGPT feel polite and helpful, but tailored per language community.

What this means for non-English AI applications

The vast majority of AI assistant products today work well in English and a small cluster of high-resource languages like Mandarin, Spanish, and French. Speakers of regional languages — think Marathi, Kannada, or Swahili — often get poor performance or nothing at all. This patent describes a repeatable factory for spinning up capable, compact models for any underrepresented language, which could make localized AI products viable for the first time.

For Nvidia, this also connects to its broader push into edge AI and on-device inference. A smaller, bilingual SLM is the kind of model that could run on an Nvidia-powered embedded system or smart device in a market where English-first AI has failed to land. Whether this becomes a product or a platform others build on top of is the real question.

Editorial take

This is quietly important work. Language coverage is one of the most stubborn gaps in AI, and Nvidia is approaching it with an engineering pipeline rather than just throwing more data at the problem. The transliteration angle — accounting for how people actually type regional languages using Roman letters — shows real-world awareness that most academic NLP work misses.

Get one Big Tech patent every Sunday

Plain English, intelligent commentary, no hype. Free.

Source. Full patent text and figures from the official USPTO publication PDF.

Editorial commentary on a publicly published patent application. Not legal advice.