HomeNewsTechnologySarvam AI unveils OpenHathi, the first Hindi large language model

Sarvam AI unveils OpenHathi, the first Hindi large language model

The model is built on Meta AI's Llama2-7B architecture, and according to Sarvam AI, it delivers performance on par with GPT-3.5 for Indic languages.

December 13, 2023 / 15:05 IST
Story continues below Advertisement
.
Sarvam AI’s AI model has a 48,000-token extension of Llama2-7B’s tokenizer and undergoes a two-phase training process.

Homegrown AI startup Sarvam AI has released OpenHathi-Hi-v0.1, the first Hindi large language model (LLM) in the OpenHathi series.

The model is built on Meta AI's Llama2-7B architecture, and according to Sarvam AI, it delivers performance on par with GPT-3.5 for Indic languages.

Story continues below Advertisement

The AI model used by Sarvam AI has a 48,000-token extension of Llama2-7B’s tokenizer and undergoes a two-phase training process. The first phase involves embedding alignment, which aligns randomly initialised Hindi embeddings. The second phase is bilingual language modeling, where the model is trained to attend cross-lingually across tokens.

Also read: What powers ChatGPT and Bard? A look at LLMs or large language models