HomeNewsTechnologySarvam AI unveils OpenHathi, the first Hindi large language model

Sarvam AI unveils OpenHathi, the first Hindi large language model

The model is built on Meta AI's Llama2-7B architecture, and according to Sarvam AI, it delivers performance on par with GPT-3.5 for Indic languages.

Sarvam AI’s AI model has a 48,000-token extension of Llama2-7B’s tokenizer and undergoes a two-phase training process.

Homegrown AI startup Sarvam AI has released OpenHathi-Hi-v0.1, the first Hindi large language model (LLM) in the OpenHathi series.

The model is built on Meta AI's Llama2-7B architecture, and according to Sarvam AI, it delivers performance on par with GPT-3.5 for Indic languages.

Story continues below Advertisement

Remove Ad

The AI model used by Sarvam AI has a 48,000-token extension of Llama2-7B’s tokenizer and undergoes a two-phase training process. The first phase involves embedding alignment, which aligns randomly initialised Hindi embeddings. The second phase is bilingual language modeling, where the model is trained to attend cross-lingually across tokens.

Also read: What powers ChatGPT and Bard? A look at LLMs or large language models

Download MC Apps:

Copyright © Network18 Media & Investments Limited. All rights reserved. Reproduction of news articles, photos, videos or any other content in whole or in part in any form or medium without express written permission of moneycontrol.com is prohibited.

English

Markets

News

Personal Finance

Mutual Funds

Commodities

Media

Invest Now

Specials

Sarvam AI unveils OpenHathi, the first Hindi large language model

The model is built on Meta AI's Llama2-7B architecture, and according to Sarvam AI, it delivers performance on par with GPT-3.5 for Indic languages.

Related Stories

Trending Topics

News

Markets

Personal Finance

Mutual Funds

Tools

Community

Network 18 Sites

Quick Links