Large Language Models (LLMs) are the backbone of many Natural Language Processing (NLP) applications, such as ChatGPT and Bard.
When you interact with a conversational AI chatbot, LLMs help process your prompts. They help the AI understand your text, break down the meanings of your words, and establish context to generate a response.
Advancements in the field of neural networks have led to more complex systems that are the key to realising the full potential of AI.
Here are some of the most prominent LLMs that are leading the way.
GPT-3.5
You may have heard of this one. It powers one of the most impressive demonstrations of NLP today: ChatGPT.
GPT stands for Generative Pre-trained Transformer. It is a deep-learning neural network that helps ChatGPT understand and respond to human text.
OpenAI trained the model on vast amounts of text data from across the internet, but the wrinkle is the additional fine-tuning OpenAI applied afterwards.
It used Reinforcement Learning from Human Feedback (RLHF) to improve the model's predictions.
In simpler terms, the LLM was trained under human supervision: reviewers categorised each of its outputs as desirable or undesirable, which allowed the model to learn the way humans do, from mistakes and positive reinforcement.
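For a flavour of the idea, here is a toy Python sketch of such a feedback loop. The candidate responses, the scoring rule and the learning rate are all invented for illustration; OpenAI's actual pipeline trains a separate reward model and is far more involved.

import random

# Toy illustration of the RLHF idea, not OpenAI's actual pipeline:
# a "policy" scores a few canned responses, a (simulated) human labels
# the chosen one desirable or undesirable, and the score is nudged
# in the direction of the feedback.

CANDIDATES = [
    "I don't know.",                    # unhelpful
    "The capital of France is Paris.",  # desirable
    "France? Never heard of it.",       # undesirable
]

# One learnable score per candidate (a stand-in for model parameters).
scores = {c: 0.0 for c in CANDIDATES}

def human_feedback(response: str) -> float:
    """Simulated human reviewer: +1 for desirable, -1 for undesirable."""
    return 1.0 if "Paris" in response else -1.0

LEARNING_RATE = 0.5

for step in range(20):
    # Sample a response, favouring higher-scoring candidates.
    weights = [2.0 ** scores[c] for c in CANDIDATES]
    response = random.choices(CANDIDATES, weights=weights)[0]

    # Reinforce: move the chosen response's score toward the feedback.
    scores[response] += LEARNING_RATE * human_feedback(response)

print(max(scores, key=scores.get))  # the response humans preferred

After enough rounds of feedback, the desirable response dominates the sampling, which is the essence of what RLHF does at a vastly larger scale.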
GPT-4, scheduled for late 2023, will focus on refining the model's parameters and fine-tuning it further. OpenAI says this will also save computing costs, since it will be a more efficient system.
LaMDA
Google's Language Model for Dialogue Applications (LaMDA) powers the recently announced Bard AI.
This LLM was trained on conversational dialogue, which Google says allows it to pick up subtle linguistic nuances and hold open-ended conversations.
Google has already announced a successor to the model, LaMDA 2, which is more finely tuned and can provide recommendations on user queries.
LaMDA 2 uses Google's Pathways Language Model (PaLM) with 540 billion parameters, allowing it to analyse even more complex queries.
WuDao 2.0
The biggest model in existence, with a whopping 1.75 trillion parameters, is China's WuDao 2.0.
Created by the Beijing Academy of Artificial Intelligence (BAAI) and introduced in January 2021, WuDao 1.0 was the first out of the gate.
In May 2021, BAAI unveiled the second revision titled WuDao 2.0.
It is similar to GPT but with a far larger set of parameters. Comparatively, GPT-3.5 runs on 175 billion parameters, whereas WuDao 2.0 has 1.75 trillion. WuDao 2.0 can simulate human speech and generate content.
MT-NLG
Jointly developed by Nvidia and Microsoft, the Megatron-Turing Natural Language Generation (MT-NLG) model is the successor to Nvidia's Megatron-LM and Microsoft's Turing NLG 17B.
The 530-billion-parameter model was trained on Nvidia's Selene ML supercomputer.
The 105-layer deep neural network can undertake a vast set of natural language tasks, such as completion prediction, reading comprehension, commonsense reasoning, natural language inference and more.
Bloom
One of the newest models on the list, the BigScience Large Open-science Open-access Multilingual Language Model (Bloom) is an open-source LLM built by a consortium of more than 1,000 AI researchers.
The model can generate text in 46 languages and code in 13 programming languages.
With 176 billion parameters, the LLM can be accessed in the cloud or run on a local machine.
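Because the weights are openly published, running a scaled-down checkpoint locally can be as simple as the sketch below. The Hugging Face transformers library and the smaller bloom-560m checkpoint are assumptions here, since the article does not name a toolchain; the full 176-billion-parameter model needs far more memory than a typical local machine has.

# A minimal sketch of running a Bloom checkpoint locally with the
# Hugging Face transformers library (pip install transformers torch).
# bigscience/bloom-560m is one of the smaller published checkpoints;
# the full bigscience/bloom model is 176 billion parameters.
from transformers import pipeline

generator = pipeline("text-generation", model="bigscience/bloom-560m")

# Bloom was trained on 46 natural languages, so prompts need not be English.
print(generator("Le modèle Bloom peut", max_new_tokens=20)[0]["generated_text"])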