Meta has developed a new way to train Automatic Speech Recognition (ASR) models by clustering speech at an "utterance level".
ASR models, as the name implies, are used in systems that transcribe spoken language into text, which can then be used to carry out various functions. The most familiar examples of ASR in action are voice assistants such as Apple's Siri, Amazon's Alexa and Google Assistant.
Despite advances in AI technology, these assistants can still have a hard time understanding your speech. Meta aims to improve on this by clustering speech from speakers of different backgrounds together, rather than relying on traditional data sets that train ASR models based on metrics such as age group or gender.
The goal is to group similar utterances from a diverse set of speakers into one data set, and then use that data set to train the ASR model.
Meta says this lets it train the model "using the various clusters and use fairness datasets to measure how the model impacts outcomes across different demographic groups. The clustering is performed using unsupervised learning, leveraging algorithms to analyze and group unlabeled data sets without human intervention".
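To make the idea concrete, here is a minimal sketch of what utterance-level clustering with unsupervised learning could look like. Meta has not published this exact pipeline; the sketch assumes each utterance has already been converted into a fixed-size embedding by some speech encoder, and simply groups those unlabeled embeddings with k-means.

```python
# Illustrative sketch only: not Meta's published pipeline.
# Assumes utterances are already represented as fixed-size embeddings
# (e.g., from a pretrained speech encoder, which is not shown here).
import numpy as np
from sklearn.cluster import KMeans

def cluster_utterances(embeddings: np.ndarray, n_clusters: int = 8) -> np.ndarray:
    """Group unlabeled utterance embeddings into clusters (unsupervised)."""
    kmeans = KMeans(n_clusters=n_clusters, n_init=10, random_state=0)
    return kmeans.fit_predict(embeddings)

# Hypothetical data: 10,000 utterances, each a 256-dimensional embedding.
utterance_embeddings = np.random.rand(10_000, 256)
cluster_ids = cluster_utterances(utterance_embeddings)

# Each cluster can then serve as a training subset; demographic labels are
# only brought in afterwards, via fairness datasets, to check how the
# trained model performs across groups.
for c in np.unique(cluster_ids):
    print(f"cluster {c}: {np.sum(cluster_ids == c)} utterances")
```

The key point the sketch illustrates is that the clustering step never sees demographic labels: groups emerge purely from how the utterances themselves sound.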
The company said it observed improved accuracy across various demographic groups and accents in models trained with this method.
