Nvidia rolls out new AI model which creates audio from text

Nvidia's new experimental model Fugatto can create a music snippet from a text prompt and remove or add instruments in an existing song

Continuing its push in AI, Nvidia has rolled out a new model that creates audio from text prompts. Called Fugatto (Foundational Generative Audio Transformer Opus), it generates or transforms any mix of music, voices and sounds described in prompts that combine text and audio files in any way. “While some AI models can compose a song or modify a voice, none have the dexterity of the new offering,” said Nvidia in a blog post.

How will the model work?

Fugatto can create a music snippet based on a text prompt, remove or add instruments in an existing song, and change the accent or emotion in a voice. It can even let people produce sounds never heard before.

“We wanted to create a model that understands and generates sound like humans do,” said Rafael Valle, a manager of applied audio research at Nvidia.

Copyright © Network18 Media & Investments Limited. All rights reserved. Reproduction of news articles, photos, videos or any other content in whole or in part in any form or medium without express written permission of moneycontrol.com is prohibited.