HomeTechnologyNvidia rolls out new AI model which creates audio from text

Nvidia rolls out new AI model which creates audio from text

Nvidia's new experimental model Fugatto can create a music snippet based on a text prompt, remove or add instruments from an existing song

November 25, 2024 / 20:12 IST
Story continues below Advertisement
Nvidia
Nvidia

Continuing its charge in AI innovation, Nvidia has rolled out a new AI model which can create audio from text prompts. Called Fugatto, (Foundational Generative Audio Transformer Opus), it generates or transforms any mix of music, voices and sounds described with prompts using any combination of text and audio files. “While some AI models can compose a song or modify a voice, none have the dexterity of the new offering,” said Nvidia in a blog post.

How will the model work?

Story continues below Advertisement

Fugatto can create a music snippet based on a text prompt, remove or add instruments from an existing song, change the accent or emotion in a voice. Furthermore, it can even let people produce sounds never heard before.

“We wanted to create a model that understands and generates sound like humans do,” said Rafael Valle, a manager of applied audio research at Nvidia.