Categories
News

Nvidia Unveils AI Model For Voice Modification And Novel Sounds Generation


Nvidia unveiled a brand new synthetic intelligence mannequin for creating music and audio that may alter voices and create unique sounds.

TakeAway Factors:

  • A brand new AI mannequin for creating music and audio that may alter voices and create unique sounds was unveiled by Nvidia on Monday.
  • Santa Clara, California-based Nvidia’s model generates sound results and music from a textual content description, together with novel sounds similar to making a trumpet bark like a canine.
  • Nvidia’s new mannequin was skilled on open-source information, and the corporate mentioned it’s nonetheless debating whether or not and launch it publicly.

Voice modification and sound technology synthetic intelligence mannequin

A brand new synthetic intelligence mannequin for music and audio manufacturing that may change voices and produce distinctive sounds was offered by Nvidia. This know-how is focused at those that create video video games, films, and music.

Nvidia, the world’s largest provider of chips and software program used to create AI methods, mentioned it doesn’t have rapid plans to publicly launch the know-how, which it calls Fugatto, quick for Foundational Generative Audio Transformer Opus 1.

It joins different applied sciences proven by startups similar to Runway and bigger gamers similar to Meta Platforms that may generate audio or video from a textual content immediate.

Santa Clara, California-based Nvidia’s model generates sound results and music from a textual content description, together with novel sounds similar to making a trumpet bark like a canine.

What makes it completely different from different AI applied sciences is its potential to soak up and modify current audio, for instance, by taking a line performed on a piano and remodeling it right into a line sung by a human voice, or by taking a spoken phrase recording and altering the accent used and the temper expressed.

“If we take into consideration artificial audio over the previous 50 years, music sounds completely different now due to computer systems, due to synthesizers,” mentioned Bryan Catanzaro, vice chairman of utilized deep studying analysis at Nvidia. “I believe that generative AI goes to deliver new capabilities to music, to video video games and to bizarre of us that need to create issues.”

Whereas firms similar to OpenAI are negotiating with Hollywood studios over whether or not and the way the AI may very well be used within the leisure business, the connection between tech and Hollywood has grow to be tense, significantly after Hollywood star Scarlett Johansson accused OpenAI of imitating her voice.

Options

Nvidia’s new mannequin was skilled on open-source information, and the corporate mentioned it’s nonetheless debating whether or not and launch it publicly.

“Any generative know-how all the time carries some dangers, as a result of folks would possibly use that to generate issues that we would like they don’t,” Catanzaro mentioned. “We must be cautious about that, which is why we don’t have rapid plans to launch this.”

Creators of generative AI fashions have but to find out forestall abuse of the know-how similar to a consumer producing misinformation or infringing on copyrights by producing copyrighted characters.

OpenAI and Meta equally haven’t mentioned once they plan to launch to the general public their fashions that generate audio or video.











Source link

Leave a Reply

Your email address will not be published. Required fields are marked *