OpenAI holds back wide release of voice-cloning tech due to misuse concerns

Voice Engine can clone voices with 15 seconds of audio, but OpenAI is warning of potential misuse.

Voice synthesis has come a long way since 1978’s Speak & Spell toy, which once wowed people with its state-of-the-art ability to read words aloud using an electronic voice. Now, using deep-learning AI models, software can not only create realistic-sounding voices but also convincingly imitate existing voices using small samples of audio.

Along those lines, OpenAI just announced Voice Engine, a text-to-speech AI model for creating synthetic voices based on a 15-second segment of recorded audio. The company has provided audio samples of Voice Engine in action on its website.

Once a voice is cloned, a user can input text into Voice Engine and get an AI-generated voice result. But OpenAI is not ready to release the technology widely yet. The company initially planned to launch a pilot program for developers to sign up for the Voice Engine API earlier this month. After further consideration of the ethical implications, however, it decided to scale back its ambitions for now.


