OpenAI Unveils Voice Cloning Tool Just From 15 Seconds of Audio
OpenAI has recently revealed its groundbreaking voice cloning technology, known as Voice Engine, which can replicate any speaker’s voice with just a short 15-second audio sample. The company boasts that this tool can generate speech that sounds completely natural, with emotive and realistic qualities.
This innovative technology builds upon OpenAI’s pre-existing text-to-speech API and has been under development since 2022. By utilizing Voice Engine, individuals can create lifelike voices that closely resemble the original speaker. The capabilities of this tool have been demonstrated through various audio samples shared on OpenAI’s official blog.
While Voice Engine shows promise in aiding reading assistance, language translation, and assisting individuals with speech impairments, there are concerns regarding potential misuse. The technology could be exploited for deepfake purposes, raising serious privacy and ethical issues that need to be addressed before a full-scale launch.
In response to these risks, OpenAI has implemented safety measures such as requiring users to disclose that the voices are AI-generated, incorporating watermarking for audio traceability, and proactive monitoring to oversee its usage. The company also plans to establish a no-go voice list to prevent the impersonation of prominent figures without consent.
Although Voice Engine’s official release date remains undisclosed, speculations suggest that it could be priced at $15 per one million characters for the standard version and double the cost for an HD version. This affordability could revolutionize audiobook creation and accessibility.
Moreover, OpenAI continues to make strides in the tech industry with its recent collaboration with Microsoft on the development of an AI-based supercomputer called Stargate. This partnership, estimated to cost $100 billion, signifies OpenAI’s commitment to advancing artificial intelligence technologies.
As the world eagerly anticipates the launch of Voice Engine and other AI innovations, it is essential to prioritize ethical considerations and regulatory frameworks to ensure responsible usage in a rapidly evolving digital landscape.