OpenAI has introduced a voice cloning tool named Voice Engine, following the debut of its video generator, Sora. The tool takes text input and a single 15-second audio sample and produces lifelike speech that closely resembles the original speaker’s voice.
Although the tool has been highly anticipated, OpenAI has opted not to release Voice Engine widely, citing concerns over potential misuse and the proliferation of fake content online. The company emphasized the importance of obtaining explicit consent from anyone whose voice is replicated with the tool and of clearly disclosing to audiences when a voice is AI-generated.
Partners testing Voice Engine with OpenAI have identified a range of potential use cases for the technology. However, the company remains cautious about a broader release, recognizing the risks of synthetic voice misuse, particularly the spread of disinformation through deepfakes during an election year.
The introduction of Voice Engine comes on the heels of Sora, a tool that generates high-quality videos from text prompts. Sora has drawn praise for its capabilities, but it has also raised concerns about its impact on the video production industry. Competitors in this space include startups such as Runway (with its Gen-2 model), Pika Labs, and Stability AI, as well as Google’s latest offering, Lumiere.
As OpenAI continues to develop and refine its AI-powered tools, it says it remains committed to engaging with stakeholders across government, media, entertainment, education, and civil society to gather feedback and ensure these technologies are used responsibly. The company is trying to balance innovation against risk mitigation as it advances its AI systems.