OpenAI has announced a one-month delay in the release of its much-anticipated Voice Mode for ChatGPT, citing safety concerns as the reason behind the decision. The company stated that more time is needed to enhance the model’s ability to detect and reject certain content, in addition to improving user experience and scaling infrastructure for millions of users.
The decision follows the introduction of a new safety and security committee by OpenAI, tasked with overseeing future projects and operations. It remains unclear whether the postponement was influenced by the committee or internal stakeholders within the company.
In a recent announcement, OpenAI revealed the GPT-4o system in May, highlighting significant advancements in human-computer interaction. The omni model boasts a response time to audio inputs of 320 milliseconds on average, similar to human response times.
The Voice Mode feature promises real-time conversations with human-like emotional responses, raising concerns about potential manipulation and the spread of misinformation. The model’s ability to understand nuances such as tone, non-verbal cues, and background noise blurs the lines between human and machine interaction.
While an alpha release is planned for limited paid subscribers in July, the broader rollout schedule remains uncertain pending user feedback. The company emphasizes a commitment to high safety and reliability standards.
The delay in the Voice Mode rollout comes after controversy erupted over actress Scarlett Johansson accusing OpenAI of using her voice without permission in technology demonstrations. OpenAI clarified that the voice of Sky, one of the five voice modulations offered by Voice Mode, belonged to a voice artist and not Johansson.
Despite efforts to resolve the issue with Johansson, OpenAI withdrew the Sky voice sample due to legal pressure. The incident underscored ethical considerations around deepfakes and synthetic media, prompting a pause in using Sky’s voice in the company’s products.
Additionally, potential regulatory hurdles faced by Apple under the EU’s Digital Markets Act cast uncertainty over the integration of OpenAI’s technology into Apple devices. The partnership aimed at leveraging OpenAI’s capabilities in Apple’s Apple Intelligence system, but regulatory challenges may impact the collaboration’s future.
Overall, the combination of safety concerns, potential for misuse, and regulatory obstacles presents a complex landscape for OpenAI’s Voice Mode. The industry will closely monitor the technology’s evolution, focusing on security vulnerabilities and implications for responsible AI development in the future.