OpenAI has started the roll-out of advanced voice mode for some ChatGPT Plus users, offering a more interactive experience with real-time voice conversations. The Microsoft-backed AI company announced this development on Tuesday in a post on X.
Originally scheduled for late June, the launch was postponed to July to ensure the feature met the company’s standards. The new voice capabilities allow users to have real-time spoken conversations with ChatGPT, including the ability to interrupt the AI mid-speech—features that enhance the realism of the interaction.
In June, OpenAI stated that improvements were being made to the model’s ability to detect and reject inappropriate content. The company has also been enhancing the user experience and preparing its infrastructure for broader deployment.
Limited Availability and Testing
The advanced voice mode is currently available to a small, select group of Plus users. Upon opening the app, these users will receive an invitation to try the new feature, indicated by a notification at the bottom of the screen. Clicking the notification will lead them to a page titled “You’re invited to try the advanced Voice Mode,” where they can activate the feature.
OpenAI did not specify the criteria for selecting participants in this initial rollout, which is referred to as an “alpha” stage. The feature is powered by GPT-4o.
OpenAI has emphasized its commitment to safety and quality in voice interactions. The company has tested the feature with over 100 external “red teamers” across 45 languages. These cybersecurity experts simulate potential attacks and attempt to expose vulnerabilities in the system before they become widely available.
Limited Voice Options
As OpenAI starts the roll-out of advanced voice mode to some ChatGPT Plus users, only a few preset voices are currently available for use. Currently, users can choose from four preset voices. The voice “Sky,” previously controversial for its resemblance to actress Scarlett Johansson, has not been reintroduced.
This rollout marks another step in OpenAI’s efforts to refine and enhance its AI capabilities, as the company continues to improve both functionality and security measures in its products.
OpenAI’s rollout of the advanced voice mode for a select group of ChatGPT Plus users is a significant development in AI technology. While the initiative brings exciting new features, it also raises several concerns and points for consideration.
Advancements and Concerns
The introduction of real-time voice interaction in ChatGPT represents a notable advancement in artificial intelligence. This feature enhances user experience by making conversations with the AI more natural and engaging. The ability to interrupt the AI while it is speaking adds a layer of realism that has been challenging to achieve in AI assistants.
However, this progress is not without its challenges. One major concern is the potential for misuse of voice technology. As with text-based interactions, voice mode can be susceptible to generating harmful or inappropriate content. OpenAI has acknowledged this risk and delayed the feature’s launch to improve the model’s ability to detect and refuse such content. While this is a positive step, it highlights the ongoing struggle to balance innovation with safety and ethical considerations in AI development. As OpenAI starts the roll-out of advanced voice mode to some ChatGPT Plus users, the company emphasizes improved safety measures for voice interactions.
The involvement of over 100 external “red teamers” to test the system’s security is a commendable effort by OpenAI to preemptively identify vulnerabilities. However, the challenge remains in translating these tests into foolproof real-world applications.
Moreover, the decision to test the feature with a limited group of users, without specifying clear eligibility criteria, raises questions about the inclusivity and transparency of the process. The “alpha” rollout phase suggests a cautious approach, likely aimed at minimizing risks and gathering user feedback. However, it also limits broader public scrutiny and feedback, which are crucial for identifying potential flaws and ensuring the technology’s robustness.
Also Read: AMD Is Becoming an AI Chip Company: The Tech Giant’s Bold Shift into the AI Market.