OpenAI is expanding ChatGPT’s capabilities with a new advanced voice mode for premium subscribers. The update, announced on Tuesday, enables more natural, fluid, real-time voice interactions. The feature is available to subscribers on OpenAI’s Plus, Team, or Enterprise plans, though it has yet to reach certain regions, including the EU, U.K., Switzerland, and other neighboring countries.
Users who have access to the advanced voice feature can now engage in real-time conversations with ChatGPT. Unlike previous voice functions, this version allows for more dynamic interactions: users can interrupt ChatGPT mid-sentence to speed up or redirect the conversation as needed.
The rollout began earlier this week and will continue progressively. OpenAI has not yet given exact dates for availability in all regions, but it has indicated that users can expect access by the end of fall.
Who Can Access the Feature?
Currently, only paying subscribers on the Plus, Team, and Enterprise plans can access the advanced voice mode. The most affordable plan, Plus, is priced at $20 per month. OpenAI has stated that educational users will also gain access soon, though free-tier users cannot use the feature at this time.
To use the feature, users need to ensure their ChatGPT app is updated. Once the feature becomes available to them, a notification will appear in the app, allowing them to begin voice conversations. Voice mode can be accessed by tapping the sound wave icon next to the microphone in the chat interface.
Enhancements and Limitations
Advanced voice mode is part of OpenAI’s ongoing efforts to enhance AI-powered user experiences. One of the major upgrades is improved responsiveness: ChatGPT now listens more attentively, pausing and responding immediately if interrupted mid-sentence. Users can also customize how ChatGPT speaks, asking it to adjust speed, tone, or even mimic different accents.
However, the feature is not without limits. After about 30 minutes of usage, the app may display a warning that 15 minutes remain, which suggests a cap on voice usage.
Additionally, early testers have noticed occasional issues, such as static noise or glitches in long conversations. OpenAI has acknowledged these problems, and improvements, including enhanced accents and language options, are part of the ongoing rollout.
The launch of this feature highlights growing competition in the generative AI space. OpenAI’s ChatGPT, which has maintained a strong lead since its launch in 2022, is facing competition from Google and Meta. Google’s Gemini Live, a voice feature for its chatbot, recently debuted on Android devices. Meta is expected to introduce celebrity voices on its platforms, including Instagram, WhatsApp, and Facebook, later this week.
Despite the competition, ChatGPT’s user base remains strong, with over 200 million weekly active users.
Voice Impressions and Safety Concerns
One of the more novel aspects of OpenAI’s advanced voice mode is its ability to mimic famous voices. ChatGPT can generate vocal impressions of well-known figures like Donald Trump and Joe Biden, though accuracy varies. OpenAI has implemented safety measures to avoid unauthorized usage of voice likenesses, particularly in cases where legal issues may arise.
In May, OpenAI received criticism after its voice feature bore a striking resemblance to Scarlett Johansson’s character from the movie Her. Legal representatives for Johansson contacted OpenAI, resulting in a pause in the voice’s usage, according to CNBC. Since then, the company has rolled out the voice feature without such direct similarities.
Despite the legal challenges, users have described the advanced voice mode as highly entertaining, with the chatbot capable of delivering a wide range of emotional tones and accents. While singing remains off-limits due to copyright concerns, ChatGPT can perform humorous impressions, including exaggerated versions of animated characters and public figures.