OpenAI has added video streaming and screen-sharing capabilities to its Advanced Voice Mode. These features, launched as part of OpenAI’s “12 Days of Shipmas” event, enhance ChatGPT’s interactivity by enabling real-time video and screen-sharing options. With the latest update, ChatGPT gets screen sharing and real-time video analysis, offering enhanced support for tasks like tech troubleshooting and DIY projects.
Video streaming can now be activated on the ChatGPT mobile app by tapping the voice icon, followed by the video button. Screen sharing is accessed through the three-dot menu, allowing users to navigate outside the app and share their phone screens with the AI.
These updates let ChatGPT respond to visual input in real time, functioning like a virtual assistant. It can identify objects, provide instructions, and even interact conversationally. For example, during a demo, ChatGPT guided a coffee-making process by observing the tools and offering step-by-step advice.
Practical Applications
The video and screen-sharing tools have various practical uses:
-
DIY Assistance:
Users can receive real-time instructions for assembling furniture or fixing devices.
-
Tech Support:
The AI helps troubleshoot issues by analyzing the shared screen.
-
Cooking Guidance:
ChatGPT offers suggestions by watching cooking processes.
-
Educational Aid:
Students can share diagrams or math problems and seek solutions interactively.
Availability and Privacy Features
OpenAI’s new update ensures ChatGPT gets screen sharing and real-time video analysis, making it a versatile tool for both personal and professional use. These features are rolling out to ChatGPT Plus, Pro, and Teams users on iOS and Android apps. ChatGPT Enterprise and Education users will receive access in January. To protect privacy, the video option requires activation for each session to prevent accidental sharing.
Both platforms target mobile users, aiming to improve AI capabilities for on-the-go assistance.
Holiday Cheer with Santa Mode
As part of the 12 Days of Shipmas event, ChatGPT gets screen sharing and real-time video analysis, showcasing OpenAI’s focus on advanced AI capabilities. OpenAI has also introduced “Santa Mode,” a voice preset that mimics Santa Claus. This feature is available in Advanced Voice Mode on mobile, web, and desktop versions until January. Conversations in Santa Mode are not saved in the chat history, maintaining user privacy.
These advancements mark a significant leap toward more immersive AI interactions. The ability to process visual input in real-time could lead to breakthroughs in education, technical support, and collaborative work. OpenAI’s updates signal a step closer to fully interactive and context-aware AI tools.
Google’s Gemini 2.0 and Project Mariner
Google’s Gemini 2.0 offers similar advancements. Available across all subscription tiers, it integrates real-time video and image analysis. Project Astra combines Gemini’s conversational abilities with smart glasses for enhanced environmental awareness.
Additionally, Google launched Project Mariner, which allows AI to perform tasks like clicking and typing on a desktop. A lightweight version of Gemini, called Gemini Flash, was introduced for developers. The Gemini ecosystem now includes tools like Jules, an AI coding assistant, and “Deep Research,” a feature for generating detailed reports.
These innovations reflect the growing competition between OpenAI, Google, and Microsoft in creating smarter, more versatile AI tools.
OpenAI’s privacy safeguards, like requiring users to manually activate video sharing, are a step in the right direction. Yet, real-time visual data sharing introduces significant ethical concerns. Users may inadvertently expose sensitive personal or professional information. OpenAI’s assurances about privacy controls must be matched with clear, transparent policies to build user trust.
Moreover, the competitive race among tech giants like Google and Microsoft highlights the broader challenge of responsibly integrating AI into daily life. Features like Project Astra and Copilot Vision are pushing the boundaries of what AI can do, but this innovation comes with a need for rigorous oversight to prevent misuse.
Also Read: Russia Teams Up with BRICS to Create AI Alliance for Global Power.