OpenAI took the wraps off ChatGPT’s long-promised video capabilities Thursday, letting users point their phones at objects for real-time AI analysis—a feature that’s been gathering dust since its first demo in May.

Previously, you could input text, charts, voice, or still photos and interact with GPT. This feature, released late Thursday, allows GPT to watch you in real time and conversationally provide feedback. For instance, in my tests, this mode was able to solve math problems, give food recipes, tell stories, and even turn itself into my daughter’s new best friend, interacting with her while making pancakes, giving suggestions and encouraging her learning process through different games.

The release comes just a day after Google showed its own take on a camera-enabled AI assistant powered by the newly minted Gemini 2.0. Meta’s been playing in this sandbox too, with its own AI that can see and chat through phone cameras.

ChatGPT’s new tricks aren’t for everyone though. Only Plus, Team, and Pro subscribers can access what OpenAI calls “Advanced Voice Mode with vision.” The Plus subscription costs $20 a month, and the Pro tier costs $200.

“We’re excited to announce that we’re bringing video to Advanced voice mode so you can bring live video and also live screen sharing into your conversations with ChatGPT,” Kevin Weil, OpenAI’s Chief Product Officer, said in a video Thursday.

Go to Source to See Full Article
Author: Jose Antonio Lanz

BTC NewswireAuthor posts

BTC Newswire Crypto News at your Fingertips

Comments are disabled.