Gemini Live, Google’s response to Advanced Voice Mode for OpenAI’s ChatGPT, is set to launch on Tuesday after being announced at Google’s I/O 2024 developer conference and later at the Made by Google 2024 event.
Gemini Live lets users hold in-depth voice conversations with Gemini, Google’s generative AI-powered chatbot, on their smartphones. The enhanced speech engine provides consistent, emotionally expressive, and realistic multi-turn dialogues. Users can interrupt Gemini to ask follow-up questions, and the chatbot adapts to their speech patterns in real time.
According to Google, “With Gemini Live [via the Gemini app], you can talk to Gemini and choose from [10 new] natural-sounding voices it can respond with. You can even speak at your own pace or interrupt mid-response with clarifying questions, just like you would in any conversation.”
Gemini Live supports hands-free interaction: users can continue a conversation while the app runs in the background or even when the phone is locked. Conversations can be paused and resumed at any time.
One potential advantage of Gemini Live over ChatGPT’s Advanced Voice Mode is better memory retention. This is attributed to the architecture of the underlying generative AI model, which supports longer context windows and can therefore process and reason over a significant amount of data before responding.
Gemini Live is part of Gemini Advanced, a premium service gated behind the Google One AI Premium Plan, which costs $20 per month. Multimodal input, additional languages, and iOS support are planned for later this year.
Other new features include a Gemini overlay that Android users can access on top of any app, image generation, and integrations with various Google services. Gemini will also be available on Android tablets starting later this week.