Google's Interactive Gemini Sparks Multi-Modal AI Battle with OpenAI

2024-05-14

Google has released a teaser video showcasing its AI chatbot Gemini's ability to easily respond to voice and video inputs. This teaser was released just a few hours before OpenAI showcased ChatGPT-4o, which demonstrated impressive conversational fluency and understanding. The AI chatbot battle is heating up, with OpenAI and Google engaged in a fierce competition for dominance. Just hours before Google's annual I/O conference, the tech giant released a video showcasing what appears to be an updated version of the Gemini chatbot, demonstrating its ability to handle real-time video inputs and voice prompts. The demo video seems to have been shot during the preparations for the I/O conference, showcasing Gemini's performance on Pixel devices. It seamlessly handles real-time video and voice queries, providing accurate information. When asked about the ongoing preparations, Gemini recognizes them as arrangements for a large-scale event. The conversation flows naturally, with Gemini asking the user about noticed details and ultimately revealing that the event is the Google I/O conference, providing a concise description. Google's timing for releasing this teaser video is precise, as it was released a few hours before the highly anticipated upgrade of OpenAI's ChatGPT. If you haven't heard yet, OpenAI's ChatGPT-4o demo is incredibly impressive, showcasing the AI's unparalleled fluency, coherence, understanding, and conversational abilities. While past experiences may make us skeptical of demo videos, today's demonstration undoubtedly confirms the seamless integration of voice and video inputs in these AI chatbots. This multimodal capability, combined with extremely natural and fluent conversations, gives us a glimpse into the future of AI interaction. With anticipation building up, we eagerly await Google's I/O keynote tomorrow, where the company is expected to unveil new AI features and updates for Gemini.