The rivalry between Google and OpenAI over AI models has intensified once again. Less than a day after OpenAI's latest GPT-4o topped a model leaderboard, Google launched its newest experimental model, Gemini-Exp-1121, and reclaimed the top position.
According to reports, Gemini-Exp-1121 brings improved capabilities in coding, reasoning, and visual comprehension. In the leaderboard's evaluations, the model ranked first in every category except style control. Its visual capabilities in particular have improved significantly over those of its predecessor.
Gemini-Exp-1121 has also performed strongly in hands-on use. For example, when both models were asked to interpret the same comic strip, Gemini-Exp-1121 gave a more comprehensive and detailed response, using subheadings and bold text to highlight key points. The updated GPT-4o's response, by contrast, was relatively brief and general.
Gemini-Exp-1121 has also performed well in logical reasoning. On a classic river-crossing logic puzzle involving animals, the model answered entirely correctly, whereas the updated GPT-4o made some errors.
There are new developments from OpenAI as well. In the latest test version of ChatGPT, users have discovered code for a "Live Camera" video feature. The code references real-time recording, live processing, voice mode integration, and visual recognition, which suggests that OpenAI is preparing to launch the feature.
Google has shown a similar demo, though it has not officially released the feature. Given OpenAI's typical approach, OpenAI is likely to roll the feature out widely before Google does.
As the technology advances, AI models will find more applications across a range of sectors, and the fierce competition between Google and OpenAI is accelerating that progress. In the future, people may interact with chatbots more through voice and agents, and the Live Camera feature may mark the beginning of this trend.