Google’s experimental Gemini 1.5 Pro model has outperformed OpenAI’s GPT-4o in generative AI benchmarks.
Over the past year, OpenAI’s GPT-4o and Anthropic’s Claude-3 have been the frontrunners in the field. However, the latest iteration of Gemini 1.5 Pro has now taken the lead.
The LMSYS Chatbot Arena is a well-recognized benchmark in the AI community, used to evaluate models on various tasks and assign an overall competency score. In this ranking, GPT-4o scored 1,286, while Claude-3 achieved a respectable 1,271. The previous version of Gemini 1.5 Pro had a score of 1,261.

The experimental release of Gemini 1.5 Pro (known as Gemini 1.5 Pro 0801) surpassed its competitors with an impressive score of 1,300. This notable improvement indicates that Google’s latest model may have superior overall capabilities compared to its rivals.
While benchmarks offer valuable insights into an AI model’s performance, they may not always fully represent its capabilities or limitations in real-world scenarios.
Although Gemini 1.5 Pro is currently available, its status as an early release or in testing suggests that Google may still make adjustments or potentially withdraw the model for safety or alignment reasons.
This advancement signifies a significant milestone in the competitive race for AI dominance among tech giants. Google’s ability to surpass OpenAI and Anthropic in benchmark scores showcases the rapid pace of innovation in the field and the intense competition driving these advancements.
As the AI landscape continues to progress, it will be intriguing to see how OpenAI and Anthropic respond to this challenge from Google. Can they regain their top positions on the leaderboard, or has Google set a new standard for generative AI performance?
(Photo by Yuliya Strizhkina)
See also: Meta’s AI strategy: Building for tomorrow, not immediate profits

Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is co-located with other leading events including Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.
Explore other upcoming enterprise technology events and webinars powered by TechForge here.