The Best AI Model in the World: Google DeepMind’s Gemini Has Surpassed GPT-4
This article is a selection from The Algorithmic Bridge, an educational project aimed at bridging the gap between AI and people. Join me to learn how to navigate a future that has already arrived.
A few hours ago Google and Google DeepMind announced their much-awaited AI model, Gemini. There’s still not much hands-on feedback on how well it works, but the reported performance is outstanding.
This article is a quick overview (divided into easy-to-skim sections) of the info we have so far and my first impressions from what I’ve read (haven’t tested the model yet). I’ll go deeper over the coming days as we acquire a better understanding of what Gemini can do, how it’s built (hopefully), how it works, and what it means for the future of AI.
Here’s the outline:
- Gemini specs, sizes (Ultra, Pro, Nano), and availability.
- Gemini Ultra is better than GPT-4.
- Gemini is natively multimodal.
- My first impressions from the available info.
Gemini specs, sizes (Ultra, Pro, Nano), and availability
Gemini is a family of models that comes in three sizes: Ultra, Pro, and Nano. Here’s a summary of the technical report’s section on Gemini sizes and their specifications.
Gemini Ultra is the version that achieves state-of-the-art (SOTA) benchmarking and surpasses GPT-4 across benchmarks (as we’ll see soon). It’s designed to run on data centers so you won’t install this one on your home computer. It’s still under red-teaming safety review but it will be available in early 2024 on a new version of Google’s chatbot, Bard Advanced.
Gemini Pro is comparable to GPT-3.5 (not always better, though) and it’s optimized for “cost as well as latency.” If you don’t need the best of the best and costs are a constraint, Pro is probably a better choice than Ultra (just like ChatGPT with GPT-3.5 being free is, for most tasks, better than paying $20/month for GPT-4). Gemini Pro is already available on Bard (“its biggest upgrade yet”) in 170…