TLDR
- Gemini 3 Flash is now the default model in the Gemini app, replacing Gemini 2.5 Flash for better performance and efficiency.
- The model outperforms its predecessor in benchmarks, including Humanity’s Last Exam and MMMU-Pro, showcasing enhanced reasoning.
- Gemini 3 Flash is priced lower than earlier models, processing tasks three times faster while using 30% fewer tokens.
- The model excels in multimodal content understanding, allowing users to interact with videos, sketches, and audio recordings.
- Gemini 3 Flash is available for developers through Google’s API and Antigravity platform, supporting high-frequency workflows and data analysis.
Google has unveiled its latest AI model, Gemini 3 Flash, positioning it as the default model within the Gemini app. The model, part of Google’s Gemini 3 series, offers impressive speed, efficiency, and multimodal capabilities at a fraction of the cost of previous models.
Gemini 3 Flash: Superior Speed and Performance
Gemini 3 Flash offers an upgrade over its predecessor, Gemini 2.5 Flash, with major improvements in speed and cost-efficiency. The new model outperforms the previous one in benchmarks, including Humanity’s Last Exam and the MMMU-Pro multimodal reasoning test. For example, Gemini 3 Flash scored 33.7% on the Humanity’s Last Exam, compared to 11% by Gemini 2.5 Flash and 37.5% by Gemini 3 Pro, showcasing its advanced reasoning capabilities.
The model’s efficiency shines through in both performance and cost, processing tasks three times faster than the Gemini 2.5 Pro at a lower price. Google has priced Gemini 3 Flash at $0.50 per million input tokens and $3.00 per million output tokens, offering a more affordable option while maintaining high performance. The model uses 30% fewer tokens on average for certain tasks, which makes it an attractive option for companies seeking bulk processing at lower costs.
Gemini 3 Flash Now the Default Model in the Gemini App
As part of its consumer rollout, Google has made Gemini 3 Flash the default model in the Gemini app. This change replaces the older Gemini 2.5 Flash model, making Gemini 3 Flash accessible to millions of users globally. The new model excels in multimodal content understanding, enabling users to upload videos, sketches, or audio recordings for analysis or interaction, further improving the app’s versatility.
Google also introduced a range of features to enhance user interaction. The model can now better understand the intent behind user queries and generate responses that include visual elements such as images and tables. This makes Gemini 3 Flash more intuitive and capable of handling a broader range of tasks, from everyday queries to more complex multimodal requests.
Enterprise and Developer Access
In addition to consumer use, Gemini 3 Flash is available for enterprises and developers through Google’s Vertex AI and Gemini Enterprise platforms. Companies like JetBrains, Figma, and Harvey are already utilizing the model for various applications. For developers, the model is available through Google’s API and the Antigravity platform, which allows for rapid development of coding solutions and agentic workflows.
Gemini 3 Flash’s capabilities extend to high-frequency workflows, video analysis, and data extraction, making it a robust tool for enterprises looking to enhance their AI-powered systems. Google has also positioned the model as ideal for quick, repeatable tasks, such as video analysis and visual Q&A, which require fast, intelligent responses.


