Google Gemini 3.5 Flash Launches at I/O With 4x Speed Gain

May 20, 20263 min read

TL;DR

Gemini 3.5 Flash debuts at Google I/O 2026 with four-times faster output, agentic multi-hour task handling, and a redesigned app shipping globally today.

Google's fastest model yet just became its most capable. Gemini 3.5 Flash, unveiled Tuesday at Google I/O 2026, generates output tokens four times faster than competing frontier models while undercutting their price by roughly half -- a combination the company says makes it the strongest agentic and coding model it has shipped.

Rollout began immediately. Google set Gemini 3.5 Flash as the default in both the Gemini app and Search's AI Mode, putting it in front of a user base the company pegged at more than 900 million monthly active users across 230 countries, as Android Authority detailed.

The timing is pointed. Gemini 3 landed in November 2025; version 3.1 followed in February 2026. That accelerated cadence puts Google head-to-head against OpenAI's GPT-5.5 and Anthropic's Claude Opus 4.7, CNET reported, with speed and cost as the differentiators rather than raw benchmark scores alone.

Agentic ambitions

Flash's headline claim is endurance. Google DeepMind CTO Koray Kavukcuoglu said the model can handle autonomous sessions lasting several hours, completing full coding research projects without human intervention. That threshold matters in the expanding artificial intelligence agent market, where rivals have already captured developer attention with autonomous platforms.

Sundar Pichai, briefing reporters ahead of the keynote, called the model a "game changer" internally and said it operates at roughly half the cost of competing systems -- a figure developers building on the API will scrutinize closely. Unusually, Google claims Flash outperforms Gemini 3.1 Pro on both coding and agentic benchmarks, reversing the normal hierarchy between the Flash and Pro tiers.

Safety received a parallel overhaul. The Gemini 3.5 family includes new interpretability tools that examine the model's internal reasoning before it generates a response, Mashable reported, reducing harmful outputs and the false refusals that have frustrated users on current-generation systems.

The Spark question

Gemini Spark is the most ambitious product riding Gemini 3.5's capabilities. Google describes it as a 24/7 personal AI agent integrated across its full product suite, capable of taking action on a user's behalf rather than simply surfacing information. TechCrunch reported a staged rollout: trusted testers get access this week, followed by a beta for AI Ultra subscribers the week after.

The cautious pacing signals Google is aware of what it is building. Autonomous agents with persistent access to email, calendar, and third-party services create novel attack surfaces and raise data governance questions that benchmark scores do not answer. Details about Spark's sandboxing architecture and permission model are not yet public.

Daily Brief, a narrower agentic feature, launches today for paid Google AI subscribers in the United States. It aggregates Gmail and Calendar data into a morning digest, prioritizes tasks, and proposes next steps -- a lower-stakes debut than Spark but a clear statement of direction, Engadget noted.

The platform play

Carrying all of it is a redesigned Gemini app, rebuilt from the ground up and shipping today on Android and iOS. A new design language called "Neural Expressive" brings fluid animations and haptic feedback; responses now lead with key information rather than walls of text. Live voice mode integrates directly into the main interface, letting users switch between typing and speaking without navigating a separate screen.

Gemini Omni, announced at the same event, adds multimodal video generation -- users can supply text, audio, images, and video together and receive realistic video output in return. DeepMind CEO Demis Hassabis described it as a pivotal step toward artificial general intelligence, a phrase the industry hears often but one that still moves audiences when a DeepMind founder says it. Omni Flash is available today for paid subscribers and arrives in YouTube Shorts later this week at no cost.

What Google is really announcing is a platform bet. It has rebuilt its consumer app, launched a new model family, staged an autonomous agent, and updated its search product in a single morning -- all coordinated around Gemini 3.5 Flash as the backbone. Gemini 3.5 Pro, confirmed to be in internal testing, arrives next month. If it extends Flash's benchmark advantage into the premium tier, the pressure on OpenAI and Anthropic intensifies considerably.

Google owns the distribution. The question is whether that is enough to own the trust.

FAQ

1. What is Gemini 3.5 Flash and how fast is it?
Gemini 3.5 Flash is Google's new speed-focused model, announced at Google I/O on May 19, 2026. Google says it produces output tokens four times faster than other frontier models and costs roughly half as much.

2. When does Gemini 3.5 Pro release?
Google confirmed Gemini 3.5 Pro is in internal testing and is scheduled to roll out next month, following Flash's launch.

3. What is Gemini Spark and when can I use it?
Gemini Spark is a 24/7 personal AI agent that can take actions across Google's product suite. A small group of trusted users gets access this week; an AI Ultra subscriber beta follows the week after.

4. How does Gemini 3.5 Flash compare to GPT-5.5 and Claude Opus 4.7?
Google positions Flash as comparable on intelligence benchmarks to both models while being faster and cheaper to run. Independent third-party evaluations have not yet been published.