Google Gemini 3.5 Pro targets June release with 2M-token context

June 9, 2026 · 3 min read

TL;DR

Google's Gemini 3.5 Pro enters developer preview on Vertex AI with a 2-million-token context and new reasoning engine ahead of its June public launch.

Google previewed Gemini 3.5 Pro on May 19, and Sundar Pichai left developers with a single-sentence timeline at the launch: "Give us until next month to get it to you." That puts the public rollout in June, a window AI teams are now tracking closely.

The model occupies the top of the Gemini 3.5 family under a tier Google calls "Flagship Frontier Intelligence" and is currently accessible through Google Cloud's Vertex AI platform in a limited developer preview. No formal press release has accompanied the rollout; specifics are filtering through industry reports and early testers rather than official documentation.

The 2-million-token pitch

The headline number is the context window: 2 million tokens. That is enough to process roughly 1,500 pages of dense technical text in a single inference call. Most competing frontier models fall well short of that figure, making context length one of the clearest differentiators Google is advertising ahead of general availability, according to herzindagi.com.
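The page figure can be sanity-checked with a little arithmetic. The sketch below is illustrative only: the tokens-per-page densities are assumptions chosen for the example, not numbers published by Google, and real counts depend on the tokenizer and the text.

```python
# Back-of-envelope estimate: how many pages fit in a context window.
# The per-page token densities used below are illustrative assumptions.

CONTEXT_WINDOW_TOKENS = 2_000_000  # context size reported for Gemini 3.5 Pro


def pages_that_fit(context_tokens: int, tokens_per_page: int) -> int:
    """Return the number of whole pages that fit in the context window."""
    if tokens_per_page <= 0:
        raise ValueError("tokens_per_page must be positive")
    return context_tokens // tokens_per_page


def implied_tokens_per_page(context_tokens: int, pages: int) -> float:
    """Return the per-page token density implied by a page-count claim."""
    if pages <= 0:
        raise ValueError("pages must be positive")
    return context_tokens / pages


if __name__ == "__main__":
    # Density implied by the "roughly 1,500 pages" figure.
    print(round(implied_tokens_per_page(CONTEXT_WINDOW_TOKENS, 1_500)))
    # Page counts under two assumed densities (hypothetical values).
    for density in (700, 1_300):
        print(density, pages_that_fit(CONTEXT_WINDOW_TOKENS, density))
```

Under these assumptions, the 1,500-page figure corresponds to pages carrying about 1,300 tokens each; lighter pages would push the page count higher.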

Alongside the extended context, the model ships with a "Deep Think" reasoning engine. Rather than predicting the next token from pattern matching alone, Deep Think is described as tracking multi-step logic, checking intermediate conclusions, and verifying its own reasoning chain before producing a final output. Practical targets include complex mathematics, framework design, and multi-step problems where prior Gemini versions stumbled. Its predecessor in the Gemini 3.5 line, Flash, prioritized execution speed; Pro is designed to invert that trade-off and restore Google's standing at the top of AI benchmarks.

The competitive pressure

Google is not moving in a vacuum. CNBC reported in March that OpenAI is targeting an IPO as soon as the fourth quarter of 2026 and is "orienting aggressively" toward enterprise productivity use cases. ChatGPT now logs 900 million weekly active users, and OpenAI's CEO of Applications, Fidji Simo, framed the next phase as converting that base into high-compute customers.

Context windows have become a proxy metric in the evaluations that enterprise buyers run before committing to a platform. Longer context means fewer chunking workarounds, cheaper retrieval pipelines, and the ability to drop entire codebases or legal documents into a single prompt. Google's 2-million-token ceiling, if it holds in production workloads, is a direct argument against migrating to competing enterprise tiers.
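To make the chunking point concrete, here is a minimal sketch of how many separate calls a document needs under different context limits. The 128,000-token limit is an assumed comparison point rather than any specific competitor's figure, and `reserved_tokens` is a hypothetical budget held back for instructions and output.

```python
import math


def calls_needed(doc_tokens: int, context_limit: int, reserved_tokens: int = 8_000) -> int:
    """Estimate how many chunks a document must be split into.

    reserved_tokens is a hypothetical allowance for the prompt
    instructions and the model's output; it is not a vendor figure.
    """
    if doc_tokens < 0:
        raise ValueError("doc_tokens must be non-negative")
    usable = context_limit - reserved_tokens
    if usable <= 0:
        raise ValueError("context_limit must exceed reserved_tokens")
    # Even an empty document still costs one call.
    return max(1, math.ceil(doc_tokens / usable))


if __name__ == "__main__":
    doc_tokens = 1_500_000  # e.g. a large codebase, size assumed for illustration
    for limit in (128_000, 2_000_000):
        print(limit, calls_needed(doc_tokens, limit))
```

Each extra chunk usually also means extra orchestration: overlap between chunks, a merge step for partial answers, and more places for context to get lost. The sketch counts calls only and ignores those costs.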

Policy currents

The regulatory environment is shifting around these launches simultaneously. David Sacks, recently the White House Special Advisor for AI and Crypto, publicly dismissed heightened safety concerns as "Hollywood storytelling" just six days after President Trump signed an executive order requesting voluntary model submissions for federal safety testing, as Yahoo News reported. His framing signals that the administration's practical stance on AI oversight is looser than the executive order's text implies.

Microsoft is reading the same moment differently. Sarah Bird, the company's chief product officer for responsible AI, told Mint that oversight builds trust without necessarily constraining engineering teams. For frontier labs, the result is a policy vacuum where rules are still being written while the models are already shipping.

What developers should watch

Vertex AI is the practical entry point today. Teams wanting early access to stress-test the 2-million-token window can apply through Google Cloud before the public launch. Whether Deep Think's reasoning verification holds up outside controlled benchmarks is the more important open question; independent evaluations will ultimately determine whether the Flagship Frontier label is earned or just marketing.

Pricing remains undisclosed. At 2 million tokens per context call, inference costs will be a deciding factor for any team weighing Gemini 3.5 Pro against shorter-context alternatives. Pichai's June timeline means those numbers are due soon. If benchmarks confirm what Google is implying, this is the most capable model the company has shipped since Gemini 1.0. If they do not, the Flagship Frontier tier will need a new argument.
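Until prices are published, teams can still model the cost question with placeholder rates. The sketch below assumes a simple per-million-token pricing scheme; the rates in the usage example are made-up placeholders, not Gemini 3.5 Pro prices.

```python
def cost_per_call(
    input_tokens: int,
    output_tokens: int,
    usd_per_million_input: float,
    usd_per_million_output: float,
) -> float:
    """Estimate the cost of one call under per-million-token pricing.

    All rates are caller-supplied assumptions; no real price list is used.
    """
    if min(input_tokens, output_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * usd_per_million_input
        + output_tokens * usd_per_million_output
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Placeholder rates for illustration only (USD per million tokens).
    full_window = cost_per_call(2_000_000, 4_000, 1.00, 4.00)
    short_prompt = cost_per_call(100_000, 4_000, 1.00, 4.00)
    print(f"full window: ${full_window:.3f}, short prompt: ${short_prompt:.3f}")
```

Swapping in real rates once they are announced shows quickly whether filling the whole window on every call is affordable for a given workload.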

FAQ

What is the Gemini 3.5 Pro context window size?
Gemini 3.5 Pro supports up to 2 million tokens per inference call, well above most competing frontier models currently on the market.

When does Gemini 3.5 Pro go public?
Sundar Pichai committed to a June 2026 public release at the May 19 launch. No specific date within the month has been announced.

How does Deep Think differ from standard Gemini reasoning?
Deep Think performs multi-step logical tracking and verifies intermediate conclusions before generating a response, rather than relying on pattern-based next-token prediction alone.

Where can developers access Gemini 3.5 Pro before the public launch?
A limited developer preview is available now through Google Cloud's Vertex AI platform.
