
Gemini Flash

Lightweight, fast and cost-efficient models featuring multimodal reasoning and a breakthrough long context window of up to one million tokens.

Small, smaller

Flash now comes in two compact variants, giving you flexibility for whatever you choose to build.

Performance in a flash

Designed to be fast and efficient to serve at scale.

Longer context

Flash models have a one-million-token context window by default, which means you can process one hour of video, 11 hours of audio, codebases with more than 30,000 lines of code, or over 700,000 words.
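
As a rough illustration of those capacity figures, the sketch below turns them into a back-of-the-envelope token estimate. The per-word and per-second rates are simply derived from the numbers quoted above (about 700,000 words, 11 hours of audio, or one hour of video per one million tokens) and are assumptions, not official tokenizer rates. The helper names `estimate_tokens` and `fits_in_context` are hypothetical and not part of any Gemini SDK.

```python
# Capacity figures quoted in the text above. The rates derived from them
# are approximations for planning only, not official tokenizer numbers.
CONTEXT_WINDOW_TOKENS = 1_000_000
WORDS_PER_WINDOW = 700_000       # "over 700,000 words"
AUDIO_HOURS_PER_WINDOW = 11      # "11 hours of audio"
VIDEO_HOURS_PER_WINDOW = 1       # "one hour of video"
SECONDS_PER_HOUR = 3600


def estimate_tokens(words: int = 0,
                    audio_seconds: float = 0.0,
                    video_seconds: float = 0.0) -> int:
    """Hypothetical helper: estimate token usage for a mixed-media input."""
    # Each term scales the input by (window size / stated capacity).
    text = words * CONTEXT_WINDOW_TOKENS / WORDS_PER_WINDOW
    audio = (audio_seconds * CONTEXT_WINDOW_TOKENS
             / (AUDIO_HOURS_PER_WINDOW * SECONDS_PER_HOUR))
    video = (video_seconds * CONTEXT_WINDOW_TOKENS
             / (VIDEO_HOURS_PER_WINDOW * SECONDS_PER_HOUR))
    return round(text + audio + video)


def fits_in_context(estimated_tokens: int) -> bool:
    """Return True if the estimate is within the one-million-token window."""
    return estimated_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    # Example: a 200,000-word corpus plus a 20-minute video.
    estimate = estimate_tokens(words=200_000, video_seconds=20 * 60)
    print(estimate, fits_in_context(estimate))
```

An estimate like this is only useful for early planning. For real requests, count tokens with the tooling provided by the platform you build on.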

Relentless innovation

Our research team is continually exploring new ideas at the frontier of AI, building innovative products that show consistent progress on a range of benchmarks.

Research

Technical reports

For developers

Build with Gemini

Integrate Gemini models into your applications with Google AI Studio and Google Cloud Vertex AI.