Gemini

The most general and capable AI models we've ever built.

Our most flexible models yet

Each Gemini model is built for its own set of use cases, making a versatile model family that runs efficiently on everything from data centers to individual devices.

Project Astra

Project Astra explores the future of AI assistants. Building on our Gemini models, we’ve developed AI agents that can quickly process multimodal information, reason about the context you’re in, and respond to questions at a conversational pace, making interactions feel much more natural.

The demo shows two continuous takes: one with the prototype running on a Google Pixel phone and another on a prototype glasses device.

Natively multimodal

Gemini models are built from the ground up for multimodality, seamlessly combining and understanding text, code, images, audio, and video.

The following content is a visual and descriptive representation of Gemini's functionality:

Gemini models can generate code based on different kinds of inputs.

Gemini

I see a murmuration of starlings, so I coded a flocking simulation.

Gemini models can generate text and images, combined.

Could Gemini show me ideas for what to make?

Gemini

How about an octopus with blue and pink tentacles?

Gemini models can understand and perform tasks involving several different written languages.

Could Gemini explain what this means?

Gemini

I see the time signature is 6/8. This means there are 6 eighth notes in each measure.

The dynamic marking is piano, which means to play softly. Andante grazioso means to play at a graceful walking pace.

Longer context

1.5 Pro and 1.5 Flash both have a default context window of up to one million tokens, the longest context window of any large-scale foundation model. They achieve near-perfect recall on long-context retrieval tasks across modalities, unlocking the ability to process long documents, thousands of lines of code, hours of audio, video, and more. For 1.5 Pro, developers and enterprise customers can also sign up to try a two-million-token context window.
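
To give a feel for what a one-million-token window means in practice, here is a minimal sketch that estimates whether a piece of text would fit. It assumes roughly four characters per token, which is only a coarse rule of thumb for English text and not the tokenizer the Gemini models actually use. The function names and the `reserve` parameter are hypothetical and exist only for this illustration.

```python
# Rough check of whether a text input fits in a context window.
# ASSUMPTION: ~4 characters per token is a coarse heuristic for English text;
# it is NOT the tokenizer used by Gemini models, so treat results as estimates.

CONTEXT_WINDOW_TOKENS = 1_000_000   # default window stated in the text above
EXTENDED_WINDOW_TOKENS = 2_000_000  # sign-up window for 1.5 Pro stated above
CHARS_PER_TOKEN = 4                 # hypothetical heuristic, not an official figure


def estimate_tokens(text: str) -> int:
    """Estimate the token count of `text`, rounding up."""
    return -(-len(text) // CHARS_PER_TOKEN)  # ceiling division


def fits_in_window(text: str,
                   window: int = CONTEXT_WINDOW_TOKENS,
                   reserve: int = 0) -> bool:
    """Return True if the estimated tokens plus `reserve` fit in `window`.

    `reserve` is a hypothetical allowance for the prompt and the response.
    """
    return estimate_tokens(text) + reserve <= window


if __name__ == "__main__":
    document = "word " * 500_000  # about 2.5 million characters
    print(estimate_tokens(document))
    print(fits_in_window(document))
```

For real workloads, an exact count from the model's own tokenizer should replace the heuristic, since token counts vary with language and content type.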

Research

Relentless innovation

Our research team is continually exploring new ideas at the frontier of AI, building innovative products that show consistent progress on a range of benchmarks.

Technical reports

For developers

Build with Gemini

Integrate Gemini models into your applications with Google AI Studio and Google Cloud Vertex AI.

Try the models

Get started

Example prompts for the Gemini API in Google AI Studio.

Responsibility at the core

Our models undergo extensive ethics and safety tests, including adversarial testing for bias and toxicity.

Hands-on

Serving billions of Google users

Gemini models are embedded in a range of Google experiences.

What's new