Gemini 2.0 Flash (by Google)
Overview
Google’s latency-optimized Gemini model for everyday text and multimodal tasks (high-speed multimodal responses).
Quick facts
Modes: Text → Text · Image → Text · Video → Text
What it’s great for
Rapid drafting and Q&A with minimal latency
Generating captions or descriptions for supplied visuals
Converting short clips into concise summaries
Copy-and-paste prompts
Summarize the notes in `input_texts` into three stakeholder takeaways.Provide descriptive alt text for the uploaded lifestyle photo.Create a 5-sentence recap of the attached pitch video.Draft a friendly follow-up email answering the questions in `input_texts`.Parameters
Prompt
Text
—
Natural-language instruction (required)
Input Texts
List
—
Optional supporting snippets to ground responses
Modes & endpoints (summary)
Text → Text
4 s
Prompt, optional Input Texts
Fast drafting and Q&A
Image → Text
4 s
Prompt, Image(s)
Image captions or scene notes
Video → Text
4 s
Prompt, Video(s)
Short clip recaps
Prompt tips
Attach transcripts or briefs in
Input Textsto ground the responseRequest bullet counts or structured formats to manage token usage
Safety
Moderation is enforced under Google AI policies—avoid disallowed or sensitive content.
Last updated
Was this helpful?