Veo 3 (by Google)

Overview

Video generation model from Google that produces short clips from a text prompt or a still image, with natively generated audio.


Quick facts

  • Modes: Text → Video · Image → Video

  • Default clip length: 8 seconds

  • Resolution: 960×960 (square) · 1280×720 (landscape) · 720×1280 (portrait)

  • Aspect ratios: 16:9, 1:1, 9:16
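
As a small illustration, the resolutions and aspect ratios listed above can be kept in a lookup table. This is only a sketch: the names `RESOLUTIONS` and `resolution_for` are hypothetical and not part of any Veo 3 API. The pairing of each ratio with a resolution follows the square/landscape/portrait labels in the list.

```python
# Output resolutions from the quick facts, keyed by aspect ratio.
# RESOLUTIONS and resolution_for are illustrative names, not a real API.
RESOLUTIONS = {
    "16:9": (1280, 720),  # landscape
    "1:1": (960, 960),    # square
    "9:16": (720, 1280),  # portrait
}


def resolution_for(aspect_ratio: str) -> tuple[int, int]:
    """Return (width, height) in pixels for a supported aspect ratio."""
    try:
        return RESOLUTIONS[aspect_ratio]
    except KeyError:
        supported = ", ".join(RESOLUTIONS)
        raise ValueError(
            f"Unsupported aspect ratio {aspect_ratio!r}; expected one of: {supported}"
        ) from None
```

For example, `resolution_for("9:16")` gives the portrait size, which can be useful when sizing a preview player before the clip is ready.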


What it’s great for

  • Short cinematic clips with integrated audio.

  • Rapid prototyping with sound design included.

  • Short-form content pipelines.


Example outputs

Scene with native audio — fox in misty forest

Motion frame — high-contrast macro shot

Audio-synced shot — dialogue moment

Copy-and-paste prompts


Parameters

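| Name         | Type   | Default          | Notes                                           |
|--------------|--------|------------------|-------------------------------------------------|
| Prompt       | Text   |                  | Required                                        |
| Aspect Ratio | Select | Landscape (16:9) | Landscape (16:9), Square (1:1), Portrait (9:16) |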
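
If the parameters are assembled programmatically, a small validation step can catch a missing prompt or an unsupported aspect ratio before a job is submitted. The sketch below is an assumption-heavy illustration: the function `build_params` and the dictionary keys `prompt` and `aspect_ratio` are hypothetical names, not a documented request format.

```python
# Allowed values and default taken from the Parameters table.
ASPECT_RATIOS = ("Landscape (16:9)", "Square (1:1)", "Portrait (9:16)")
DEFAULT_ASPECT_RATIO = "Landscape (16:9)"


def build_params(prompt: str, aspect_ratio: str = DEFAULT_ASPECT_RATIO) -> dict:
    """Validate inputs and return a parameter dict.

    The key names are hypothetical; adapt them to the form or API in use.
    """
    if not prompt or not prompt.strip():
        raise ValueError("Prompt is required")
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(
            f"Aspect Ratio must be one of {ASPECT_RATIOS}, got {aspect_ratio!r}"
        )
    return {"prompt": prompt.strip(), "aspect_ratio": aspect_ratio}
```

Omitting `aspect_ratio` falls back to the landscape default, mirroring the Default column.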


Modes

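| Mode          | Estimated time | Required inputs | Typical use               |
|---------------|----------------|-----------------|---------------------------|
| Text → Video  | ~4 minutes     | Prompt          | Videos with audio         |
| Image → Video | ~4 minutes     | Image, Prompt   | Animate stills with sound |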
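
The required inputs per mode can be checked with a short helper before starting a run that takes several minutes. This is a minimal sketch; `REQUIRED_INPUTS` and `missing_inputs` are hypothetical names, and the lowercase input keys (`prompt`, `image`) are assumptions made for the example.

```python
# Required inputs per mode, as listed in the Modes table.
# Input key names ("prompt", "image") are assumed for illustration.
REQUIRED_INPUTS = {
    "Text → Video": {"prompt"},
    "Image → Video": {"image", "prompt"},
}


def missing_inputs(mode: str, provided: dict) -> list[str]:
    """Return the sorted names of required inputs that are absent or empty."""
    if mode not in REQUIRED_INPUTS:
        raise ValueError(f"Unknown mode: {mode!r}")
    # Treat empty strings and None as "not provided".
    present = {name for name, value in provided.items() if value}
    return sorted(REQUIRED_INPUTS[mode] - present)
```

An empty list means the mode has everything it needs; otherwise the returned names can be shown to the user.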


Output options

| Option  | Values / notes    |
|---------|-------------------|
| Formats | MP4 (H.264 + AAC) |
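
To confirm that a downloaded clip carries both an H.264 video stream and an AAC audio stream, the JSON output of `ffprobe` can be inspected. The sketch below only parses that JSON; it assumes `ffprobe` was run separately, for example as `ffprobe -v error -show_streams -of json clip.mp4`, where `clip.mp4` is a placeholder file name. The function name `is_h264_aac` is hypothetical.

```python
import json


def is_h264_aac(ffprobe_json: str) -> bool:
    """Check ffprobe's JSON stream listing for H.264 video and AAC audio."""
    streams = json.loads(ffprobe_json).get("streams", [])
    # Collect (stream type, codec) pairs reported by ffprobe.
    codecs = {(s.get("codec_type"), s.get("codec_name")) for s in streams}
    return ("video", "h264") in codecs and ("audio", "aac") in codecs
```

A `False` result for a clip that plays silently usually points to a missing audio stream rather than a container problem.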


Prompt tips

  • Describe desired soundscapes in the prompt for richer audio results.

  • Use explicit timing tokens for beats or cuts.
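
Both tips can be combined when prompts are generated from structured data. The sketch below is one possible convention, not a documented syntax: the `Audio:` label, the `[start-end s]` timing notation, and the function name `build_prompt` are all assumptions. The 8-second limit comes from the default clip length.

```python
CLIP_LENGTH_S = 8  # default clip length from the quick facts


def build_prompt(visual: str, soundscape: str, beats=()) -> str:
    """Join a visual description, a soundscape, and timed beats into one prompt.

    beats is an iterable of (start_s, end_s, action) tuples. The bracketed
    timing format is an illustrative convention, not an official token syntax.
    """
    parts = [visual.strip(), f"Audio: {soundscape.strip()}"]
    for start, end, action in beats:
        if not 0 <= start < end <= CLIP_LENGTH_S:
            raise ValueError(
                f"Beat {start}-{end}s falls outside the {CLIP_LENGTH_S}s clip"
            )
        parts.append(f"[{start}-{end}s] {action.strip()}")
    return " ".join(parts)


prompt = build_prompt(
    "A fox walks through a misty forest.",
    "soft wind, distant birdsong.",
    [(0, 3, "The fox pauses and listens."), (3, 8, "The fox trots away.")],
)
```

Keeping the soundscape and the beats as separate arguments makes it easy to vary one while holding the other fixed across test generations.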

