Google Veo 3.1

State-of-the-art image-to-video generation powered by Google’s Veo 3.1 model with multi-resolution output and optional audio synthesis.
Veo 3.1 Image-to-Video transforms static images into dynamic, cinematic videos guided by text prompts. It supports output at 720p, 1080p, or 4K, durations of 4, 6, or 8 seconds, and optional audio generation. Choose Standard mode for maximum quality or Fast mode for quicker generation at lower cost.

Key capabilities

  • Image animation: Bring any image to life with natural motion and dynamics
  • Multi-resolution output: Generate videos in 720p, 1080p, or 4K resolution
  • Audio generation: Optional synchronized audio for complete multimedia content
  • Flexible durations: 4, 6, or 8 second video outputs
  • Aspect ratio control: 16:9 (landscape) or 9:16 (portrait) formats
  • Motion guidance: Use prompts to direct how the image animates
  • Two generation modes: Standard (highest quality) and Fast (cost-efficient)
  • Flexible input: Accepts Base64-encoded images or publicly accessible HTTPS URLs

Use cases

  • Product animation: Animate product photos for e-commerce and advertising
  • Social media content: Transform static posts into engaging video content
  • Portrait animation: Bring photos of people to life with natural movement
  • Art animation: Animate artwork, illustrations, and digital creations
  • Marketing visuals: Convert campaign images into dynamic video ads
  • Storytelling: Create video sequences from storyboard images

Generate with Veo 3.1 Standard

High-quality video generation with maximum fidelity and detail.

POST /v1/ai/image-to-video/veo-3-1

Create a new image-to-video task

GET /v1/ai/image-to-video/veo-3-1

List all Veo 3.1 tasks

GET /v1/ai/image-to-video/veo-3-1/{task-id}

Get task status by ID

Generate with Veo 3.1 Fast

Faster generation at reduced cost, ideal for prototyping and high-volume workflows.

POST /v1/ai/image-to-video/veo-3-1-fast

Create a fast image-to-video task

GET /v1/ai/image-to-video/veo-3-1-fast

List all Veo 3.1 Fast tasks

GET /v1/ai/image-to-video/veo-3-1-fast/{task-id}

Get task status by ID
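
The Standard and Fast endpoints differ only in the final path segment, so a client can choose between them with a single mode switch. Below is a minimal Python sketch using only the standard library. The host `https://api.example.com` and the `x-api-key` header name are placeholders that do not come from this page; replace them with the values from your provider's authentication documentation. The helper names are hypothetical as well.

```python
import json
import urllib.request

# Paths exactly as listed on this page.
PATHS = {
    "standard": "/v1/ai/image-to-video/veo-3-1",
    "fast": "/v1/ai/image-to-video/veo-3-1-fast",
}

# ASSUMPTION: placeholder host and auth header; neither is documented here.
BASE_URL = "https://api.example.com"
AUTH_HEADER = "x-api-key"


def task_url(mode, task_id=None):
    """Return the collection URL for a mode, or one task's URL if task_id is given."""
    if mode not in PATHS:
        raise ValueError(f"mode must be one of {sorted(PATHS)}")
    url = BASE_URL + PATHS[mode]
    return f"{url}/{task_id}" if task_id else url


def build_create_request(mode, payload, api_key):
    """Build (but do not send) the POST request that creates a task."""
    return urllib.request.Request(
        task_url(mode),
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", AUTH_HEADER: api_key},
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send the request; the sketch stops short of that so it can be inspected without network access.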

Parameters

| Parameter | Type | Required | Description |
|---|---|---|---|
| image | string | Yes | Input image as Base64 string or HTTPS URL (must be publicly accessible) |
| prompt | string | Yes | Text describing the motion and animation (max 20,000 chars) |
| negative_prompt | string | No | Text describing what to avoid in the video |
| duration | integer | No | Video duration: 4, 6, or 8 seconds (default: 8) |
| resolution | string | No | Output resolution: "720p", "1080p", or "4k" (default: "720p") |
| aspect_ratio | string | No | Video format: "16:9" or "9:16" (default: "16:9") |
| generate_audio | boolean | No | Generate synchronized audio (default: true) |
| seed | integer | No | Random seed for reproducibility |
| webhook_url | string | No | URL for task completion notification |
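
The constraints in the parameter table can be checked on the client before a task is submitted. The Python sketch below is illustrative only: `build_payload` is a hypothetical helper, not part of any official SDK, and this page does not state how the API itself responds to out-of-range values.

```python
# Allowed values and defaults as documented in the parameter table.
ALLOWED_DURATIONS = (4, 6, 8)
ALLOWED_RESOLUTIONS = ("720p", "1080p", "4k")
ALLOWED_ASPECT_RATIOS = ("16:9", "9:16")
MAX_PROMPT_CHARS = 20_000


def build_payload(image, prompt, *, negative_prompt=None, duration=8,
                  resolution="720p", aspect_ratio="16:9",
                  generate_audio=True, seed=None, webhook_url=None):
    """Validate arguments against the documented limits and return a request body."""
    if not image:
        raise ValueError("image is required (Base64 string or HTTPS URL)")
    if not prompt:
        raise ValueError("prompt is required")
    if len(prompt) > MAX_PROMPT_CHARS:
        raise ValueError(f"prompt exceeds {MAX_PROMPT_CHARS} characters")
    if duration not in ALLOWED_DURATIONS:
        raise ValueError(f"duration must be one of {ALLOWED_DURATIONS}")
    if resolution not in ALLOWED_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {ALLOWED_RESOLUTIONS}")
    if aspect_ratio not in ALLOWED_ASPECT_RATIOS:
        raise ValueError(f"aspect_ratio must be one of {ALLOWED_ASPECT_RATIOS}")

    payload = {
        "image": image,
        "prompt": prompt,
        "duration": duration,
        "resolution": resolution,
        "aspect_ratio": aspect_ratio,
        "generate_audio": generate_audio,
    }
    # Optional fields without a documented default are sent only when set.
    optional = {
        "negative_prompt": negative_prompt,
        "seed": seed,
        "webhook_url": webhook_url,
    }
    payload.update({k: v for k, v in optional.items() if v is not None})
    return payload
```

For example, `build_payload("https://example.com/cat.png", "Camera slowly zooms in", seed=42)` returns a body with the documented defaults filled in and `seed` set to 42, while `duration=5` raises `ValueError`.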

Frequently Asked Questions

Veo 3.1 accepts images as publicly accessible HTTPS URLs or as Base64-encoded strings. Common formats such as JPEG, PNG, and WebP are supported.
Standard mode produces the highest quality output with maximum detail and natural motion, ideal for final production content. Fast mode generates videos more quickly at a lower cost, perfect for prototyping and high-volume workflows.
The prompt guides how the image animates. Describe the desired motion, such as “The subject slowly turns their head and smiles” or “Camera slowly zooms in while leaves blow in the wind”. Be specific about movement direction and speed.
Veo 3.1 supports 720p (HD), 1080p (Full HD), and 4K (Ultra HD). The output resolution is independent of your input image resolution.
When generate_audio is enabled (default), Veo 3.1 synthesizes audio that matches the visual content and motion. This includes ambient sounds and effects appropriate to the scene.
To generate vertical videos, set aspect_ratio to "9:16". For best results, use portrait-oriented source images when creating vertical content.
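
Since the image parameter accepts either an HTTPS URL or a Base64 string, a small helper can normalize both kinds of input. This is a sketch under one stated assumption: the page does not say whether the Base64 value should carry a `data:` URI prefix, so the helper (a hypothetical name, `image_field`) emits the bare encoded string.

```python
import base64


def image_field(source):
    """Return a value for `image`: HTTPS URLs pass through, raw bytes are Base64-encoded."""
    if isinstance(source, bytes):
        # ASSUMPTION: plain Base64 with no "data:image/...;base64," prefix.
        return base64.b64encode(source).decode("ascii")
    if isinstance(source, str) and source.startswith("https://"):
        return source
    raise ValueError("expected raw image bytes or an https:// URL")
```

To send a local file, read it in binary mode and pass the bytes, for example `image_field(open("photo.png", "rb").read())`, where `photo.png` is a placeholder file name.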

Best practices

  • Image quality: Use high-resolution, well-lit images with clear subjects for best results
  • Motion prompts: Describe specific movements like “slowly pans left” or “subject walks forward”
  • Negative prompts: Use to avoid unwanted artifacts like “blurry, distorted, flickering”
  • Resolution matching: Consider your source image quality when selecting output resolution
  • Audio control: Disable generate_audio if adding custom audio in post-production
  • Webhook integration: Use webhooks for production workflows to handle async completion
  • Seed for consistency: Use the same seed to reproduce similar animation results
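
Where a webhook is not practical, for instance during local prototyping, the "Get task status by ID" endpoint can be polled instead. The sketch below shows only the polling loop. The response shape is not documented on this page, so the `status` field and the `COMPLETED` and `FAILED` values are assumptions, and `fetch_task` is a hypothetical callable that you supply to perform the actual GET request.

```python
import time

# ASSUMPTION: hypothetical terminal status names; check the real API response.
TERMINAL_STATUSES = {"COMPLETED", "FAILED"}


def wait_for_task(fetch_task, task_id, interval=5.0, timeout=600.0,
                  sleep=time.sleep, clock=time.monotonic):
    """Call fetch_task(task_id) until it reports a terminal status or time runs out."""
    deadline = clock() + timeout
    while True:
        task = fetch_task(task_id)
        if task.get("status") in TERMINAL_STATUSES:
            return task
        if clock() >= deadline:
            raise TimeoutError(f"task {task_id} did not finish within {timeout} seconds")
        sleep(interval)
```

Injecting `sleep` and `clock` keeps the loop testable without real delays. In production, a webhook avoids the repeated requests entirely.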