Veo 3.1 – Image-to-Video API

Google Veo 3.1

State-of-the-art image-to-video generation powered by Google’s Veo 3.1 model with multi-resolution output and optional audio synthesis.

Veo 3.1 Image-to-Video transforms static images into dynamic, cinematic videos guided by text prompts. It supports resolutions up to 4K, durations of 4-8 seconds, and optional audio generation. Choose between Standard mode for maximum quality or Fast mode for quicker generation at lower cost.

Key capabilities

Image animation: Bring any image to life with natural motion and dynamics
Multi-resolution output: Generate videos in 720p, 1080p, or 4K resolution
Audio generation: Optional synchronized audio for complete multimedia content
Flexible durations: 4, 6, or 8 second video outputs
Aspect ratio control: 16:9 (landscape) or 9:16 (portrait) formats
Motion guidance: Use prompts to direct how the image animates
Two generation modes: Standard (highest quality) and Fast (cost-efficient)
Flexible input: Accept Base64 encoded images or HTTPS URLs

Use cases

Product animation: Animate product photos for e-commerce and advertising
Social media content: Transform static posts into engaging video content
Portrait animation: Bring photos of people to life with natural movement
Art animation: Animate artwork, illustrations, and digital creations
Marketing visuals: Convert campaign images into dynamic video ads
Storytelling: Create video sequences from storyboard images

Generate with Veo 3.1 Standard

High-quality video generation with maximum fidelity and detail.

POST /v1/ai/image-to-video/veo-3-1

Create a new image-to-video task

GET /v1/ai/image-to-video/veo-3-1

List all Veo 3.1 tasks

GET /v1/ai/image-to-video/veo-3-1/{task-id}

Get task status by ID

Generate with Veo 3.1 Fast

Faster generation at reduced cost, ideal for prototyping and high-volume workflows.

POST /v1/ai/image-to-video/veo-3-1-fast

Create a fast image-to-video task

GET /v1/ai/image-to-video/veo-3-1-fast

List all Veo 3.1 Fast tasks

GET /v1/ai/image-to-video/veo-3-1-fast/{task-id}

Get task status by ID

Parameters

Parameter	Type	Required	Description
`image`	`string`	Yes	Input image as Base64 string or HTTPS URL (must be publicly accessible)
`prompt`	`string`	Yes	Text describing the motion and animation (max 20,000 chars)
`negative_prompt`	`string`	No	Text describing what to avoid in the video
`duration`	`integer`	No	Video duration: `4`, `6`, or `8` seconds (default: `8`)
`resolution`	`string`	No	Output resolution: `"720p"`, `"1080p"`, or `"4k"` (default: `"720p"`)
`aspect_ratio`	`string`	No	Video format: `"16:9"` or `"9:16"` (default: `"16:9"`)
`generate_audio`	`boolean`	No	Generate synchronized audio (default: `true`)
`seed`	`integer`	No	Random seed for reproducibility
`webhook_url`	`string`	No	URL for task completion notification

Frequently Asked Questions

What image formats are supported?

Veo 3.1 accepts images via HTTPS URLs (must be publicly accessible) or Base64 encoded strings. Common formats like JPEG, PNG, and WebP are supported.

What is the difference between Standard and Fast modes?

Standard mode produces the highest quality output with maximum detail and natural motion, ideal for final production content. Fast mode generates videos more quickly at a lower cost, perfect for prototyping and high-volume workflows.

How do prompts affect the animation?

The prompt guides how the image animates. Describe the desired motion, such as “The subject slowly turns their head and smiles” or “Camera slowly zooms in while leaves blow in the wind”. Be specific about movement direction and speed.

What resolutions are supported?

Veo 3.1 supports 720p (HD), 1080p (Full HD), and 4K (Ultra HD). The output resolution is independent of your input image resolution.

How does audio generation work?

When generate_audio is enabled (default), Veo 3.1 synthesizes audio that matches the visual content and motion. This includes ambient sounds and effects appropriate to the scene.

Best practices

Image quality: Use high-resolution, well-lit images with clear subjects for best results
Motion prompts: Describe specific movements like “slowly pans left” or “subject walks forward”
Negative prompts: Use to avoid unwanted artifacts like “blurry, distorted, flickering”
Resolution matching: Consider your source image quality when selecting output resolution
Audio control: Disable generate_audio if adding custom audio in post-production
Webhook integration: Use webhooks for production workflows to handle async completion
Seed for consistency: Use the same seed to reproduce similar animation results

Veo 3.1 Text-to-Video: Generate videos from text prompts with Veo 3.1
Veo 3.1 Reference-to-Video: Generate videos with consistent characters using reference images
Kling 2.6 Pro: Image-to-video with advanced motion control
Kling O1: Alternative image-to-video model
MiniMax Live: Fast image-to-video generation

Get started

APIs

Veo 3.1 – Image-to-Video API | Magnific API

Google Veo 3.1

Key capabilities

Use cases

Generate with Veo 3.1 Standard

POST /v1/ai/image-to-video/veo-3-1

GET /v1/ai/image-to-video/veo-3-1

GET /v1/ai/image-to-video/veo-3-1/{task-id}

Generate with Veo 3.1 Fast

POST /v1/ai/image-to-video/veo-3-1-fast

GET /v1/ai/image-to-video/veo-3-1-fast

GET /v1/ai/image-to-video/veo-3-1-fast/{task-id}

Parameters

Frequently Asked Questions

Best practices

Google Veo 3.1

​Key capabilities

​Use cases

​Generate with Veo 3.1 Standard

POST /v1/ai/image-to-video/veo-3-1

GET /v1/ai/image-to-video/veo-3-1

GET /v1/ai/image-to-video/veo-3-1/{task-id}

​Generate with Veo 3.1 Fast

POST /v1/ai/image-to-video/veo-3-1-fast

GET /v1/ai/image-to-video/veo-3-1-fast

GET /v1/ai/image-to-video/veo-3-1-fast/{task-id}

​Parameters

​Frequently Asked Questions

​Best practices

​Related APIs

Key capabilities

Use cases

Generate with Veo 3.1 Standard

Generate with Veo 3.1 Fast

Parameters

Frequently Asked Questions

Best practices

Related APIs