Google Veo 3.1
State-of-the-art image-to-video generation powered by Google’s Veo 3.1 model with multi-resolution output and optional audio synthesis.
Key capabilities
- Image animation: Bring any image to life with natural motion and dynamics
- Multi-resolution output: Generate videos in 720p, 1080p, or 4K resolution
- Audio generation: Optional synchronized audio for complete multimedia content
- Flexible durations: 4, 6, or 8 second video outputs
- Aspect ratio control: 16:9 (landscape) or 9:16 (portrait) formats
- Motion guidance: Use prompts to direct how the image animates
- Two generation modes: Standard (highest quality) and Fast (cost-efficient)
- Flexible input: Accepts Base64-encoded images or HTTPS URLs
Use cases
- Product animation: Animate product photos for e-commerce and advertising
- Social media content: Transform static posts into engaging video content
- Portrait animation: Bring photos of people to life with natural movement
- Art animation: Animate artwork, illustrations, and digital creations
- Marketing visuals: Convert campaign images into dynamic video ads
- Storytelling: Create video sequences from storyboard images
Generate with Veo 3.1 Standard
High-quality video generation with maximum fidelity and detail.
POST /v1/ai/image-to-video/veo-3-1
Create a new image-to-video task
GET /v1/ai/image-to-video/veo-3-1
List all Veo 3.1 tasks
GET /v1/ai/image-to-video/veo-3-1/{task-id}
Get task status by ID
Generate with Veo 3.1 Fast
Faster generation at reduced cost, ideal for prototyping and high-volume workflows.
POST /v1/ai/image-to-video/veo-3-1-fast
Create a fast image-to-video task
GET /v1/ai/image-to-video/veo-3-1-fast
List all Veo 3.1 Fast tasks
GET /v1/ai/image-to-video/veo-3-1-fast/{task-id}
Get task status by ID
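As a rough illustration of how the endpoints above fit together, the following sketch builds (without sending) the create and status requests for either mode. The host `https://api.example.com`, the JSON content type, and the helper names are assumptions made for this example. Authentication is not described on this page, so any required headers are left to the caller.

```python
import json
import urllib.request

# Assumption: placeholder host; substitute the real API base URL.
BASE_URL = "https://api.example.com"

# Paths as listed in the endpoint sections above.
ENDPOINTS = {
    "standard": "/v1/ai/image-to-video/veo-3-1",
    "fast": "/v1/ai/image-to-video/veo-3-1-fast",
}


def build_create_request(mode, payload, headers=None):
    """Build (but do not send) the POST request that creates a task."""
    # Assumption: the body is sent as JSON.
    all_headers = {"Content-Type": "application/json"}
    all_headers.update(headers or {})  # auth headers are supplied by the caller
    return urllib.request.Request(
        BASE_URL + ENDPOINTS[mode],
        data=json.dumps(payload).encode("utf-8"),
        headers=all_headers,
        method="POST",
    )


def build_status_request(mode, task_id, headers=None):
    """Build (but do not send) the GET request for one task's status."""
    return urllib.request.Request(
        f"{BASE_URL}{ENDPOINTS[mode]}/{task_id}",
        headers=dict(headers or {}),
        method="GET",
    )
```

Either request object can be passed to `urllib.request.urlopen` once the real host and credentials are in place. The response format is not documented here, so parsing it is left out.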
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| image | string | Yes | Input image as Base64 string or HTTPS URL (must be publicly accessible) |
| prompt | string | Yes | Text describing the motion and animation (max 20,000 chars) |
| negative_prompt | string | No | Text describing what to avoid in the video |
| duration | integer | No | Video duration: 4, 6, or 8 seconds (default: 8) |
| resolution | string | No | Output resolution: "720p", "1080p", or "4k" (default: "720p") |
| aspect_ratio | string | No | Video format: "16:9" or "9:16" (default: "16:9") |
| generate_audio | boolean | No | Generate synchronized audio (default: true) |
| seed | integer | No | Random seed for reproducibility |
| webhook_url | string | No | URL for task completion notification |
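The constraints in the table can be checked on the client before a request is sent. The sketch below is one possible way to do that. The function name `build_payload` and the choice to send the documented defaults explicitly are assumptions of this example, not part of the API.

```python
ALLOWED_DURATIONS = {4, 6, 8}
ALLOWED_RESOLUTIONS = {"720p", "1080p", "4k"}
ALLOWED_ASPECT_RATIOS = {"16:9", "9:16"}
MAX_PROMPT_CHARS = 20_000

# Optional parameters named in the table above.
OPTIONAL_KEYS = {
    "negative_prompt", "duration", "resolution", "aspect_ratio",
    "generate_audio", "seed", "webhook_url",
}

# Defaults as documented in the table.
DEFAULTS = {
    "duration": 8,
    "resolution": "720p",
    "aspect_ratio": "16:9",
    "generate_audio": True,
}


def build_payload(image, prompt, **options):
    """Return a request body after checking it against the documented limits."""
    if not image:
        raise ValueError("image is required")
    if image.startswith("http://"):
        raise ValueError("image URLs must use HTTPS")
    if not prompt:
        raise ValueError("prompt is required")
    if len(prompt) > MAX_PROMPT_CHARS:
        raise ValueError("prompt exceeds 20,000 characters")

    unknown = set(options) - OPTIONAL_KEYS
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")

    # Caller-supplied options override the documented defaults.
    payload = {"image": image, "prompt": prompt, **DEFAULTS, **options}

    if payload["duration"] not in ALLOWED_DURATIONS:
        raise ValueError("duration must be 4, 6, or 8")
    if payload["resolution"] not in ALLOWED_RESOLUTIONS:
        raise ValueError('resolution must be "720p", "1080p", or "4k"')
    if payload["aspect_ratio"] not in ALLOWED_ASPECT_RATIOS:
        raise ValueError('aspect_ratio must be "16:9" or "9:16"')
    return payload
```

For example, `build_payload("https://example.com/cat.png", "The cat slowly turns its head", duration=6)` returns a body with a 6-second duration and the remaining defaults filled in.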
Frequently Asked Questions
What image formats are supported?
Veo 3.1 accepts images via HTTPS URLs (must be publicly accessible) or Base64 encoded strings. Common formats like JPEG, PNG, and WebP are supported.
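For local files, a Base64 string can be produced with the Python standard library. This is a minimal sketch with a hypothetical helper name, `encode_image`. Whether the API expects a bare Base64 string or a data-URI prefix is not stated on this page, so the sketch returns the bare string.

```python
import base64
from pathlib import Path


def encode_image(path):
    """Read an image file and return its contents as a Base64 string."""
    raw = Path(path).read_bytes()
    # b64encode returns bytes; decode to str so it can be placed in a JSON body.
    return base64.b64encode(raw).decode("ascii")
```

The returned string can then be supplied as the image parameter in place of an HTTPS URL.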
What is the difference between Standard and Fast modes?
Standard mode produces the highest quality output with maximum detail and natural motion, ideal for final production content. Fast mode generates videos more quickly at a lower cost, perfect for prototyping and high-volume workflows.
How do prompts affect the animation?
The prompt guides how the image animates. Describe the desired motion, such as “The subject slowly turns their head and smiles” or “Camera slowly zooms in while leaves blow in the wind”. Be specific about movement direction and speed.
What resolutions are supported?
Veo 3.1 supports 720p (HD), 1080p (Full HD), and 4K (Ultra HD). The output resolution is independent of your input image resolution.
How does audio generation work?
When generate_audio is enabled (default), Veo 3.1 synthesizes audio that matches the visual content and motion. This includes ambient sounds and effects appropriate to the scene.
Can I animate portrait images for social media?
Yes. Set aspect_ratio to 9:16 to generate portrait-format video.
Best practices
- Image quality: Use high-resolution, well-lit images with clear subjects for best results
- Motion prompts: Describe specific movements like “slowly pans left” or “subject walks forward”
- Negative prompts: Use to avoid unwanted artifacts like “blurry, distorted, flickering”
- Resolution matching: Consider your source image quality when selecting output resolution
- Audio control: Disable generate_audio if adding custom audio in post-production
- Webhook integration: Use webhooks for production workflows to handle async completion
- Seed for consistency: Use the same seed to reproduce similar animation results
Related APIs
- Veo 3.1 Text-to-Video: Generate videos from text prompts with Veo 3.1
- Veo 3.1 Reference-to-Video: Generate videos with consistent characters using reference images
- Kling 2.6 Pro: Image-to-video with advanced motion control
- Kling O1: Alternative image-to-video model
- MiniMax Live: Fast image-to-video generation