Documentation Index
Fetch the complete documentation index at: https://docs.magnific.com/llms.txt
Use this file to discover all available pages before exploring further.
Overview
PixVerse V5.5 is an image-to-video API that generates cinematic videos from a single image with smooth, expressive motion and stable style across frames. Compared to PixVerse V5, version 5.5 adds native synchronized audio, multi-clip output with dynamic camera changes, an extended 10-second duration option, and a prompt reasoning mode that automatically optimizes inputs before generation.Key capabilities
- Durations of 5, 8, or 10 seconds: 10-second outputs are available up to 720p; 1080p videos are capped at 5 or 8 seconds
- Native synchronized audio: enable
generate_audio_switchto produce background music, sound effects, or dialogue together with the video - Multi-clip with dynamic cameras: enable
generate_multi_clip_switchto produce multi-clip output with camera changes inside a single generation - Prompt reasoning (
thinking_type): chooseenabled,disabled, orautoto control whether the model rewrites the prompt before rendering - Resolutions:
360p,540p,720p,1080p - Aspect ratios:
widescreen_16_9,classic_4_3,square_1_1,traditional_3_4,social_story_9_16 - Camera movement presets: 20 options including
zoom_in,zoom_out,pan_left,pan_right,crane_up,hitchcock,whip_pan,camera_rotation - Style controls:
anime,3d_animation,clay,cyberpunk,comic - Async processing: poll the task endpoint or receive a webhook notification on completion
Use cases
- Marketing and ads: short product videos with synchronized audio and dynamic camera work
- Social content: vertical clips for TikTok, Instagram Reels, and YouTube Shorts in 9:16 at up to 1080p
- Short films and teasers: 10-second 720p shots with multi-clip camera transitions
- Brand storytelling: consistent style and color across cinematic sequences
- Creative experimentation: stylized output (
anime,cyberpunk,clay) for art and concept work
POST /v1/ai/image-to-video/pixverse-v5-5
Generate a video from an image with PixVerse V5.5
GET /v1/ai/image-to-video/pixverse-v5-5/{task-id}
Get task status and result by ID
GET /v1/ai/image-to-video/pixverse-v5-5
List all PixVerse V5.5 tasks
Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
image_url | string (uri) | Yes | - | URL of the image to use as the first frame |
prompt | string | Yes | - | Prompt describing the video to generate |
duration | integer | No | 5 | Video length in seconds: 5, 8, or 10. 10s only up to 720p; 1080p limited to 5 or 8 seconds |
resolution | string | No | - | Output resolution: 360p, 540p, 720p, 1080p |
aspect_ratio | string | No | widescreen_16_9 | widescreen_16_9, classic_4_3, square_1_1, traditional_3_4, social_story_9_16 |
style | string | No | - | Visual style: anime, 3d_animation, clay, cyberpunk, comic |
camera_movement | string | No | - | Camera preset (e.g., zoom_in, pan_left, crane_up, hitchcock) |
negative_prompt | string | No | "" | Negative prompt for the generation |
seed | integer | No | - | Reproducibility seed; same seed + same prompt yields the same video |
generate_audio_switch | boolean | No | false | When true, generates synchronized audio (music, SFX, or dialogue) with the video |
generate_multi_clip_switch | boolean | No | false | When true, produces multi-clip output with dynamic camera changes in a single generation |
thinking_type | string | No | enabled | Prompt reasoning mode: enabled, disabled, or auto |
webhook_url | string (uri) | No | - | Optional callback URL that receives task status notifications |
Frequently Asked Questions
What is the difference between PixVerse V5.5 and PixVerse V5?
What is the difference between PixVerse V5.5 and PixVerse V5?
PixVerse V5.5 adds four capabilities on top of V5: a 10-second duration option (up to 720p), native synchronized audio via
generate_audio_switch, multi-clip output with dynamic camera changes via generate_multi_clip_switch, and a prompt reasoning mode (thinking_type) that can rewrite the prompt before generation. Resolutions, aspect ratios, and camera presets are the same as V5.Which durations and resolutions are supported?
Which durations and resolutions are supported?
PixVerse V5.5 supports
5, 8, and 10 second durations. The 10-second option is only available up to 720p. At 1080p, durations are limited to 5 or 8 seconds. Supported resolutions are 360p, 540p, 720p, and 1080p.How does native audio generation work?
How does native audio generation work?
Set
generate_audio_switch to true and PixVerse V5.5 produces synchronized audio (background music, sound effects, or dialogue) together with the video in a single request. No separate audio call is required.What does `generate_multi_clip_switch` do?
What does `generate_multi_clip_switch` do?
When set to
true, PixVerse V5.5 produces multi-clip output with dynamic camera changes inside a single generation, simulating cuts and camera moves without stitching multiple requests.What does `thinking_type` control?
What does `thinking_type` control?
thinking_type controls prompt reasoning. enabled (default) rewrites the prompt automatically for better results, disabled uses the prompt exactly as written, and auto lets the model decide whether to rewrite the prompt based on the input.What input image works best?
What input image works best?
Use a high-resolution image with clear subjects and balanced lighting. Avoid heavy compression and cluttered backgrounds to maximize motion quality and subject consistency.
What are the rate limits and pricing?
What are the rate limits and pricing?
Rate limits and pricing depend on your subscription tier. See Rate Limits and the Pricing page for current values.
Best practices
- Input quality: provide a sharp, well-lit image with a clear subject for the strongest motion and style stability
- Duration vs resolution: pick
10seconds for 720p storytelling shots, and5/8seconds for 1080p hero shots - Audio: enable
generate_audio_switchonly when you want the model to author audio; if you have your own track, leave itfalseand mix externally - Prompt reasoning: leave
thinking_typeasenabledfor general prompts; switch todisabledwhen you need literal prompt adherence - Production integration: use
webhook_urlinstead of polling for scalable workflows - Error handling: implement retry with exponential backoff for 503 responses
Related APIs
- PixVerse V5: prior version without audio, multi-clip, or 10-second support. See the PixVerse V5 overview.
- PixVerse V5 Transition: image-to-image transition variant for first/last frame interpolation.