Documentation Index
Fetch the complete documentation index at: https://docs.magnific.com/llms.txt
Use this file to discover all available pages before exploring further.
Overview
PixVerse V6 is an image-to-video API that generates cinematic videos from a single image or interpolates a transition between two keyframes in the same endpoint. Compared to PixVerse V5.5, version 6 unlocks flexible 1-15 second durations (any integer length) and adds an optionallast_frame_image field that enables built-in first-to-last frame transition mode, removing the need for a separate transition endpoint. Native synchronized audio, multi-clip output with dynamic camera changes, and prompt reasoning enhancement are all preserved.
Key capabilities
- Transition mode (
last_frame_image): provide a second image and PixVerse V6 generates a smooth video interpolation betweenimage_url(first frame) andlast_frame_image(last frame) in a single request, replacing the dedicated transition endpoint - Flexible duration (1-15 seconds): any integer length from
1to15, with default5(previously fixed to5,8, or10) - Native synchronized audio: enable
generate_audio_switchto produce background music, sound effects, or dialogue together with the video - Multi-clip with dynamic cameras: enable
generate_multi_clip_switchfor multi-clip output with camera changes inside a single generation - Prompt reasoning (
thinking_type): chooseenabled,disabled, orautoto control whether the model rewrites the prompt before rendering - Resolutions:
360p,540p,720p,1080p - Aspect ratios:
widescreen_16_9,classic_4_3,square_1_1,traditional_3_4,social_story_9_16 - Camera movement presets: 20 options including
zoom_in,zoom_out,pan_left,pan_right,crane_up,hitchcock,whip_pan,camera_rotation - Style controls:
anime,3d_animation,clay,cyberpunk,comic - Async processing: poll the task endpoint or receive a webhook notification on completion
Use cases
- Marketing and ads: short product videos with synchronized audio and dynamic camera work in any duration that fits the channel
- Social content: vertical clips for TikTok, Instagram Reels, and YouTube Shorts in 9:16 at up to 1080p
- Story transitions: scene-to-scene cuts and morphs using
last_frame_imageto bridge two shots in a single generation - Short films and teasers: longer 10-15 second sequences with multi-clip camera transitions
- Brand storytelling: consistent style and color across cinematic sequences
- Creative experimentation: stylized output (
anime,cyberpunk,clay) for art and concept work
POST /v1/ai/image-to-video/pixverse-v6
Generate a video from an image (or first/last frame pair) with PixVerse V6
GET /v1/ai/image-to-video/pixverse-v6/{task-id}
Get task status and result by ID
GET /v1/ai/image-to-video/pixverse-v6
List all PixVerse V6 tasks
Parameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
image_url | string (uri) | Yes | - | URL of the image to use as the first frame |
prompt | string | Yes | - | Prompt describing the video to generate |
last_frame_image | string (uri) | No | - | URL of the image to use as the last frame. When provided, PixVerse V6 generates a transition between image_url (first frame) and last_frame_image (last frame) in the same endpoint |
duration | integer | No | 5 | Video length in seconds. Any integer from 1 to 15 |
resolution | string | No | - | Output resolution: 360p, 540p, 720p, 1080p |
aspect_ratio | string | No | widescreen_16_9 | widescreen_16_9, classic_4_3, square_1_1, traditional_3_4, social_story_9_16 |
style | string | No | - | Visual style: anime, 3d_animation, clay, cyberpunk, comic |
camera_movement | string | No | - | Camera preset (e.g., zoom_in, pan_left, crane_up, hitchcock) |
negative_prompt | string | No | "" | Negative prompt for the generation |
seed | integer | No | - | Reproducibility seed; same seed + same prompt yields the same video |
generate_audio_switch | boolean | No | false | When true, generates synchronized audio (music, SFX, or dialogue) with the video |
generate_multi_clip_switch | boolean | No | false | When true, produces multi-clip output with dynamic camera changes in a single generation |
thinking_type | string | No | enabled | Prompt reasoning mode: enabled, disabled, or auto |
webhook_url | string (uri) | No | - | Optional callback URL that receives task status notifications |
Frequently Asked Questions
What is the difference between PixVerse V6 and PixVerse V5.5?
What is the difference between PixVerse V6 and PixVerse V5.5?
PixVerse V6 introduces two main changes on top of V5.5: flexible duration (any integer from
1 to 15 seconds instead of the fixed 5, 8, or 10) and built-in transition mode via the optional last_frame_image field, which replaces the separate transition endpoint. Native synchronized audio (generate_audio_switch), multi-clip output (generate_multi_clip_switch), and prompt reasoning (thinking_type) work the same as in V5.5.How does the new transition mode work?
How does the new transition mode work?
Provide a second image URL in the
last_frame_image field together with the standard image_url. PixVerse V6 interpolates a video between the two frames using your prompt as guidance, producing the transition in a single request. If last_frame_image is omitted, PixVerse V6 behaves as a regular image-to-video model with only the first frame.Which durations and resolutions are supported?
Which durations and resolutions are supported?
PixVerse V6 accepts any integer duration from
1 to 15 seconds (default 5). Supported resolutions are 360p, 540p, 720p, and 1080p. Pick the duration that best matches your shot length without being constrained to fixed steps.How does native audio generation work?
How does native audio generation work?
Set
generate_audio_switch to true and PixVerse V6 produces synchronized audio (background music, sound effects, or dialogue) together with the video in a single request. No separate audio call is required.What does `generate_multi_clip_switch` do?
What does `generate_multi_clip_switch` do?
When set to
true, PixVerse V6 produces multi-clip output with dynamic camera changes inside a single generation, simulating cuts and camera moves without stitching multiple requests.What does `thinking_type` control?
What does `thinking_type` control?
thinking_type controls prompt reasoning. enabled (default) rewrites the prompt automatically for better results, disabled uses the prompt exactly as written, and auto lets the model decide whether to rewrite the prompt based on the input.What input image works best?
What input image works best?
Use a high-resolution image with clear subjects and balanced lighting. Avoid heavy compression and cluttered backgrounds to maximize motion quality and subject consistency. For transition mode, the two frames should share scene context (subject, framing, lighting) for the smoothest interpolation.
What are the rate limits and pricing?
What are the rate limits and pricing?
Rate limits and pricing depend on your subscription tier. See Rate Limits and the Pricing page for current values.
Best practices
- Transition mode: when using
last_frame_image, choose two frames that share scene context (subject, framing, lighting) for a smoother interpolation - Duration selection: pick the shortest duration that conveys your shot to reduce processing time and cost; reserve longer 10-15s durations for narrative sequences
- Input quality: provide sharp, well-lit images with a clear subject for the strongest motion and style stability
- Audio: enable
generate_audio_switchonly when you want the model to author audio; if you have your own track, leave itfalseand mix externally - Prompt reasoning: leave
thinking_typeasenabledfor general prompts; switch todisabledwhen you need literal prompt adherence - Production integration: use
webhook_urlinstead of polling for scalable workflows - Error handling: implement retry with exponential backoff for 503 responses
Related APIs
- PixVerse V5.5: prior version with fixed
5/8/10second durations and no built-in transition mode. See the PixVerse V5.5 overview. - PixVerse V5: earlier release without audio, multi-clip, or 10-second support. See the PixVerse V5 overview.
- PixVerse V5 Transition: dedicated first/last frame transition endpoint for V5. With V6, this is now available natively via
last_frame_image.