Skip to main content

AI Video Generation

Generate videos from text prompts or images using state-of-the-art AI models. Choose the model that best fits your quality, speed, and budget requirements.
The Video Generation API provides access to multiple AI video models through a unified async workflow: submit a request, receive a task ID, then poll for results or get notified via webhook.

Available models

ModelModesResolutionsDurationsHighlights
Kling 3T2V, I2Vup to 1080p3-15sMulti-shot, element consistency
Kling 3 OmniT2V, I2Vup to 1080p3-15sVideo reference for motion/style
Kling O1T2V, I2Vup to 1080p5-10sHigh-performance generation
WAN 2.6T2V, I2V720p, 1080p5-15sHigh resolution, long duration
WAN 2.5T2V, I2V480p-1080p5-15sMultiple resolution tiers
Runway Gen 4.5T2V, I2Vup to 1080p5-10sCinematic quality
SeedanceI2V480p-1080p5sPro and Lite tiers
MiniMax LiveI2V720p, 1080p5sFast generation
LTX 2.0T2V, I2V768p3-9sFast and Pro tiers

How it works

All video models follow the same async pattern:
  1. POST a generation request with your prompt and/or image
  2. Receive a task_id immediately
  3. Poll with GET /v1/ai/{model}/{task_id} or receive a webhook notification
  4. Download the resulting video from the video_url in the response

Common parameters

ParameterDescription
promptText description of the video to generate
image_url / image_listSource image(s) for image-to-video modes
aspect_ratioVideo aspect ratio (16:9, 9:16, 1:1)
durationVideo length in seconds
webhook_urlURL for async completion notification
Parameter names and accepted values vary by model. Check each model’s endpoint reference for exact specifications.

Choosing a model

  • Highest quality: Kling 3 Pro, Runway Gen 4.5
  • Best value: Kling 3 Standard, LTX 2.0 Fast
  • Longest videos: WAN 2.6 (up to 15s at 1080p)
  • Fastest generation: LTX 2.0 Fast, MiniMax Live
  • Multi-scene narratives: Kling 3 (multi-shot mode)
  • Motion/style reference: Kling 3 Omni (video reference input)