Analytics API
Track your team’s AI credit consumption over time with the new Analytics API. Query usage with per-user, per-project, and per-group breakdowns at day, week, month, or year granularity.
- POST /v1/analytics/team-credit-usage: Query team credit consumption with time-series aggregation
- GET /v1/analytics/team-members: List team members with email, role, and status
- GET /v1/analytics/team-api-keys: List API keys with name and status
- GET /v1/analytics/team-groups: List team groups with member count
- GET /v1/analytics/team-projects: List team projects with name
- Time-series aggregation: day, week, month, or year granularity
- Per-user breakdown with email addresses, per-project and per-group context
- Filter by group IDs, project references, or API keys
- Discovery endpoints to retrieve filter values for your team
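A credit-usage query body might be assembled as in the sketch below. The endpoint path and the four granularity values come from the list above. The field names (granularity, start_date, end_date, group_ids) and the helper name are assumptions for illustration only, so check the API reference for the actual schema.

```python
import json
from datetime import date

ENDPOINT = "/v1/analytics/team-credit-usage"  # path from the changelog
GRANULARITIES = {"day", "week", "month", "year"}  # values from the changelog


def build_credit_usage_query(start, end, granularity="day", group_ids=None):
    """Build a JSON body for a credit-usage query.

    Every field name below is a hypothetical placeholder, not the documented schema.
    """
    if granularity not in GRANULARITIES:
        raise ValueError(f"granularity must be one of {sorted(GRANULARITIES)}")
    if end < start:
        raise ValueError("end date must not precede start date")
    body = {
        "granularity": granularity,
        "start_date": start.isoformat(),
        "end_date": end.isoformat(),
    }
    if group_ids:
        # Optional filter; IDs would come from the team-groups discovery endpoint
        body["group_ids"] = list(group_ids)
    return json.dumps(body, sort_keys=True)


if __name__ == "__main__":
    print(ENDPOINT, build_credit_usage_query(date(2026, 1, 1), date(2026, 1, 31), "week"))
```

Validating granularity on the client keeps obviously malformed queries from consuming a request.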
PixVerse V6 - Image to Video API
Generate cinematic videos from a single image, or interpolate a transition between two keyframes, with PixVerse V6. Builds on PixVerse V5.5 with flexible 1-15 second durations (any integer length) and a built-in transition mode via the new optional last_frame_image field.
- POST /v1/ai/image-to-video/pixverse-v6: Generate a video from an image (or first/last frame pair) with PixVerse V6
- GET /v1/ai/image-to-video/pixverse-v6: List all PixVerse V6 tasks
- GET /v1/ai/image-to-video/pixverse-v6/{task-id}: Get PixVerse V6 task status and result
- Flexible duration: any integer value from 1 to 15 seconds (previously fixed to 5, 8, or 10)
- Transition mode via the new optional last_frame_image field: generate a smooth interpolation between image_url (first frame) and last_frame_image (last frame) in a single request, replacing the separate transition endpoint
- Preserves V5.5 features: native synchronized audio (generate_audio_switch), multi-clip output (generate_multi_clip_switch), prompt reasoning (thinking_type)
- Same resolutions (360p, 540p, 720p, 1080p), aspect ratios, styles, and camera presets as V5.5
- Async polling or webhook notifications on completion
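The sketch below shows how a client might build a PixVerse V6 request body with and without transition mode. The image_url and last_frame_image fields are named in the changelog. The duration and prompt field names, the default of 5 seconds, and the helper itself are assumptions.

```python
def build_pixverse_v6_payload(image_url, duration=5, last_frame_image=None, prompt=None):
    """Return a request body dict for POST /v1/ai/image-to-video/pixverse-v6.

    "duration" and "prompt" are assumed field names; the default of 5 is also an assumption.
    """
    # bool is a subclass of int in Python, so reject it explicitly
    if isinstance(duration, bool) or not isinstance(duration, int):
        raise TypeError("duration must be an integer number of seconds")
    if not 1 <= duration <= 15:
        raise ValueError("duration must be between 1 and 15 seconds")
    payload = {"image_url": image_url, "duration": duration}
    if prompt is not None:
        payload["prompt"] = prompt
    if last_frame_image is not None:
        # A last frame turns the request into a first-to-last-frame transition
        payload["last_frame_image"] = last_frame_image
    return payload


if __name__ == "__main__":
    print(build_pixverse_v6_payload("https://example.com/first.png", duration=12,
                                    last_frame_image="https://example.com/last.png"))
```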
PixVerse V5.5 - Image to Video API
Generate cinematic videos from a single image with PixVerse V5.5. Builds on PixVerse V5 with a new 10-second duration option (up to 720p), native synchronized audio, multi-clip output with dynamic camera changes, and a prompt reasoning mode.
- POST /v1/ai/image-to-video/pixverse-v5-5: Generate a video from an image with PixVerse V5.5
- GET /v1/ai/image-to-video/pixverse-v5-5: List all PixVerse V5.5 tasks
- GET /v1/ai/image-to-video/pixverse-v5-5/{task-id}: Get PixVerse V5.5 task status and result
- Durations of 5, 8, or 10 seconds (10s only up to 720p; 1080p capped at 5 or 8 seconds)
- Native synchronized audio via generate_audio_switch (music, SFX, or dialogue with the video)
- Multi-clip output with dynamic cameras via generate_multi_clip_switch
- Prompt reasoning via thinking_type (enabled, disabled, auto)
- Same resolutions (360p, 540p, 720p, 1080p), aspect ratios, styles, and camera presets as V5
- Async polling or webhook notifications on completion
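The duration and resolution rule for V5.5 (10 seconds only up to 720p, 1080p limited to 5 or 8 seconds) can be checked before submitting a task. This is a minimal sketch; the function name is hypothetical, and the server remains the authority on which combinations it accepts.

```python
ALLOWED_DURATIONS = (5, 8, 10)  # seconds, from the changelog
RESOLUTIONS = ("360p", "540p", "720p", "1080p")  # from the changelog


def is_valid_v5_5_combo(duration, resolution):
    """Return True if the duration/resolution pair fits the V5.5 limits listed above."""
    if duration not in ALLOWED_DURATIONS or resolution not in RESOLUTIONS:
        return False
    # 10-second clips are only offered up to 720p
    if duration == 10 and resolution == "1080p":
        return False
    return True


if __name__ == "__main__":
    valid = [(d, r) for d in ALLOWED_DURATIONS for r in RESOLUTIONS if is_valid_v5_5_combo(d, r)]
    print(valid)
```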
Happy Horse 1.0 - Video Generation, Reference, and Editing API
Generate AI videos with Happy Horse 1.0, Alibaba ATH’s flagship video model, ranked #1 on the Artificial Analysis Video Arena (April 2026) for both text-to-video and image-to-video. Four endpoint categories: text-to-video, image-to-video, reference-to-video (up to 9 character references), and video editing with optional audio control.
- POST /v1/ai/text-to-video/happy-horse-1: Generate video from a text prompt (3-15 seconds)
- POST /v1/ai/image-to-video/happy-horse-1: Animate a single image into video (3-15 seconds)
- POST /v1/ai/reference-to-video/happy-horse-1: Generate video from up to 9 reference images, referenced as character1..character9 in the prompt (3-15 seconds)
- POST /v1/ai/video-edit/happy-horse-1: Edit videos with natural-language instructions and up to 5 reference images via @Image1..@Image5 tokens (output capped at 15 seconds)
- State-of-the-art quality: #1 on Artificial Analysis Video Arena (April 2026) for T2V and I2V
- Resolutions: 720P (1280x720) and 1080P (1920x1080)
- 5 aspect ratios: 16:9, 9:16, 1:1, 4:3, 3:4
- Durations: 3 to 15 seconds (default 5)
- R2V supports up to 9 character references with optional bottom-right “Happy Horse” watermark
- Video Edit supports audio_setting (auto or origin) to control audio handling
- Reproducible generation via seed (0 to 2147483647)
- Webhook notifications and async polling
Kling 4K I2V – Video Generation API
Generate 4K videos from an input image with Kling 4K I2V. Async jobs return a task ID for polling or webhooks, with duration and prompt control for 4K output.
- POST /v1/ai/video/kling-4k-i2v: Generate video from an image with Kling 4K I2V
Kling 4K T2V – Video Generation API
Generate 4K videos from a text prompt with Kling 4K T2V. Async jobs return a task ID for polling or webhooks, with duration and prompt control for 4K output.
- POST /v1/ai/video/kling-4k-t2v: Generate video from text with Kling 4K T2V
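For the polling route, a client typically checks the task until it reaches a terminal state. The loop below is a generic sketch: the status strings and the shape of the task dict are assumptions, and fetch_status is a caller-supplied function (for example, a wrapper around the relevant get-task request), not part of any SDK.

```python
import time

TERMINAL_STATUSES = {"COMPLETED", "FAILED"}  # assumed status names


def poll_task(fetch_status, task_id, interval=2.0, max_attempts=30, sleep=time.sleep):
    """Call fetch_status(task_id) until it reports a terminal status.

    fetch_status must return a dict with a "status" key (assumed response shape).
    """
    for attempt in range(max_attempts):
        task = fetch_status(task_id)
        if task.get("status") in TERMINAL_STATUSES:
            return task
        if attempt < max_attempts - 1:
            sleep(interval)  # wait before the next poll
    raise TimeoutError(f"task {task_id} did not finish after {max_attempts} polls")


if __name__ == "__main__":
    # Simulated responses stand in for real HTTP calls
    responses = iter([{"status": "IN_PROGRESS"}, {"status": "COMPLETED"}])
    print(poll_task(lambda _id: next(responses), "demo-task", interval=0.0))
```

Webhooks avoid this loop entirely, so polling is mainly useful where the client cannot expose a public callback URL.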
Google Imagen 4 API (Fast & Ultra)
Generate high-quality images from text with Google Imagen 4. Two variants: Imagen 4 Fast for speed-optimized and cost-effective generation, and Imagen 4 Ultra for highest-quality professional output. Both variants share the same parameter schema, multilingual prompt support (9 languages), 5 aspect ratios, configurable person-generation and safety controls, PNG/JPEG output, and deterministic generation via seed.
- POST /v1/ai/text-to-image/imagen4-fast: Create image with Imagen 4 Fast
- GET /v1/ai/text-to-image/imagen4-fast: List all Imagen 4 Fast tasks
- GET /v1/ai/text-to-image/imagen4-fast/{task-id}: Get Imagen 4 Fast task status and results
- POST /v1/ai/text-to-image/imagen4-ultra: Create image with Imagen 4 Ultra
- GET /v1/ai/text-to-image/imagen4-ultra: List all Imagen 4 Ultra tasks
- GET /v1/ai/text-to-image/imagen4-ultra/{task-id}: Get Imagen 4 Ultra task status and results
- Two quality tiers: Fast (speed/cost-optimized) and Ultra (maximum quality)
- Multilingual prompts: en, es, pt, ja, ko, hi, zh, zh-CN, auto
- Aspect ratios: square_1_1, social_story_9_16, widescreen_16_9, traditional_3_4, classic_4_3
- Person generation controls: dont_allow, allow_adult, allow_all
- Safety settings: from block_low_and_above to block_none
- Output formats: PNG or JPEG with configurable compression
- Deterministic generation with seed (1 - 4,294,967,295)
- LLM-based prompt rewriting via enhance_prompt
WAN 2.7 - Video Generation, Editing, and Reference API
Generate AI videos with WAN 2.7, Alibaba’s latest video model. Four endpoint categories: text-to-video, image-to-video (with video continuation), reference-to-video for character-consistent generation, and video editing with style transfer. All categories support 720P and 1080P resolution, audio-guided generation, and automatic prompt expansion.
- POST /v1/ai/text-to-video/wan-2-7: Generate video from a text prompt (2-15 seconds)
- POST /v1/ai/image-to-video/wan-2-7: Generate video from image, first+last frame, or extend a video (2-15 seconds)
- POST /v1/ai/reference-to-video/wan-2-7: Generate video featuring characters from reference images/videos (2-10 seconds)
Veed Fabric 1.0 and Veed Fabric 1.0 Fast - Lip Sync API
Generate realistic talking videos from a portrait image and audio file with Veed Fabric 1.0. Two variants available: standard for highest quality and Fast for reduced generation time. Output at 720p or 480p resolution in MP4 format.
- POST /v1/ai/lip-sync/veed-fabric-1-0: Generate a lip-synced talking video
- POST /v1/ai/lip-sync/veed-fabric-1-0-fast: Generate a lip-synced talking video (faster processing)
- GET /v1/ai/lip-sync/veed-fabric-1-0: List all Veed Fabric 1.0 tasks
- GET /v1/ai/lip-sync/veed-fabric-1-0-fast: List all Veed Fabric 1.0 Fast tasks
Video Upscaler Precision - AI Video Upscaling API
AI diffusion-based precision video upscaling with faithful detail recovery. Supports output resolutions of 1K, 2K, and 4K with adjustable sharpening, smart grain, and upscaling strength controls. FPS boost available for smoother motion.
- POST /v1/ai/video-upscaler-precision: Create a precision video upscaling task
- GET /v1/ai/video-upscaler-precision: List all precision upscaler tasks
- GET /v1/ai/video-upscaler-precision/{task-id}: Get task status and results
Kling 3 Motion Control - Video Generation API
Transfer motion from reference videos to character images with Kling 3 Motion Control. Preserves character appearance while applying motion patterns from 3-30 second reference videos. Available in Pro and Standard tiers.
- POST /v1/ai/video/kling-v3-motion-control-pro: Generate motion-controlled video with Kling 3 Pro
- POST /v1/ai/video/kling-v3-motion-control-std: Generate motion-controlled video with Kling 3 Standard
Sound Effects API
Search, filter, and download royalty-free sound effects from the Magnific catalog. Browse 42 categories including ambience, foley, and transitions with full-text search, category filtering, and flexible sorting.
- GET /v1/sfx: Search and filter sound effects
- GET /v1/sfx/download: Download a sound effect audio file
Music API
Search, filter, and download royalty-free music from the Magnific Music catalog. Filter by genre, mood, and premium status with sorting by relevance, popularity, duration, or tempo.
- GET /v1/music: Search and filter music
- GET /v1/music/download: Download a music audio file
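A music search is a plain GET with query parameters, so the URL can be built with the standard library. The four sort options come from the description above. The parameter names (q, genre, mood, sort) and the example host are assumptions, not the documented query schema.

```python
from urllib.parse import urlencode

SORT_OPTIONS = {"relevance", "popularity", "duration", "tempo"}  # from the changelog


def music_search_url(base_url, query=None, genre=None, mood=None, sort="relevance"):
    """Build a GET /v1/music URL; all query parameter names are hypothetical."""
    if sort not in SORT_OPTIONS:
        raise ValueError(f"sort must be one of {sorted(SORT_OPTIONS)}")
    params = {"q": query, "genre": genre, "mood": mood, "sort": sort}
    # Drop filters that were not supplied
    params = {key: value for key, value in params.items() if value is not None}
    return f"{base_url.rstrip('/')}/v1/music?{urlencode(params)}"


if __name__ == "__main__":
    print(music_search_url("https://api.example.com", query="lo fi", mood="calm"))
```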
Nano Banana Pro Flash - Text To Image API
Generate images from text with Nano Banana Pro Flash, powered by Google’s Gemini 3.1 Flash model. Faster generation with Google Search grounding for real-world accuracy, reference image support (up to 3 images), 10 aspect ratios, and resolutions up to 4K.
- POST /v1/ai/text-to-image/nano-banana-pro-flash: Create a new image generation task
- GET /v1/ai/text-to-image/nano-banana-pro-flash: List all Nano Banana Pro Flash tasks
- GET /v1/ai/text-to-image/nano-banana-pro-flash/{task-id}: Get task status and results by ID
Video Upscaler Turbo Endpoint and Frame-Based Pricing
New dedicated Turbo endpoint for Video Upscaler with faster processing and premium quality automatically applied. Pricing model updated to frame-based billing that varies by output resolution. The turbo and premium_quality parameters have been removed from the standard endpoint in favor of the separate Turbo path. Turbo tasks use the same list and get-task endpoints as standard tasks.
- POST /v1/ai/video-upscaler/turbo: Create a turbo video upscaling task
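Clients that previously sent the turbo flag need to route those requests to the new path and drop the removed parameters. The migration sketch below makes three assumptions: the standard endpoint path is inferred from the Turbo path rather than stated here, the video field is a placeholder, and a request with premium_quality but no turbo flag is sent to the standard endpoint.

```python
TURBO_PATH = "/v1/ai/video-upscaler/turbo"  # from the changelog
STANDARD_PATH = "/v1/ai/video-upscaler"  # assumed; inferred from the Turbo path
REMOVED_PARAMS = ("turbo", "premium_quality")  # removed from the standard endpoint


def migrate_upscaler_request(old_body):
    """Map a pre-change request body to (path, new_body) without the removed parameters."""
    new_body = {key: value for key, value in old_body.items() if key not in REMOVED_PARAMS}
    # Requests that asked for turbo now go to the dedicated endpoint
    path = TURBO_PATH if old_body.get("turbo") else STANDARD_PATH
    return path, new_body


if __name__ == "__main__":
    legacy = {"video": "https://example.com/clip.mp4", "turbo": True, "premium_quality": True}
    print(migrate_upscaler_request(legacy))
```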
Runway Gen 4.5 - Video Generation API
Generate high-quality AI videos from text prompts or images with Runway Gen 4.5. Supports both text-to-video and image-to-video workflows with async task processing, polling, and webhook notifications.
- POST /v1/ai/text-to-video/runway-4-5: Generate video from a text prompt
- POST /v1/ai/image-to-video/runway-4-5: Generate video from an image
Change Camera - Image Perspective API
Transform the camera angle and perspective of any image with AI. Control horizontal rotation (0-360 degrees), vertical tilt (-30 to 90 degrees), and zoom level (0-10) to generate multi-angle views from a single photo.
- POST /v1/ai/image-change-camera: Create a new camera angle transformation task
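To produce a set of multi-angle views, a client could submit one task per evenly spaced rotation. The ranges in this sketch come from the description above. The field names (image, horizontal_angle, vertical_angle, zoom), the default zoom of 5, and the helper name are assumptions.

```python
def orbit_requests(image_url, views, tilt=0, zoom=5):
    """Return one request body per evenly spaced horizontal rotation.

    Field names and the default zoom are hypothetical, not the documented schema.
    """
    if views < 1:
        raise ValueError("views must be at least 1")
    if not -30 <= tilt <= 90:
        raise ValueError("tilt must be between -30 and 90 degrees")
    if not 0 <= zoom <= 10:
        raise ValueError("zoom must be between 0 and 10")
    step = 360 / views  # degrees between consecutive views
    return [
        {
            "image": image_url,
            "horizontal_angle": round(i * step, 2),
            "vertical_angle": tilt,
            "zoom": zoom,
        }
        for i in range(views)
    ]


if __name__ == "__main__":
    for body in orbit_requests("https://example.com/photo.jpg", views=4, tilt=15):
        print(body)
```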
Seedream V4.5 – Image Expand API
Outpaint and expand images with Seedream V4.5 by setting per-edge pixel growth. Optional prompt guidance, with support for async tasks, polling, and webhooks.
- POST /v1/ai/image-expand/seedream-v4-5: Expand an image beyond its boundaries using Seedream V4.5 outpainting
Ideogram Image Edit – Inpainting API
Edit images with Ideogram inpainting using masks and prompts. Supports TURBO, DEFAULT, and QUALITY modes, MagicPrompt, and async tasks with webhooks and polling.
- POST /v1/ai/ideogram-image-edit: Inpaint and edit an image using a mask plus a prompt with Ideogram Image Edit
Ideogram Image Expand API
Expand images beyond their original boundaries with AI-powered outpainting using the Ideogram model. Control expansion independently on each edge (left, right, top, bottom) up to 2048 pixels, with optional prompt guidance and auto-prompt generation.
- POST /v1/ai/image-expand/ideogram: Create a new image expansion task
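The output canvas after an expansion is the original size plus the growth on each edge. The sketch below checks the 2048-pixel per-edge limit and computes the resulting dimensions. It assumes a lower bound of 0 per edge and no additional cap on total output size, neither of which is stated above.

```python
MAX_EDGE_EXPANSION = 2048  # per-edge limit from the changelog


def expanded_size(width, height, left=0, right=0, top=0, bottom=0):
    """Return (new_width, new_height) after growing each edge by the given pixel counts."""
    edges = (("left", left), ("right", right), ("top", top), ("bottom", bottom))
    for name, value in edges:
        if not 0 <= value <= MAX_EDGE_EXPANSION:
            raise ValueError(f"{name} must be between 0 and {MAX_EDGE_EXPANSION} pixels")
    return width + left + right, height + top + bottom


if __name__ == "__main__":
    # A 1024x768 image widened by 256 px per side and extended 512 px downward
    print(expanded_size(1024, 768, left=256, right=256, bottom=512))
```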
Kling 3 - Video Generation API
Generate AI videos with Kling 3, the latest video generation model from Kuaishou.
- POST /v1/ai/video/kling-v3-pro: Generate video with Kling 3 Pro
- POST /v1/ai/video/kling-v3-std: Generate video with Kling 3 Standard
Google Veo 3.1 Reference-to-Video API
Generate videos with consistent characters and objects using reference images. Maintain visual identity across scenes for storytelling and multi-scene projects.
- POST /v1/ai/reference-to-video/veo-3-1: Create video with reference images
- GET /v1/ai/reference-to-video/veo-3-1: List all reference-to-video tasks
- GET /v1/ai/reference-to-video/veo-3-1/{task-id}: Get task status and results
- Character/object consistency using 1-3 reference images
- Multi-resolution output: 720p, 1080p, or 4K
- Native audio generation with dialogue and sound effects
- Fixed 8-second duration at 24 FPS
- Aspect ratios: 16:9 (landscape) or 9:16 (portrait)
- Prompts of up to 20,000 characters
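The limits listed for Veo 3.1 reference-to-video can be checked locally before a task is created. This sketch collects every problem instead of stopping at the first one. The function name is hypothetical, and the checks mirror only the limits listed above.

```python
MAX_PROMPT_CHARS = 20_000  # from the changelog
ASPECT_RATIOS = ("16:9", "9:16")  # from the changelog


def veo_reference_problems(prompt, reference_images, aspect_ratio="16:9"):
    """Return a list of human-readable problems; an empty list means the input fits the limits."""
    problems = []
    if not 1 <= len(reference_images) <= 3:
        problems.append("expected 1 to 3 reference images")
    if len(prompt) > MAX_PROMPT_CHARS:
        problems.append(f"prompt exceeds {MAX_PROMPT_CHARS} characters")
    if aspect_ratio not in ASPECT_RATIOS:
        problems.append(f"aspect ratio must be one of {list(ASPECT_RATIOS)}")
    return problems


if __name__ == "__main__":
    print(veo_reference_problems("a cat walks through a market", ["cat.png"]))
```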
Nano Banana Pro (Text-to-Image)
Generate high-quality images with Google’s Nano Banana Pro (Gemini 3) model. Supports reference images for guided generation, multiple aspect ratios, and resolution options.
- POST /v1/ai/text-to-image/nano-banana-pro: Create image from text with optional reference images
- Up to 3 reference images for guided generation
- Multiple aspect ratios (1:1, 16:9, 9:16, 4:3, 3:4, etc.)
- Resolution options: low, medium (default), high (4K)
- Webhook notifications