Happy Horse 1.0 Video Edit API

Alibaba ATH Happy Horse 1.0 integration

Happy Horse 1.0 Video Edit modifies an existing video using natural-language instructions, optionally guided by up to 5 reference images. Output is capped at 15 seconds.

Happy Horse 1.0 Video Edit is an AI video editing API that transforms an existing video with natural-language instructions. Built by Alibaba ATH (Taotian Innovation Unit, led by Zhang Di), the model edits the input MP4/MOV video based on a text prompt and up to 5 optional reference images. Output is delivered at 720P (1280x720) or 1080P (1920x1080) and capped at 15 seconds. Audio handling is configurable: auto lets the model decide whether to regenerate audio based on the prompt; origin preserves the input video’s original audio.

Key capabilities

Natural-language editing: Describe the desired edit in a prompt up to 2500 characters
Up to 5 reference images: Guide the edit with reference images via @Image1..@Image5 tokens
Audio control: auto (model decides) or origin (preserve input audio) via audio_setting
Resolution options: 720P (1280x720) and 1080P (1920x1080) output
Output cap: Edited video is at most 15 seconds long, regardless of input length
Wide input support: MP4 or MOV with H.264 codec, 3 to 60 seconds, max 100 MB
Reproducible results: Fixed seed support (0 to 2147483647) for consistent generation
Async processing: Webhook notifications or polling for task completion

How reference images work

Provide up to 5 reference images in the image_urls array. Reference each one in the prompt using @Image1..@Image5 tokens (1-based, in array order): Example prompt:

“Replace the background with the cityscape in @Image1 and change the protagonist’s outfit to match @Image2”

Use cases

Background replacement: Swap backgrounds for product, fashion, or marketing videos
Style transfer: Change the visual style or color grading of a clip
Object replacement: Replace specific elements while preserving motion and composition
Costume/wardrobe edits: Modify outfits or props using reference images
Visual effects: Add or remove elements with natural-language instructions
Quick iterations: Rapidly produce variants of a video without re-shooting

API operations

Edit videos by submitting a video URL, prompt, and optional reference images. The service returns a task ID for async polling or webhook notification.

POST /v1/ai/video-edit/happy-horse-1

Create a new video edit task

GET /v1/ai/video-edit/happy-horse-1

List all Happy Horse 1.0 Video Edit tasks

GET /v1/ai/video-edit/happy-horse-1/{task-id}

Get task status and results by ID

Parameters

Parameter	Type	Required	Default	Description
`video_url`	`string`	Yes	-	URL of the source video (MP4/MOV with H.264, 3-60s, longer side max 2160 px, shorter side min 320 px, aspect ratio 1:2.5 to 2.5:1, max 100 MB)
`prompt`	`string`	Yes	-	Editing instructions. Reference images via `@Image1`..`@Image5`. Max 2500 characters
`image_urls`	`array`	No	-	Up to 5 reference images (JPEG/JPG/PNG/WEBP, min 300 px, max 10 MB each)
`resolution`	`string`	No	`"1080P"`	Output resolution: `"720P"` or `"1080P"`
`audio_setting`	`string`	No	`"auto"`	Audio handling: `"auto"` (model decides) or `"origin"` (preserve input audio)
`seed`	`integer`	No	Random	Seed for reproducibility (0 to 2147483647)
`webhook_url`	`string`	No	-	URL for async status notifications

Frequently Asked Questions

What is Happy Horse 1.0 Video Edit and how does it work?

Happy Horse 1.0 Video Edit is an AI model from Alibaba ATH that edits an existing video based on natural-language instructions. You provide a video URL, a prompt describing the desired changes, and optionally up to 5 reference images. The model returns a new video at 720P or 1080P, capped at 15 seconds.

What input video formats are supported?

Happy Horse 1.0 Video Edit accepts MP4 and MOV files with H.264 codec via publicly accessible URLs. Input duration must be between 3 and 60 seconds, frame rate greater than 8 fps, longer side at most 2160 px, shorter side at least 320 px, aspect ratio between 1:2.5 and 2.5:1, and max file size 100 MB.

Why is the output capped at 15 seconds?

The model produces edits up to 15 seconds long regardless of input duration. If your input video is longer than 15 seconds, the output is truncated. Plan accordingly when selecting input clips.

How do I reference images in the prompt?

Use position-based tokens that match the order of the image_urls array: the 1st image is @Image1, the 2nd is @Image2, and so on up to @Image5. Example: “Replace the background with the scene in @Image1 and add the object from @Image2 in the foreground.”

What does the audio_setting parameter control?

audio_setting controls how the original audio track is handled. "auto" lets the model decide based on the prompt — if the prompt involves audio descriptions, the model may regenerate audio; otherwise the original audio is preserved. "origin" forces the original audio track to be preserved without regeneration.

What are the rate limits for Happy Horse 1.0?

Rate limits depend on your subscription tier. See the Rate Limits page for current limits by plan.

How much does Happy Horse 1.0 Video Edit cost?

See the Pricing page for current rates and subscription options.

Best practices

Input video selection: Use clips at or below 15 seconds to avoid truncation. Higher-quality inputs produce better edits.
Prompt clarity: Be explicit about which elements to change and which to keep (“Replace the background but keep the subject’s pose and clothing”).
Reference images: Use clean reference images with the target element clearly visible. Match aspect ratio and lighting where possible.
Audio choice: Use "origin" when the input has dialogue or music you want preserved. Use "auto" when the prompt involves audio changes.
Iterate with seed: Save the seed from successful edits to recreate similar results.
Production integration: Use webhooks for scalable applications instead of polling.
Error handling: Implement retry with exponential backoff for 503 errors during high-demand periods.

Happy Horse 1.0 Text-to-Video: Generate videos from text prompts
Happy Horse 1.0 Image-to-Video: Animate a single image into video
Happy Horse 1.0 Reference-to-Video: Generate videos from up to 9 character reference images

Get started

APIs

Happy Horse 1.0 Video Edit API

Alibaba ATH Happy Horse 1.0 integration

Key capabilities

How reference images work

Use cases

API operations

POST /v1/ai/video-edit/happy-horse-1

GET /v1/ai/video-edit/happy-horse-1

GET /v1/ai/video-edit/happy-horse-1/{task-id}

Parameters

Frequently Asked Questions

Best practices

Alibaba ATH Happy Horse 1.0 integration

​Key capabilities

​How reference images work

​Use cases

​API operations

POST /v1/ai/video-edit/happy-horse-1

GET /v1/ai/video-edit/happy-horse-1

GET /v1/ai/video-edit/happy-horse-1/{task-id}

​Parameters

​Frequently Asked Questions

​Best practices

​Related APIs

Key capabilities

How reference images work

Use cases

API operations

Parameters

Frequently Asked Questions

Best practices

Related APIs