Alibaba ATH Happy Horse 1.0 integration

Happy Horse 1.0 Image-to-Video animates a single image into a high-quality video. It ranked #1 on the Artificial Analysis Video Arena (April 2026) for I2V quality and motion fidelity.
Happy Horse 1.0 Image-to-Video is an AI video generation API that animates a single still image into an MP4 video. Built by Alibaba ATH (Taotian Innovation Unit, led by Zhang Di), it currently ranks #1 on the Artificial Analysis Video Arena leaderboard (April 2026) for image-to-video. The model uses your image as the first frame and an optional text prompt to guide motion and style, producing videos at 720P (1280x720) or 1080P (1920x1080) resolution with durations from 3 to 15 seconds.

Key capabilities

  • State-of-the-art quality: #1 on Artificial Analysis Video Arena (April 2026) for image-to-video
  • First-frame animation: Use any supported image as the starting frame for the video
  • Resolution options: 720P (1280x720) and 1080P (1920x1080) output
  • Flexible durations: 3 to 15 seconds of video output (default 5 seconds)
  • Prompt-guided motion: Optional prompt (up to 2500 characters) controls motion and style
  • Wide image format support: JPEG, JPG, PNG, BMP, WEBP (300px+ on each side, max 10 MB)
  • Reproducible results: Fixed seed support (0 to 2147483647) for consistent generation
  • Async processing: Webhook notifications or polling for task completion

Use cases

  • Product animation: Bring product imagery to life with smooth motion and camera movements
  • Marketing videos: Animate brand visuals and hero shots into short-form video content
  • Social media content: Create video posts from static images for TikTok, Instagram, and YouTube
  • Storyboarding: Animate concept art or wireframes to preview motion sequences
  • Photography to motion: Add subtle motion to still photos for cinemagraphs and immersive content
  • Creative exploration: Experiment with motion prompts on the same image for varied results

API operations

Generate videos by submitting an image URL and optional prompt to the API. The service returns a task ID for async polling or webhook notification.

  • POST /v1/ai/image-to-video/happy-horse-1: Create a new image-to-video generation task
  • GET /v1/ai/image-to-video/happy-horse-1: List all Happy Horse 1.0 I2V tasks with status
  • GET /v1/ai/image-to-video/happy-horse-1/{task-id}: Get task status and results by ID
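
As a minimal sketch of the create call, the snippet below builds (but does not send) a POST request for the endpoint above using Python's standard library. The base URL `https://api.example.com`, the `x-api-key` header name, and the helper name `build_create_request` are assumptions for illustration and are not taken from this page; substitute the values from your account's API reference.

```python
import json
import urllib.request

# Hypothetical values: neither the host nor the auth header name is given on this page.
BASE_URL = "https://api.example.com"
ENDPOINT = "/v1/ai/image-to-video/happy-horse-1"


def build_create_request(api_key, image_url, **options):
    """Build a POST request that would create an image-to-video task."""
    payload = {"image_url": image_url}
    # Only include optional parameters the caller actually set.
    payload.update({k: v for k, v in options.items() if v is not None})
    return urllib.request.Request(
        BASE_URL + ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "x-api-key": api_key,  # assumed header name
        },
        method="POST",
    )


req = build_create_request(
    "YOUR_API_KEY",
    "https://example.com/product.jpg",
    prompt="subject walks forward, slow zoom in",
    resolution="720P",
    duration=5,
)
print(req.get_method(), req.full_url)
```

Passing `req` to `urllib.request.urlopen` would submit the task. The response is described above only as returning a task ID, so the exact field name to read is left to the official reference.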

Parameters

| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| image_url | string | Yes | - | URL of the image to use as the first frame (JPEG/JPG/PNG/BMP/WEBP, 300px+, max 10 MB, aspect ratio 1:2.5 to 2.5:1) |
| prompt | string | No | - | Text prompt to guide motion and style. Max 2500 characters |
| resolution | string | No | "1080P" | Output resolution: "720P" or "1080P" |
| duration | integer | No | 5 | Video length in seconds: any integer from 3 to 15 |
| seed | integer | No | Random | Seed for reproducibility (0 to 2147483647) |
| webhook_url | string | No | - | URL for async status notifications |
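
The limits in the parameter list can be checked on the client before a request is sent. The following is a sketch only: the function name `validate_params` and the wording of the messages are illustrative, and the server remains the authority on what it accepts.

```python
ALLOWED_RESOLUTIONS = {"720P", "1080P"}
MAX_PROMPT_CHARS = 2500
MAX_SEED = 2147483647


def _is_int(value):
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(value, int) and not isinstance(value, bool)


def validate_params(image_url, prompt=None, resolution="1080P", duration=5, seed=None):
    """Return a list of problems; an empty list means the values fit the documented limits."""
    problems = []
    if not isinstance(image_url, str) or not image_url:
        problems.append("image_url is required")
    if prompt is not None and len(prompt) > MAX_PROMPT_CHARS:
        problems.append("prompt exceeds 2500 characters")
    if resolution not in ALLOWED_RESOLUTIONS:
        problems.append('resolution must be "720P" or "1080P"')
    if not _is_int(duration) or not 3 <= duration <= 15:
        problems.append("duration must be an integer from 3 to 15")
    if seed is not None and (not _is_int(seed) or not 0 <= seed <= MAX_SEED):
        problems.append("seed must be an integer from 0 to 2147483647")
    return problems


print(validate_params("https://example.com/a.png", duration=20, resolution="4K"))
```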

Frequently Asked Questions

What is Happy Horse 1.0 Image-to-Video?
Happy Horse 1.0 Image-to-Video is an AI model from Alibaba ATH that animates a single image into an MP4 video. It uses your image as the first frame and applies natural motion guided by an optional text prompt. As of April 2026, it ranks #1 on the Artificial Analysis Video Arena leaderboard for image-to-video quality.

Which image formats are supported?
Happy Horse 1.0 I2V accepts JPEG, JPG, PNG, BMP, and WEBP images via publicly accessible URLs. Each side must be at least 300 pixels, the aspect ratio must be between 1:2.5 and 2.5:1, and the file size must not exceed 10 MB.

Is a prompt required?
No. The prompt parameter is optional but recommended. Without a prompt, the model generates natural motion based on the image content. With a prompt, you can describe specific motions, camera movements, or style cues for better control.

Which video durations are supported?
Happy Horse 1.0 I2V supports integer durations from 3 to 15 seconds. The default is 5 seconds.

How is the output aspect ratio determined?
The output aspect ratio is determined automatically by the dimensions of the image at image_url. Provide an image with the aspect ratio you want for the output video.

What are the rate limits?
Rate limits depend on your subscription tier. See the Rate Limits page for current limits by plan.

How much does it cost?
See the Pricing page for current rates and subscription options.
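
The image limits described in this section (format, minimum side length, aspect ratio, file size) can be pre-checked locally before a task is submitted. This is a hedged sketch: the name `check_image` is illustrative, the caller is assumed to obtain the width, height, and byte size by its own means (for example from an image library), and "10 MB" is interpreted here as 10 × 1024 × 1024 bytes, which this page does not specify.

```python
MIN_SIDE_PX = 300
MAX_RATIO = 2.5  # 1:2.5 to 2.5:1 means the longer side is at most 2.5x the shorter
MAX_BYTES = 10 * 1024 * 1024  # assumption: 10 MB read as 10 MiB
SUPPORTED_EXTENSIONS = {"jpeg", "jpg", "png", "bmp", "webp"}


def check_image(width, height, size_bytes, extension):
    """Return a list of problems; an empty list means the image fits the documented limits."""
    problems = []
    if extension.lower().lstrip(".") not in SUPPORTED_EXTENSIONS:
        problems.append("unsupported format")
    shorter, longer = min(width, height), max(width, height)
    if shorter < MIN_SIDE_PX:
        problems.append("each side must be at least 300 px")
    if longer > MAX_RATIO * shorter:
        problems.append("aspect ratio must be between 1:2.5 and 2.5:1")
    if size_bytes > MAX_BYTES:
        problems.append("file is larger than 10 MB")
    return problems


print(check_image(1920, 1080, 2_000_000, "jpg"))
print(check_image(3000, 1000, 2_000_000, "png"))
```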

Best practices

  • Image quality: Use high-resolution images with clear subjects and balanced lighting. Avoid heavily compressed or noisy inputs.
  • Aspect ratio: Provide an image already cropped to the target ratio, because the output ratio matches the input image.
  • Prompt writing: Even though prompts are optional, adding motion and camera directions (“subject walks forward, slow zoom in”) improves results.
  • Duration selection: Start with shorter durations (3-5 seconds) for quick iterations, then increase for final outputs.
  • Production integration: Use webhooks for scalable applications instead of polling.
  • Error handling: Implement retry with exponential backoff for 503 errors during high-demand periods.
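
The retry advice above could be sketched as follows. This is an illustration under stated assumptions, not the service's own client: `call_with_backoff` and its `send` argument are hypothetical names, `send` is any zero-argument callable that performs the HTTP request and returns a `(status_code, body)` tuple, and the delay values and random jitter are design choices rather than documented requirements.

```python
import random
import time


def call_with_backoff(send, max_attempts=5, base_delay=1.0, max_delay=30.0, sleep=time.sleep):
    """Call send() until it returns a non-503 status or the attempts run out."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, body = send()
        if status != 503:
            return status, body
        if attempt < max_attempts - 1:
            # Double the ceiling on each retry: 1s, 2s, 4s, ... capped at max_delay.
            ceiling = min(max_delay, base_delay * 2 ** attempt)
            # Random jitter keeps many clients from retrying at the same instant.
            sleep(random.uniform(0, ceiling))
    # Every attempt returned 503; hand the last response back to the caller.
    return status, body
```

The `sleep` parameter is injectable so the function can be exercised in tests without real waiting. In production the default `time.sleep` applies.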