Skip to main content

Alibaba ATH Happy Horse 1.0 integration

Happy Horse 1.0 is Alibaba ATH’s flagship video model, ranked #1 on the Artificial Analysis Video Arena (April 2026) for both text-to-video and image-to-video, delivering state-of-the-art motion fidelity, prompt adherence, and visual quality.
Happy Horse 1.0 Text-to-Video is an AI video generation API that creates MP4 videos from text descriptions. Built by Alibaba ATH (Taotian Innovation Unit, led by Zhang Di), it currently holds the #1 position on the Artificial Analysis Video Arena leaderboard (April 2026) for both T2V and I2V. The model produces high-fidelity video with cinematic motion at 720P (1280x720) or 1080P (1920x1080) resolution, supports 5 configurable aspect ratios, and durations from 3 to 15 seconds.

Key capabilities

  • State-of-the-art quality: #1 on Artificial Analysis Video Arena (April 2026) for both T2V and I2V
  • Resolution options: 720P (1280x720) and 1080P (1920x1080) output
  • 5 aspect ratios: 16:9 landscape, 9:16 portrait, 1:1 square, 4:3 standard, 3:4 portrait
  • Flexible durations: 3 to 15 seconds of video output (default 5 seconds)
  • Multilingual prompts: Up to 2500 characters describing scene, motion, and style
  • Reproducible results: Fixed seed support (0 to 2147483647) for consistent generation
  • Async processing: Webhook notifications or polling for task completion

Use cases

  • Marketing videos: Create product showcases and brand content from text descriptions
  • Social media content: Generate short-form videos for TikTok, Instagram, and YouTube in portrait or landscape
  • Concept visualization: Transform ideas and scripts into motion for rapid prototyping
  • Educational content: Illustrate concepts with AI-generated video explanations
  • Storyboarding: Pre-visualize scenes from a script before live production
  • Creative exploration: Experiment with text prompts and aspect ratios for unique visual content

API operations

Generate videos by submitting a text prompt to the API. The service returns a task ID for async polling or webhook notification.

POST /v1/ai/text-to-video/happy-horse-1

Create a new text-to-video generation task

GET /v1/ai/text-to-video/happy-horse-1

List all Happy Horse 1.0 T2V tasks with status

GET /v1/ai/text-to-video/happy-horse-1/{task-id}

Get task status and results by ID

Parameters

ParameterTypeRequiredDefaultDescription
promptstringYes-Text description of the video. Max 2500 characters
aspect_ratiostringNo"16:9"Output ratio: "16:9", "9:16", "1:1", "4:3", "3:4"
resolutionstringNo"1080P"Output resolution: "720P" or "1080P"
durationintegerNo5Video length in seconds: 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15
seedintegerNoRandomSeed for reproducibility (0 to 2147483647)
webhook_urlstringNo-URL for async status notifications

Frequently Asked Questions

Happy Horse 1.0 is an AI video generation model developed by Alibaba ATH (Taotian Innovation Unit), led by Zhang Di. As of April 2026, it ranks #1 on the Artificial Analysis Video Arena leaderboard for both text-to-video (T2V) and image-to-video (I2V) categories, ahead of other state-of-the-art video models.
Submit a text prompt describing your desired video, receive a task ID immediately, then poll the GET endpoint or wait for a webhook notification when processing completes. The model generates an MP4 video at 720P or 1080P resolution in durations from 3 to 15 seconds.
Happy Horse 1.0 T2V supports 5 aspect ratios: 16:9 (landscape widescreen), 9:16 (portrait/mobile), 1:1 (square), 4:3 (standard landscape), and 3:4 (standard portrait). The default is 16:9.
Happy Horse 1.0 T2V supports integer durations from 3 to 15 seconds. The default is 5 seconds. Allowed values: 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15.
Processing time depends on resolution, duration, and server load. Higher resolution (1080P) and longer durations take more time. For production workflows, use webhooks instead of polling for scalable integration.
Rate limits depend on your subscription tier. See the Rate Limits page for current limits by plan.
See the Pricing page for current rates and subscription options.
Both are Alibaba video models. Happy Horse 1.0 is built by the ATH (Taotian) Innovation Unit and currently ranks #1 on the Artificial Analysis Video Arena (April 2026). WAN 2.7 is built by the WAN team and offers audio-guided generation and longer minimum durations (2 seconds vs 3 seconds for Happy Horse). Choose Happy Horse 1.0 for benchmark-leading quality and Reference-to-Video with up to 9 character references; choose WAN 2.7 for audio-guided generation.

Best practices

  • Prompt writing: Be specific about scenes, characters, camera movements (zoom, pan, tilt), lighting, and atmosphere. Detailed prompts produce better results than vague descriptions.
  • Duration selection: Start with shorter durations (3-5 seconds) for quick iterations, then increase for final outputs.
  • Reproducibility: Save the seed value from successful generations to recreate similar results with the same parameters.
  • Production integration: Use webhooks for scalable applications instead of polling.
  • Error handling: Implement retry with exponential backoff for 503 errors during high-demand periods.