Question 1

What is Grok Imagine Video 1.5?

Accepted Answer

Grok Imagine Video 1.5 is xAI's next-generation AI image-to-video model, officially released May 31, 2026. It animates input images into cinematic video clips up to 15 seconds at 480p or 720p, with native audio generated in sync. It currently holds the #1 position on the Image-to-Video Arena leaderboard with a +52 Elo improvement over version 1.0.

Question 2

How much does Grok Imagine Video 1.5 cost?

Accepted Answer

Grok Imagine Video 1.5 pricing is pay-per-second: 480p costs $0.08/sec and 720p costs $0.14/sec. Each input image adds $0.01. A 5-second 480p clip runs $0.40; a 10-second 720p clip is $1.41. See the full breakdown on the pricing page at https://grokimaginevideo.app/pricing.

Question 3

How does Grok Imagine Video 1.5 compare to Kling 3.0?

Accepted Answer

Grok Imagine Video 1.5 outranks Kling 3.0 on the Arena.ai image-to-video leaderboard. Grok 1.5 also generates native audio automatically, while Kling 3.0 does not. For image-to-video workflows, Grok 1.5 is the top-ranked choice.

Question 4

Grok Imagine Video 1.5 vs Veo 3.1 — which is better?

Accepted Answer

Grok Imagine Video 1.5 ranks #1 on the Image-to-Video Arena, ahead of Veo 3.1. Grok 1.5 ships native audio in every clip; Veo 3.1 still needs separate audio workflows. For still-image animation, Grok 1.5 is the stronger pick as of May 2026.

Question 5

What file formats does Grok Imagine Video 1.5 accept?

Accepted Answer

Grok Imagine Video 1.5 accepts JPG, PNG, and WebP images as input. Output is MP4 video at 480p or 720p, up to 15 seconds per generation. Upload through the web UI or pass image URLs via the xAI API.

Question 6

Is Grok Imagine Video 1.5 free to try?

Accepted Answer

Grok Imagine Video 1.5 is not fully free — it uses pay-per-second pricing via the xAI API. You can test with small clips at $0.08/sec for 480p output. Credit packs and starter options are available on the pricing page.

Question 7

What resolutions and video lengths does Grok Imagine Video 1.5 support?

Accepted Answer

Grok Imagine Video 1.5 supports 480p and 720p output. Video duration goes up to 15 seconds per generation. It accepts multiple aspect ratios to match your input image. For longer content, the Video Extend feature lets you chain clips while maintaining character and scene consistency.

Question 8

Does Grok Imagine Video 1.5 generate audio automatically?

Accepted Answer

Yes. Grok Imagine Video 1.5 includes native audio generation — the model produces synchronized ambient sounds, dialogue, and music as part of the same generation pass. No separate audio tool is required. Audio is included in the base pricing with no additional cost beyond input charges.

Question 9

How do I access the Grok Imagine Video 1.5 API?

Accepted Answer

The Grok Imagine Video 1.5 API is available via the xAI API platform at docs.x.ai. The model name is grok-imagine-video-1.5-preview, with the alias grok-imagine-video-1.5-2026-05-30. Rate limits are 60 requests per minute. Regions supported: us-east-1 and eu-west-1.

Question 10

Can I use Grok Imagine Video 1.5 for commercial projects?

Accepted Answer

Yes. Videos generated through paid plans carry full commercial usage rights for advertising, marketing, and distribution. Always review xAI's terms of service for content restrictions, particularly around generating likeness content or regulated industries.

Rank	Rank Spread	Model	Score	Votes
1	1-2	grok-imagine-video-1.5-preview-720pPreliminary xAI · Proprietary	1473 ±9	5,564
2	1-2	dreamina-seedance-2.0-720p Bytedance · Proprietary	1467 ±11	56,710
3	3-3	happyhorse-1.0 Alibaba-ATH · Proprietary	1443 ±12	33,267
4	4-4	grok-imagine-video-720p xAI · Proprietary	1421 ±6	380,580
5	5-8	veo-3.1-audio Google · Proprietary	1397 ±11	25,113
6	5-9	veo-3.1-audio-1080p Google · Proprietary	1393 ±10	24,381
7	5-9	veo-3.1-fast-audio Google · Proprietary	1384 ±9	99,851
8	5-9	grok-imagine-video-480p xAI · Proprietary	1383 ±9	19,415
9	6-11	veo-3.1-fast-audio-1080p Google · Proprietary	1374 ±11	24,874
10	9-11	vidu-q3-pro Shengshu · Proprietary	1360 ±8	36,674

Feature	Grok Imagine 1.5Best	Kling 3.0	Google Veo 3.1	Sora 2
Image-to-Video Rank	✦ #1	Top 5	Top 5	Top 5
Max Resolution	720p	1080p	1080p	1080p
Max Duration	15 sec	10 sec	8 sec	20 sec
Native Audio	Included	✗	✓	✗
Starting Price / sec	$0.08	~$0.14	~$0.34+	~$0.10
Generation Speed	~15 seconds	2–4 min	2–5 min	1–3 min
Public API	60 RPM	✓	Enterprise only	✓
Video Extend	✓	✓	✗	✗
Face Accuracy (blind test)	Top-rated	Strong	Strong	Moderate

Grok Imagine Video 1.5
#1 Image-to-Video
AI Generator

Six Major Upgrades
Over Version 1.0

Native Audio Generation

Advanced Face Accuracy

Temporal Coherence & Video Extend

Photorealism & Lighting Quality

Superior Motion Control

Faster Generation & Wider API

See What Grok Imagine Video 1.5
Actually Creates

Ranked #1
Image-to-Video Arena

How It Stacks Up
Against the Competition

Built for Real
Production Workflows

Advertising & Brand Video

UGC & Social Content

Product Animation & E-Commerce

Film & Narrative Pre-Viz

Game & App Trailers

Developer & API Workflows

Frequently Asked Questions

Start Generating.
Right Now.

Grok Imagine Video 1.5#1 Image-to-VideoAI Generator

Six Major UpgradesOver Version 1.0