Kling Video O1 Launches as the World’s First All-in-One Model for Generation and Text-Based Editing

22:08, 06 December

Edited by: Veronika Radoslavskaya

iframe { display: none; }

Kling Video O1 Launches as the World’s First All-in-One Model for Generation and Text-Based Editing

The AI video landscape has undergone a major transformation with the launch of Kling Video O1 (Omni One), a powerful new foundation model positioned as the world’s first unified multimodal engine for both video generation and advanced editing. Developed by Kuaishou, the model breaks down the previous fragmentation of the creative workflow, eliminating the need for creators to switch between separate tools for creation, editing, and refinement.

iframe { display: none; }

The core technological breakthrough lies in O1’s ability to accept a complex mix of inputs—including text prompts, multiple reference images (up to seven), and video clips—in a single, seamless workflow. This unified multimodal engine allows creators to generate high-fidelity 1080p scenes, and immediately apply post-production edits using only natural language commands. Users can now type prompts like "remove the passerby in the background," "change daytime to dusk," or "swap the main character’s outfit," and the model understands the visual context to execute these modifications precisely.

Kling Video O1 tackles long-standing industry challenges, particularly in visual coherence. It is engineered to maintain exceptional character consistency and style across extended sequences and complex camera movements, acting like a human director to prevent visual "drift" or flickering artifacts. Furthermore, the model provides granular control through features like Start and End Frame control, allowing editors to define exactly where a shot begins and ends, enabling smooth transitions and precise animation of still images. While base clips are typically around 5 to 10 seconds, O1’s architecture supports generating longer, more coherent narrative clips with reports suggesting extendable lengths up to two minutes.

Technical strengths include a Chain-of-Thought (CoT) reasoning system for enhanced prompt analysis and physics understanding, and strong benchmark results showing significant performance advantages over competitors like Google Veo 3.1 and Runway Aleph in complex transformation tasks. By merging these seven key creative capabilities—from text-to-video to scene extension and editing—Kling Video O1 sets a new standard for professional efficiency, ensuring high quality and consistency from concept to final cut.

46 Views

Sources

מגזין גאדג'טים וטכנולוגיה - Gadgety.co.il | גאדג'טי
Kling's Video O1 launches as the first all-in-one video model for generation and editing
Kling AI Launches O1, the Industry's First Unified Multimodal Video Model, Revolutionizing Content Creation and Editing - Barchart.com
Kling AI releases unified video model - Kr Asia
'Nano Banana' of AI Video: Chinese platform Kling AI Launches O1 AI Video Editing Model
Creativity AI #52: Runway claims the top spot, Kling goes multimodal, and Midjourney rethinks its UI - Medium

Notification Center

Kling Video O1 Launches as the World’s First All-in-One Model for Generation and Text-Based Editing

Sources

Read more articles on this topic: