Kling VIDEO 2.6 introduces simultaneous video and audio generation

Kling VIDEO 2.6 generates video with integrated voiceovers, sound effects, and ambient audio in a single pass, eliminating the need to create silent footage for later dubbing. VIDEO 2.6 generates videos up to 10 seconds long and supports Chinese and English voices.

Kuaishou Technology has launched Kling VIDEO 2.6, a breakthrough model that includes a "Native Audio" feature that enables the generation of video clips with audio in a single workflow. Like its competitor Veo 3, Kling VIDEO 2.6 eliminates the traditional workflow of creating silent footage and adding any audio effects separately.

The "Native Audio" feature supports two modes of generation: traditional text-to-audio-visual generation based on text prompts only, and image-to-audio-visual generation that accepts text prompts alongside reference images as input. Both generation modes enable users to create videos up to 10 seconds long.

Kling VIDEO 2.6 can generate videos with perfectly synced human voices, sound effects, and ambient audio in a single pass. The model supports Chinese and English voice generation and maintains a world-leading position in Chinese voice generation. From a more technical viewpoint, Kling Video 2.6 excels in three critical areas: audio-visual synchronization that tightly aligns voice rhythm with visual motion, high-quality audio output with rich layering that mirrors professional mixing standards, and robust semantic understanding of complex storylines and colloquial expressions.

The technology supports diverse audio types, such as speech, dialogue, narration, singing, rap, and mixed sound effects. This makes Kling VIDEO 2.6 valuable across advertising, social media, and e-commerce applications. Advertisers can generate complete product showcases with narration and sound effects with a single click, while social media creators can produce multi-character dialogues and music performances more efficiently. Kling VIDEO 2.6's release notes showcase several examples of outputs suitable for these and other use cases.

Subscribe

Kling VIDEO 2.6 introduces simultaneous video and audio generation

Comments

Read Next

Interloom raises $16.5M to build an operational knowledge "memory" for enterprise AI agents

Mistral AI takes on closed-source custom model enterprise services with Mistral Forge

Cursor launches Composer 2: a model more capable, cheaper and faster than its predecessor

Yann LeCun's AMI Labs just raised Europe's largest seed round for its world models

Encyclopedia Britannica and Merriam-Webster are the latest publishers to sue OpenAI