Kuaishou Technology (“Kuaishou” or the “Company”; HKD Counterstock Code: 01024 / RMB Counterstock Code: 81024), a leading content community and social platform, announced that Kling AI released the Kling Video 2.6 model on December 3, 2025. This update introduces the milestone feature of “simultaneous audiovisual generation,” which fundamentally changes the workflow of the traditional AI video production model of silent visuals followed by manual dubbing. This model reimagines AI video creation workflows and significantly accelerates creative efficiency by allowing visuals, natural narration, sound effects, and ambient atmosphere to be generated simultaneously in a single pass.
Redefine your AI video creation workflow with world-leading Chinese audio generation
The Kling Video 2.6 model upgrades two major features: text-to-audiovisual generation and image-to-audiovisual generation. Whether or not If you enter any text Or by combining images and prompts, users can directly generate videos with audio, sound effects, and ambient sounds. This model currently supports Chinese and English audio generation and can create video content up to 10 seconds in length.
This upgrade restructures traditional AI video creation workflows, which typically require generating silent footage first and using separate software for post-production audio. The Kling Video 2.6 model allows creators to instantly generate fully integrated videos that include narration, sound effects, and ambient sounds, greatly increasing creator efficiency.
Kling Video 2.6 models leverage deep semantic coordination between real-world sounds and dynamic visuals to provide superior performance in audio-visual synchronization, audio quality, and semantic understanding.
The Kling Video 2.6 model focuses on audio-visual coordination, achieving tight coordination between audio rhythm, environmental sounds, and visual movement. This careful tuning ensures that the visual dynamics match the rhythm of the audio, eliminating the disjointed “audio-video mismatch” experience often seen in traditional workflows.
In terms of sound quality, it not only supports a variety of sounds such as voices, sound effects, and environmental sounds, but also provides cleaner and more layered sound quality. The overall aural experience faithfully reflects realistic audio mixing and meets rigorous standards for audio detail in professional-grade productions.
In terms of semantic understanding, this model demonstrates robust understanding of textual descriptions, colloquialisms, and complex storylines across a variety of scenarios. Deliver logically consistent audiovisual content that accurately captures the creator's intent and precisely meets the user's requirements. Additionally, the Kling Video 2.6 model maintains its world-leading position in Chinese speech generation performance.
One-click “audio-visual co-generation” drives an efficiency revolution in diverse creative scenarios such as advertising, marketing, social media, e-commerce, etc.
The Kling Video 2.6 model supports the generation of standalone or combined audio types, including speech, dialogue, narration, singing, rap, ambient sound effects, and mixed sound effects. This versatility facilitates a wide range of applications in video content creation across industries such as advertising, marketing, social media, and e-commerce, greatly increasing creative efficiency.
For example, for advertising and marketing, the Kling Video 2.6 model allows you to generate short ads featuring voiceovers, character dialogue, and product showcases with comprehensive sound effects in one click. This significantly reduces ad production costs and increases efficiency.
In social media, the Kling Video 2.6 model offers a wide range of applications. Multi-character interaction capabilities allow creators to create a variety of content including interviews, scripted performances, and comedy skits. Furthermore, the music performance function enables a variety of creative expressions such as singing, rapping, and playing musical instruments. The Kling Video 2.6 model allows creators to significantly reduce costs while streamlining their workflow, making creating social media content easier and more budget-friendly.
In e-commerce, the Kling Video 2.6 model leverages monologue and voice-over features to effectively automate the creation of e-commerce product showcase videos that highlight key selling points, helping merchants improve operational efficiency.
The launch of the Kling Video 2.6 model further reduces the cost and complexity of video production for the content production industry. Kling AI continues to develop practical features to provide creators with better, easier-to-use AI video creation tools that deliver higher performance.
