Text-to-audio-visual short scene
A concise scene prompt that describes the action, mood, environment, and the sound or voice that should match the video.
The official Kling 2.6 announcement emphasizes simultaneous audio-visual generation. The model page should therefore explain not only motion quality, but also when users should enable sound.