NEW: Unlock the Future of Finance with CRYPTO ENDEVR - Explore, Invest, and Prosper in Crypto!
Crypto Endevr
  • Top Stories
    • Latest News
    • Trending
    • Editor’s Picks
  • Media
    • YouTube Videos
      • Interviews
      • Tutorials
      • Market Analysis
    • Podcasts
      • Latest Episodes
      • Featured Podcasts
      • Guest Speakers
  • Insights
    • Tokens Talk
      • Community Discussions
      • Guest Posts
      • Opinion Pieces
    • Artificial Intelligence
      • AI in Blockchain
      • AI Security
      • AI Trading Bots
  • Learn
    • Projects
      • Ethereum
      • Solana
      • SUI
      • Memecoins
    • Educational
      • Beginner Guides
      • Advanced Strategies
      • Glossary Terms
No Result
View All Result
Crypto Endevr
  • Top Stories
    • Latest News
    • Trending
    • Editor’s Picks
  • Media
    • YouTube Videos
      • Interviews
      • Tutorials
      • Market Analysis
    • Podcasts
      • Latest Episodes
      • Featured Podcasts
      • Guest Speakers
  • Insights
    • Tokens Talk
      • Community Discussions
      • Guest Posts
      • Opinion Pieces
    • Artificial Intelligence
      • AI in Blockchain
      • AI Security
      • AI Trading Bots
  • Learn
    • Projects
      • Ethereum
      • Solana
      • SUI
      • Memecoins
    • Educational
      • Beginner Guides
      • Advanced Strategies
      • Glossary Terms
No Result
View All Result
Crypto Endevr
No Result
View All Result

China’s $9 AI Video Tool Kling 2.1 Adds Audio—Can It Beat Google’s $250 Veo 3?

China’s  AI Video Tool Kling 2.1 Adds Audio—Can It Beat Google’s 0 Veo 3?
Share on FacebookShare on Twitter

rewrite this content

In brief

-Chinese AI tool Kling 2.1 now generates videos with synchronized audio, including footsteps, rain, and ambient effects.

  • At just $9 a month, Kling undercuts Google’s Veo 3 by more than 20 times.
  • We tested both tools head-to-head: Kling shines on pricing and flexibility, but Veo still leads in dialogue and sound design quality.

Chinese short video platform Kuaishou has added an audio generation feature to Kling 2.1, its AI-powered video creation tool, enabling users to produce clips with synchronized sound effects such as footsteps, rainfall, and ambient noise.

The feature, which launched quietly last week, is available in Kling’s image-to-video mode, where users upload a still image and the platform animates it with both motion and audio generated by artificial intelligence.

The timing pits Kling against Google’s Veo 3, which launched with integrated audio capabilities from day one.

Early users on X praised Kling’s seamless audio-visual synchronization, with creator Roberto Nickson calling it “one of the most useful models on the market” for producing generative video content.

The feature is free during initial rollout, accessible through Kling’s website and mobile app.

Kling 2.1 one of the most useful models on the market

— Roberto Nickson (@rpnickson) June 12, 2025

Kling 2.1 generates 5- to 10-second clips at up to 1080p resolution, utilizing what the company describes as “3D spatiotemporal attention mechanisms” to synchronize sounds with visuals.

The audio tool currently generates sound effects only—no dialogue or music—and produces something similar to Southeast Asian language audio when text is involved—very tonal, and completely unintelligible. But that by itself isn’t enough to crown Google as the undisputed King of generative video.

We tested Kling 2.1’s new audio features against Google’s Veo 3 to see how the upstart stacks up.

The Price of Creation

The price gap between the two platforms turns out to be massive.

Kling 2.1’s audio feature is only compatible with the standard version, not the higher-end Master edition. However, at current rates, users can generate more than 20 videos on Kling for every single Veo 3 creation.

For example, using Freepik’s credit system, one generation with Google Veo 3 is currently on sale for 4,000 credits (with the normal price being 8,000 credits per video), whereas Kling 2.1 costs 300 credits per video.

Google’s model runs exclusively through its $250-per-month Ultra subscription. Kling is available on its official site, offering some free generations, with subscriptions starting at around $9 per month.

Even with Google’s current promotional pricing, Veo 3 remains ten times more expensive than Kling.

For creators who know video generation involves plenty of trial and error, with failure rates that frustrate even patient users, Kling’s economics make experimentation feasible.

The Premium plan on Kling unlocks 1080p resolution, improving overall video quality while still maintaining the cost advantage.

Audio Capabilities

But you get what you pay for. Veo 3 offers sophisticated sound generation, accurately synthesizing speech and matching complex audio elements to visual scenes.

Its understanding of spatial audio and contextual sounds surpassed Kling’s offerings by a wide margin.

While Kling 2.1 can’t compete, in fairness, it aimed at something different: ambient sounds and background effects—no dialogue, no music. So forget about those viral AI street interviews for now. Attempts to generate audio produce speech gibberish.

Yet for scenes or videos requiring atmospheric audio, its results were serviceable.

2. An off-road SUV drives through rocky, muddy, and wet forest terrain.

You hear the crunch, the splash, the growl of the engine. Felt like a real shoot. pic.twitter.com/S0gVhCAQjk

— ZOYA ✪ (@Zoya_ai) June 12, 2025

The platform’s new ability to add effects to existing silent videos gives it an edge that Veo 3 couldn’t match.

Users can upload finished videos and retrofit them with appropriate soundscapes, a workflow that Google’s model doesn’t support. Weirdly, Veo can create videos, but it can’t edit them.

Besides the ability to create sounds for silent videos, Kling also offers a lip-syncing feature.

Users can upload a photo and a speech or dialogue separately, and the model will make a video in which the subjects interact naturally, as if they were speaking to each other according to the uploaded audio.

【Kling AI(@Kling_ai)】リップシンク update!!📢
動画に登場するキャラクターを選択して、どの人物が話しているかを選択できたり、音声のタイミングを調整するリップシンクの編集機能が追加されました。… pic.twitter.com/brvGUOgLKs

— SEIIIRU😈動画生成AI×AfterEffects (@seiiiiiiiiiiru) June 10, 2025

The twenty-to-one generation ratio meant creators can experiment with different audio approaches on Kling while Veo 3 users have to nail their sound design in fewer attempts.

For hobbyists and those learning generative video, Kling’s approach offers more room for trial and error.

But professional creators needing precise audio-visual synchronization and dialogue will find Veo 3’s sophisticated sound engine worth the premium.

Video Generation Quality

Video quality testing produced unexpected results. In a test scene featuring a woman fleeing from a giant spider, Kling 2.1’s standard version outperformed both Veo 3 and its own Master edition.

The standard model accurately represented the scene dynamics, exhibiting fluid motion and proper directional movement. Veo 3 inexplicably generated the woman running toward the spider instead of away from it.

The Master edition typically produces sharper, crisper visuals, but the standard version demonstrated superior scene comprehension and more fluid movement.

This is odd since higher resolution should always translate to better results, but maybe the problem boiled down to prompt technique issues or simply bad luck in the generation.

That said, Kling 2.1 standard with 1080p generations is a great model that holds its own against Google Veo 3 here.

Platform Workflows and Limitations

Platform limitations shape each tool’s workflow differently. Kling 2.1’s audio feature works only with image-to-video generation, not text-to-video, which remains exclusive to the Master edition without audio support—yes, this is odd, but it is what it is.

The best workaround is using Kolors, Kuaishou’s image generator, to create starting frames before converting them to video with synchronized audio. Kolors produces highly realistic images that serve as excellent starting points for video generation.

However, you might find that models including Reve, MidJourney, Recraft, Flux, and even ChatGPT are easier to prompt.

Veo 3 took the opposite approach, offering only text-to-video generation without any image-to-video option.

This forces users to rely entirely on prompt engineering, with no way to control the starting visual.

Google’s decision also seems particularly odd given that the previous Veo 2 does actually support image-to-video through its separate Flow platform.

The lack of visual control means users have to generate videos blindly, hoping their text prompts will produce the desired starting frames.

Content Moderation Approaches

Content moderation revealed contrasting philosophies. Veo 3 employs aggressive keyword filtering and post-generation checks, blocking content that violates Google’s policies.

The system flags potentially problematic prompts before generation and analyzes completed videos for policy violations.

Kling applies more liberal restrictions, allowing content that Veo will block outright.

However, the model’s training data naturally excluded explicit content—the model generates figures without anatomical details and violence without gore.

So, users can generate certain types of content that bypass keyword filters while still maintaining safety boundaries.

Both platforms refund credits when post-generation censorship blocks a video, but Kling’s lighter touch allows more creative freedom within boundaries.

Conclusions

Veo 3 might still be the king, but Kling 2.1 is definitely close to a populist on a mission to overthrow the monarchy.

Its audio feature is pretty revolutionary when you consider it’s a $9 tool competing against a $250 subscription.

The atmospheric sounds work, the rain sounds like rain, footsteps match the movement most of the time, and you can generate twenty attempts while Veo users carefully craft their single shot.

That retrofit feature, where you add sound to finished videos, is something Google doesn’t offer, and it’s genuinely useful for salvaging silent clips.

Things will look completely different if your primary goal is speech. Kling’s gibberish won’t fool anyone.

For this kind of specific requirement, Google Veo 3 is the obvious and only choice. The king is (almost) dead. Long live the Kling!

Edited by Josh Quittner and Sebastian Sinclair

Generally Intelligent Newsletter

A weekly AI journey narrated by Gen, a generative AI model.

in well organized HTML format with all tags properly closed. Create appropriate headings and subheadings to organize the content. Ensure the rewritten content is approximately 1500 words. Do not include the title and images. please do not add any introductory text in start and any Note in the end explaining about what you have done or how you done it .i am directly publishing the output as article so please only give me rewritten content. At the end of the content, include a “Conclusion” section and a well-formatted “FAQs” section.

cryptoendevr

cryptoendevr

Related Stories

SBF Portrayed in Sold-Out Prison Musical With Luigi Mangione, ‘Diddy’ as Inmates

SBF Portrayed in Sold-Out Prison Musical With Luigi Mangione, ‘Diddy’ as Inmates

June 17, 2025
0

rewrite this content In brief "Luigi: The Musical" imagines three controversial figures as cellmates in Brooklyn prison. Sam Bankman-Fried delivers...

Juventus Deal Vaults Crypto Exchange WhiteBIT’s Token to All-Time High Price

Juventus Deal Vaults Crypto Exchange WhiteBIT’s Token to All-Time High Price

June 16, 2025
0

rewrite this content In brief WhiteBIT's token (WBT) surged to a new all-time high following news of its Juventus sponsorship....

Hyperliquid, Solana Lead Altcoin Rally as Institutions Pour .9B Into Crypto Funds

Hyperliquid, Solana Lead Altcoin Rally as Institutions Pour $1.9B Into Crypto Funds

June 16, 2025
0

rewrite this content In brief Altcoins including Solana, Hyperliquid and XRP posted gains Monday morning. Ethereum also rose, as institutional...

Vietnam Passes Landmark Law Defining Digital Assets, Boosting AI and Chip Sectors

Vietnam Passes Landmark Law Defining Digital Assets, Boosting AI and Chip Sectors

June 16, 2025
0

rewrite this content In brief Vietnam’s National Assembly passed a landmark law regulating digital assets and formally categorizing them into...

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Recommended

Shiba Inu (SHIB) Users Should Stay Alert for This Dangerous Scam

Shiba Inu (SHIB) Users Should Stay Alert for This Dangerous Scam

June 11, 2025
ETH is in a 95-100% Trading Range! 👀

ETH is in a 95-100% Trading Range! 👀

June 11, 2025
World Chain and Circle join forces to strengthen identity-driven finance with native USDC

World Chain and Circle join forces to strengthen identity-driven finance with native USDC

June 11, 2025
Circle Stock Jumps as USDC Stablecoin Expands to Sam Altman’s World Chain

Circle Stock Jumps as USDC Stablecoin Expands to Sam Altman’s World Chain

June 11, 2025
Xapo’s Wences Casares on How Bitcoin Makes a Fairer World

Xapo’s Wences Casares on How Bitcoin Makes a Fairer World

June 11, 2025

Our Newsletter

Join TOKENS for a quick weekly digest of the best in crypto news, projects, posts, and videos for crypto knowledge and wisdom.

CRYPTO ENDEVR

About Us

Crypto Endevr aims to simplify the vast world of cryptocurrencies and blockchain technology for our readers by curating the most relevant and insightful articles from around the web. Whether you’re a seasoned investor or new to the crypto scene, our mission is to deliver a streamlined feed of news and analysis that keeps you informed and ahead of the curve.

Links

Home
Privacy Policy
Terms and Services

Resources

Glossary

Other

About Us
Contact Us

Our Newsletter

Join TOKENS for a quick weekly digest of the best in crypto news, projects, posts, and videos for crypto knowledge and wisdom.

© Copyright 2024. All Right Reserved By Crypto Endevr.

No Result
View All Result
  • Top Stories
    • Latest News
    • Trending
    • Editor’s Picks
  • Media
    • YouTube Videos
      • Interviews
      • Tutorials
      • Market Analysis
    • Podcasts
      • Latest Episodes
      • Featured Podcasts
      • Guest Speakers
  • Insights
    • Tokens Talk
      • Community Discussions
      • Guest Posts
      • Opinion Pieces
    • Artificial Intelligence
      • AI in Blockchain
      • AI Security
      • AI Trading Bots
  • Learn
    • Projects
      • Ethereum
      • Solana
      • SUI
      • Memecoins
    • Educational
      • Beginner Guides
      • Advanced Strategies
      • Glossary Terms

Copyright © 2024. All Right Reserved By Crypto Endevr