Google Unveils Veo 3.1 to Rival OpenAI’s Sora 2 with Realistic AI Videos and Sound

Cointribune2025/10/16 22:51

By:Cointribune

Summarize this article with:

ChatGPT Perplexity Grok

The war of artificial intelligences reaches a peak. With every announcement, a new model emerges, more daring, more immersive, more… expensive. In this battle of innovations, Google did not want to remain a spectator. By releasing Veo 3.1, it unveils a video AI armed with sounds, dialogues, and new editing capabilities. Facing the viral popularity of Sora 2, the Mountain View firm plays another card: that of narrative precision and creative control.

Google Unveils Veo 3.1 to Rival OpenAI’s Sora 2 with Realistic AI Videos and Sound image 0

Google Unveils Veo 3.1 to Rival OpenAI’s Sora 2 with Realistic AI Videos and Sound image 1

In brief

Veo 3.1 integrates audio, dialogues and sound effects to enrich the AI-generated scenes.
The tool targets serious creators, with editing options and professional formats.
Three key modules: image composition, creative transitions, and smooth clip extension.
Google’s AI favors visual coherence, sometimes at the expense of action speed.

Technological Duel: Google attacks the queens of AI video

When OpenAI, valued at $500 billion without IPO , launched Sora 2 on September 30, the success was immediate. The app was downloaded more than one million times in only five days, climbing to the top of the App Store. Its approach? A “TikTok-ized” interface, designed for sharing and remixing.

Google did not choose this path. With Veo 3.1 , the goal is clear: to address creators, not influencers. The model allows generating videos with 1080p resolution, in horizontal or vertical format, integrating sound atmosphere, synchronized voices, and realistic effects. Accessible via Flow, Vertex AI and Gemini API, it offers two plans: a fast version at $0.15/second, and a standard one at $0.40/second.

The firm emphasizes the audio capabilities, now present in all modules. It promises an unprecedented rendering: the lip synchronization of Veo 3.1 surpasses that of all other models.

Where Sora favors visual dynamism, Veo chooses coherence. Movements are slower, but elements remain stable. It is the price of precision. A positioning that contrasts with the ambitions of Meta or Luma Labs, who focus more on speed and the wow effect.

Stories that speak: Google’s AI wants to tell

One of Veo 3.1’s major bets is narrative immersion. The addition of sound allows Google to take a step forward: no longer just illustrating, but telling with images and voices. Three features stand out:

Ingredients to Video: you combine several reference images, and the AI generates a scene with objects and characters;
Frames to Video: you provide a starting image and an ending one, and the AI produces a coherent transition;
Extend: the AI extends a clip by generating the continuation from the last second.

The tool also allows adding or removing elements, taking shadows and lights into account. This level of detail is the strength of the approach: a film studio within an artificial intelligence interface.

But not everything is perfect. When instructions stray too far from visual logic, the AI goes off track. Some scenes jump from one shot to another, lose characters or completely change atmosphere. It remains a technology under development.

As Google explained in its official blog:

We’re also introducing Veo 3.1, which brings richer audio, more narrative control, and enhanced realism that captures true-to-life textures.

Veo 3.1 does not want to entertain: it wants to move. And this is probably where it differs radically from its competitors.

Demanding UX, stunning result: when artificial intelligence becomes a creative tool

The user experience provided by Veo 3.1 is not that of a social network. It is not a product to consume, but a tool to master. Creators must learn to speak the language of AI. A poorly written prompt or one too far from reference images can produce an incoherent result.

Some tips are already circulating among users. For example, going through Seedream to generate a faithful initial image before importing it into Veo. Or using an audio-aware construction, explicitly mentioning the desired sounds in prompts.

In this regard, here are some concrete facts:

Veo has generated more than 275 million videos since the launch of Flow;
Three creative modules are available: Ingredients, Frames, Extend;
The usage cost is up to 2 times lower than that of Sora 2 Pro;
Videos can last up to one minute, with integrated sound;
Only three models handle spoken voices: Sora, Grok, and now Veo.

The tool is not easily tamed. But once understood, it delivers videos of rare realism, with accurate intonations and credible characters. It just requires patience, skill… and some credits.

Google no longer hides its ambition to dominate generative AI. Veo 3.1 shows that the firm does not just want to follow. It wants to impose its tempo. And to confirm this thirst for achievement, one of its robots has just solved a math problem considered impossible . The message is clear: the AI giant is just starting to speak.

Disclaimer: The content of this article solely reflects the author's opinion and does not represent the platform in any capacity. This article is not intended to serve as a reference for making investment decisions.

PoolX: Earn new token airdrops

Lock your assets and earn 10%+ APR

Lock now!

- Bitcoin fell 32% below $90,000 in 2025, raising bear market fears driven by Fed policy shifts, regulatory uncertainty, and institutional exits. - Fed's 0.25% rate cut and delayed inflation data created volatility, while the GENIUS Act's reserve rules may reduce Bitcoin's appeal unless rates drop further. - SEC's Project Crypto and Senate bills increased regulatory clarity risks, while $3.79B ETF outflows triggered self-reinforcing price declines. - 2026 outcomes depend on Fed clarity, regulatory resoluti

Bitget-RWA•2025/12/08 16:28

Bitcoin Experiences Sharp Decline: Underlying Reasons and Potential Impact for 2026

QT is Over: What It Means for Crypto Markets

DailyCoin•2025/12/08 16:12

Exploring the Challenges and Potential in Financial Markets After a Crisis

- IMF's 2025 report highlights global financial risks from stretched asset valuations, sovereign bond pressures, and interconnected market vulnerabilities. - Emerging markets face contagion risks via currency mismatches and narrow investor bases, exemplified by debt challenges in Turkey and Argentina. - Strategic asset allocation shifts recommend value equities, short-duration bonds, and alternatives like commodities to hedge volatility and inflation. - Fiscal sustainability and regulatory vigilance are cr

Bitget-RWA•2025/12/08 16:08

PENGU Price Forecast: What Factors Are Fueling the Latest Spike in Attention?

- PENGU's 2025 price surge reflects crypto's sentiment-driven volatility, fueled by retail investor enthusiasm and social media hype. - Market analysis shows investor sentiment Granger-causes crypto returns, with FOMO and meme-driven demand overriding traditional metrics. - Global crypto adoption in UAE/Saudi Arabia and U.S. regulatory shifts like the GENIUS Act could stabilize PENGU's speculative trajectory. - Experts caution PENGU's momentum remains fragile, requiring institutional adoption and macroecon

Bitget-RWA•2025/12/08 16:08

Google Unveils Veo 3.1 to Rival OpenAI’s Sora 2 with Realistic AI Videos and Sound

In brief

Technological Duel: Google attacks the queens of AI video

Stories that speak: Google’s AI wants to tell

Demanding UX, stunning result: when artificial intelligence becomes a creative tool

You may also like

Trending news

Crypto prices