Google Unveils Veo 3.1 to Rival OpenAI’s Sora 2 with Realistic AI Videos and Sound
The war of artificial intelligences reaches a peak. With every announcement, a new model emerges, more daring, more immersive, more… expensive. In this battle of innovations, Google did not want to remain a spectator. By releasing Veo 3.1, it unveils a video AI armed with sounds, dialogues, and new editing capabilities. Facing the viral popularity of Sora 2, the Mountain View firm plays another card: that of narrative precision and creative control.
In brief
- Veo 3.1 integrates audio, dialogues and sound effects to enrich the AI-generated scenes.
- The tool targets serious creators, with editing options and professional formats.
- Three key modules: image composition, creative transitions, and smooth clip extension.
- Google’s AI favors visual coherence, sometimes at the expense of action speed.
Technological Duel: Google attacks the queens of AI video
When OpenAI, valued at $500 billion without IPO , launched Sora 2 on September 30, the success was immediate. The app was downloaded more than one million times in only five days, climbing to the top of the App Store. Its approach? A “TikTok-ized” interface, designed for sharing and remixing.
Google did not choose this path. With Veo 3.1 , the goal is clear: to address creators, not influencers. The model allows generating videos with 1080p resolution, in horizontal or vertical format, integrating sound atmosphere, synchronized voices, and realistic effects. Accessible via Flow, Vertex AI and Gemini API, it offers two plans: a fast version at $0.15/second, and a standard one at $0.40/second.
The firm emphasizes the audio capabilities, now present in all modules. It promises an unprecedented rendering: the lip synchronization of Veo 3.1 surpasses that of all other models.
Where Sora favors visual dynamism, Veo chooses coherence. Movements are slower, but elements remain stable. It is the price of precision. A positioning that contrasts with the ambitions of Meta or Luma Labs, who focus more on speed and the wow effect.
Stories that speak: Google’s AI wants to tell
One of Veo 3.1’s major bets is narrative immersion. The addition of sound allows Google to take a step forward: no longer just illustrating, but telling with images and voices. Three features stand out:
- Ingredients to Video: you combine several reference images, and the AI generates a scene with objects and characters;
- Frames to Video: you provide a starting image and an ending one, and the AI produces a coherent transition;
- Extend: the AI extends a clip by generating the continuation from the last second.
The tool also allows adding or removing elements, taking shadows and lights into account. This level of detail is the strength of the approach: a film studio within an artificial intelligence interface.
But not everything is perfect. When instructions stray too far from visual logic, the AI goes off track. Some scenes jump from one shot to another, lose characters or completely change atmosphere. It remains a technology under development.
As Google explained in its official blog:
We’re also introducing Veo 3.1, which brings richer audio, more narrative control, and enhanced realism that captures true-to-life textures.
Veo 3.1 does not want to entertain: it wants to move. And this is probably where it differs radically from its competitors.
Demanding UX, stunning result: when artificial intelligence becomes a creative tool
The user experience provided by Veo 3.1 is not that of a social network. It is not a product to consume, but a tool to master. Creators must learn to speak the language of AI. A poorly written prompt or one too far from reference images can produce an incoherent result.
Some tips are already circulating among users. For example, going through Seedream to generate a faithful initial image before importing it into Veo. Or using an audio-aware construction, explicitly mentioning the desired sounds in prompts.
In this regard, here are some concrete facts:
- Veo has generated more than 275 million videos since the launch of Flow;
- Three creative modules are available: Ingredients, Frames, Extend;
- The usage cost is up to 2 times lower than that of Sora 2 Pro;
- Videos can last up to one minute, with integrated sound;
- Only three models handle spoken voices: Sora, Grok, and now Veo.
The tool is not easily tamed. But once understood, it delivers videos of rare realism, with accurate intonations and credible characters. It just requires patience, skill… and some credits.
Google no longer hides its ambition to dominate generative AI. Veo 3.1 shows that the firm does not just want to follow. It wants to impose its tempo. And to confirm this thirst for achievement, one of its robots has just solved a math problem considered impossible . The message is clear: the AI giant is just starting to speak.
Disclaimer: The content of this article solely reflects the author's opinion and does not represent the platform in any capacity. This article is not intended to serve as a reference for making investment decisions.
You may also like
Solana News Update: Fidelity Supports Solana’s Rapid Expansion Despite Ongoing Stability Issues
- Fidelity expands crypto offerings by adding Solana (SOL) to its platforms, enabling U.S. investors to trade the high-speed blockchain asset. - Solana's 60,000 TPS and low fees position it as a DeFi and RWA hub, with recent Hong Kong ETF approval and institutional staking partnerships boosting adoption. - Despite network outages and hardware challenges, Solana gains traction as the third crypto ETF-approved asset in Asia, with JPMorgan forecasting $1.5B in U.S. inflows. - Analysts remain divided on SOL's

CD Yields Reach 4.35% APY—However, Imminent Fed Rate Cuts Urge Savers to Move Quickly
- CDs offer 4.35% APY as of Oct. 24, 2025, but Fed rate cuts risk further declines, urging savers to act quickly. - Smaller banks and online institutions outperform major banks like Chase, offering higher rates due to lower overhead costs. - Fed's 2024-2025 rate cuts reduced federal funds to 4.00%-4.25%, driving down CD yields from 2024 peaks. - Investors balance term lengths and liquidity via strategies like CD laddering while monitoring Fed policy shifts. - Historical context shows current rates exceed e

Hyperliquid News Today: The Reason Behind Infinex Re-Locking 20% of Tokens: Fostering Confidence with a Lasting Commitment
- Infinex relocked 20% of tokens post-TGE for 12 months, with linear vesting to align team incentives with community interests. - Hyperliquid and Lighter drove $1.3T+ October trading volume, contrasting Infinex's liquidity control with aggressive token treasury expansion. - Analysts view token relocking as bullish signals, while Hyperliquid's HYPE token faces volatility from monthly unlocks and buybacks. - DeFi trends show growing emphasis on transparent tokenomics, with lock-up agreements becoming standar

SUI News Today: 21Shares Launches Sui Yield ETF on Nasdaq, Introducing Regulated Staking
- 21Shares updates its Sui ETF filing with SEC, adding staking functionality and confirming a Nasdaq listing. - The ETF partners with Coinbase for staking and BNY Mellon for custody, generating passive yields via Sui's proof-of-stake mechanism. - SUI's price rose 3% post-filing as the product becomes one of the first U.S. ETFs to combine regulated staking rewards with liquidity access. - The filing highlights 5-7% annual staking yields but omits expense ratios, with SEC approval pending amid heightened cry

