- Google Developers Blog

July 17, 2025 / Gemini

Build with Veo 3, now available in the Gemini API

Veo 3, Google’s latest AI video generation model, is now available in paid preview via the Gemini API and Google AI Studio. Unveiled at Google I/O 2025, Veo 3 can generate both video and synchronized audio, including dialogue, background sounds, and even animal noises. This model delivers realistic visuals, natural lighting, and physics, with accurate lip syncing and sound that matches on-screen action.

July 16, 2025 / AI

Unlock Gemini’s reasoning: A step-by-step guide to logprobs on Vertex AI

The `logprobs` feature, now officially available in the Gemini API on Vertex AI, provides insight into the model's decision-making by showing probability scores for chosen and alternative tokens. This step-by-step guide walks you through how to enable and interpret the feature and apply it to use cases such as confident classification, dynamic autocomplete, and quantitative RAG evaluation.
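To make the "confident classification" use case concrete, here is a minimal sketch in plain Python. It does not call the Gemini API; it assumes you have already extracted a mapping of candidate tokens to log probabilities from a response. The function names, the `0.8` threshold, and the sample values are all hypothetical and chosen only for illustration.

```python
import math

def confidence_from_logprobs(candidates):
    """Convert a {token: log probability} mapping into {token: probability}."""
    return {token: math.exp(logprob) for token, logprob in candidates.items()}

def classify_with_threshold(candidates, threshold=0.8):
    """Return (top_token, probability) if the top candidate is confident enough.

    If the highest probability is below `threshold`, return (None, probability)
    so the caller can fall back to a human review or a second model call.
    """
    probabilities = confidence_from_logprobs(candidates)
    token, probability = max(probabilities.items(), key=lambda item: item[1])
    if probability >= threshold:
        return token, probability
    return None, probability

# Made-up log probabilities for a sentiment label (illustrative values only).
example = {"positive": -0.05, "negative": -3.2, "neutral": -4.6}
label, probability = classify_with_threshold(example)
```

A log probability close to zero corresponds to a probability close to one, so thresholding on `math.exp(logprob)` is one simple way to decide when to accept a label automatically.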

July 16, 2025 / Cloud

Stanford’s Marin foundation model: The first fully open model developed using JAX

The Marin project aims to expand the definition of 'open' in AI to cover the entire scientific process, not just the model itself, by making the complete development journey accessible and reproducible. Powered by the JAX framework and its Levanter tool, the effort lets others scrutinize, trust, and build upon foundation models, fostering a more transparent future for AI research.

July 16, 2025 / Gemini

Simplify your Agent "vibe building" flow with ADK and Gemini CLI

The updated Agent Development Kit (ADK) gives the Gemini CLI a deep, cost-effective understanding of the ADK framework, which simplifies and accelerates building AI agents. Developers can quickly ideate, generate, test, and improve functional agents through conversational prompts, eliminating friction and staying in a productive "flow" state.


July 14, 2025 / Gemini

Gemini Embedding now generally available in the Gemini API

The Gemini Embedding text model is now generally available in the Gemini API and Vertex AI. This versatile model has consistently ranked #1 on the MTEB Multilingual leaderboard since its experimental launch in March, supports over 100 languages, accepts inputs of up to 2,048 tokens, and is priced at $0.15 per 1M input tokens.
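The pricing and input limit quoted above lend themselves to a quick back-of-the-envelope check. The sketch below is plain Python arithmetic, not an API call; the two constants come from the announcement, while the function name, the return shape, and the sample workload are hypothetical. It assumes you already know the token count of each input, and it makes no claim about how over-length inputs are actually handled or billed.

```python
PRICE_PER_MILLION_INPUT_TOKENS_USD = 0.15  # price quoted in the announcement
MAX_INPUT_TOKENS = 2048                    # input limit quoted in the announcement

def estimate_embedding_cost(token_counts):
    """Estimate the USD cost of embedding inputs with the given token counts.

    Returns (estimated_cost_usd, indices_over_limit). Inputs longer than
    MAX_INPUT_TOKENS are only flagged here; how they are treated in practice
    is not covered by this sketch.
    """
    total_tokens = sum(token_counts)
    cost = total_tokens * PRICE_PER_MILLION_INPUT_TOKENS_USD / 1_000_000
    over_limit = [i for i, n in enumerate(token_counts) if n > MAX_INPUT_TOKENS]
    return cost, over_limit

# Hypothetical workload: 10,000 documents of 500 tokens each (5M tokens total).
cost, over_limit = estimate_embedding_cost([500] * 10_000)
```

For this hypothetical workload the estimate comes to roughly $0.75, and no input exceeds the limit.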

July 10, 2025 / Gemini

Announcing GenAI Processors: Build powerful and flexible Gemini applications

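GenAI Processors is a new open-source Python library from Google DeepMind. It provides a consistent "Processor" interface for every step from input handling through model calls to output processing, enabling seamless chaining and concurrent execution. The goal is to simplify AI application development, particularly for applications that handle multimodal input and need real-time responsiveness.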
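The idea of a single interface that lets steps be chained can be illustrated with a small sketch. This is not the GenAI Processors API: it uses only Python's standard `asyncio`, and every name below (`source`, `normalize`, `fake_model`, `chain`) is hypothetical. Each step is an async generator that consumes a stream of parts and yields a stream of parts, so steps can be composed and parts flow through as they become available.

```python
import asyncio
from typing import AsyncIterator, Callable, List

# Hypothetical "processor" shape: a stream of text parts in, a stream out.
Processor = Callable[[AsyncIterator[str]], AsyncIterator[str]]

async def source(items: List[str]) -> AsyncIterator[str]:
    """Turn a list into an async stream of parts."""
    for item in items:
        yield item

async def normalize(stream: AsyncIterator[str]) -> AsyncIterator[str]:
    """Input-processing step: trim whitespace and lowercase each part."""
    async for part in stream:
        yield part.strip().lower()

async def fake_model(stream: AsyncIterator[str]) -> AsyncIterator[str]:
    """Stand-in for a model call; a real step would await a network request."""
    async for part in stream:
        await asyncio.sleep(0)  # yield control, as a real I/O call would
        yield f"echo: {part}"

def chain(*processors: Processor) -> Processor:
    """Compose processors left to right into a single processor."""
    def chained(stream: AsyncIterator[str]) -> AsyncIterator[str]:
        for processor in processors:
            stream = processor(stream)
        return stream
    return chained

async def run(inputs: List[str]) -> List[str]:
    pipeline = chain(normalize, fake_model)
    return [part async for part in pipeline(source(inputs))]

result = asyncio.run(run(["  Hello ", "WORLD"]))
```

Because every step shares the same stream-in, stream-out shape, adding an output-processing step is a matter of passing one more function to `chain`. The sketch covers chaining only; running steps concurrently would need additional machinery.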


July 10, 2025 / Cloud

Advancing agentic AI development with Firebase Studio

Updates in Firebase Studio include new Agent modes, foundational support for the Model Context Protocol (MCP), and Gemini CLI integration. Together they are designed to redefine AI-assisted development, letting developers create full-stack applications from a single prompt and integrate powerful AI capabilities directly into their workflow.