- Google Developers Blog

2025年7月17日 / Gemini

Build with Veo 3, now available in the Gemini API

Veo 3, Google’s latest AI video generation model, is now available in paid preview via the Gemini API and Google AI Studio. Unveiled at Google I/O 2025, Veo 3 can generate both video and synchronized audio, including dialogue, background sounds, and even animal noises. This model delivers realistic visuals, natural lighting, and physics, with accurate lip syncing and sound that matches on-screen action.
2025年7月10日 / Gemini

「GenAI Processors」を発表: 強力で柔軟な Gemini アプリケーションをビルド

GenAI Processors は、Google DeepMind の新しいオープンソース Python ライブラリです。入力処理からモデル呼び出しと出力処理までのすべてのステップに一貫した「Processor」インターフェースを提供することで、シームレスなチェーンと同時実行を実現します。特にマルチモーダル入力を処理し、リアルタイムの応答性を必要とする AI アプリケーションの開発を簡素化します。
2025年6月24日 / Gemini

Gemini API と Google AI Studio で Imagen 4 を公開

Google の高度なテキスト画像変換モデル Imagen 4 が、Gemini API と Google AI Studio で有料プレビュー版として公開されました。このモデルでは、画像内テキスト生成の品質が大幅に向上しています。Imagen 4 ファミリーには、汎用タスク向けの Imagen 4、細かいところまでプロンプトに従うことができる Imagen 4 Ultra があり、すべての生成される画像に目に見えない SynthID 透かしが含まれています。
2025年6月24日 / Gemini

ロボティクスと身体性知能（Embodied Intelligence）を実現する Gemini 2.5

コーディング、推論、空間理解を含むマルチモーダル機能が強化された Gemini 2.5 Pro および Flash が、ロボティクスに変革を起こします。この 2 つのモデルは、安全性の向上とコミュニティアプリケーションに重点を置いており、場面の意味の理解、ロボット制御コードの生成、Live API によるインタラクティブアプリケーションの開発に役立てることができます。

検索

コンテンツタイプ

プロダクト

テクノロジー

Build with Veo 3, now available in the Gemini API

「GenAI Processors」を発表: 強力で柔軟な Gemini アプリケーションをビルド

Gemini API と Google AI Studio で Imagen 4 を公開

ロボティクスと身体性知能（Embodied Intelligence）を実現する Gemini 2.5

コンテンツタイプ

プロダクト

テクノロジー

コンテンツ タイプ

プロダクト

テクノロジー

Build with Veo 3, now available in the Gemini API

「GenAI Processors」を発表: 強力で柔軟な Gemini アプリケーションをビルド

Gemini API と Google AI Studio で Imagen 4 を公開

ロボティクスと身体性知能（Embodied Intelligence）を実現する Gemini 2.5

コンテンツ タイプ

プロダクト

テクノロジー

コンテンツタイプ

コンテンツタイプ