Google has announced that its Gemini model will be available on Google Distributed Cloud (GDC), bringing its advanced AI capabilities to on-premises environments. The public preview is scheduled for release in Q3 2025.
This move aims to allow organizations to leverage Gemini's AI while complying with stringent regulatory, sovereignty, and data residency requirements. Google is collaborating with NVIDIA, using NVIDIA Blackwell systems to achieve this goal, enabling customers to purchase the necessary hardware through Google or other channels.
Sachin Gupta, Vice President and General Manager of Infrastructure and Solutions at Google Cloud, stated in an NVIDIA blog post:
"By bringing our Gemini models on-premises and leveraging NVIDIA Blackwell's breakthrough performance and confidential computing capabilities, we enable enterprises to unlock the full potential of agent AI."
GDC is a fully managed on-premises (launched in 2021) and edge cloud solution, offered in connected and air-gapped configurations. It can scale from a single server to hundreds of racks and provides Infrastructure as a Service (IaaS), security, data, and AI services. GDC is designed to simplify infrastructure management, allowing developers to focus on building AI-driven applications, assistants, and agents.
According to Google, introducing Gemini to GDC will enable organizations to utilize cutting-edge AI technology without compromising their need to keep data on-premises. GDC air-gapped offerings have been authorized for U.S. government classified and top-secret missions, providing a high level of security and compliance.
Keith Townsend noted in a LinkedIn post:
"For security-focused industries like manufacturing, this is a game-changer. Imagine running a complex OT environment where machines generate vast amounts of telemetry data—temperature, vibration patterns, runtime. With distributed Gemini Flash, you can deploy lightweight agents behind your on-premises firewall to analyze this data in real-time."
The Gemini models are designed to deliver groundbreaking AI performance. They can analyze contexts of millions of tokens, process various data formats (text, images, audio, and video), and operate across more than 100 languages. The Gemini API simplifies AI inference by abstracting infrastructure, operating system management, and model lifecycle management. Key features include:
- Retrieval-Augmented Generation (RAG) to personalize and enhance AI model outputs.
- Tools for automated information processing and knowledge extraction.
- Capabilities to create interactive conversational experiences.
- Tools to customize agents for specific industry use cases.
In addition to Gemini, Google also highlighted that Vertex AI is already available on GDC. Vertex AI is a platform that accelerates the development, deployment, and management of AI applications. It offers pre-trained APIs, generative AI building tools, RAG, and built-in embedding APIs with the AlloyDB vector database.
Finally, the company also announced that Google Agentspace Search will be available on GDC (public preview in Q3 2025). Google Agentspace Search aims to provide enterprise knowledge workers with out-of-the-box functionality to access data in a secure, permission-aware manner.