Current Google Cloud healthcare GenAI designs start with Cloud Healthcare API and Vertex AI
The staging document emphasized MedLM and Healthcare Data Engine. Those references now need updating. Google’s current Vertex AI model docs mark MedLM as deprecated, so current healthcare GenAI architectures should be framed around Cloud Healthcare API, Vertex AI, Gemini-era model access, and product-specific intended-use boundaries.
That does not make the older architecture lessons useless. It means architects should carry the durable ideas forward: keep structured healthcare data in a healthcare-aware store, retrieve narrowly, and design approval boundaries explicitly instead of assuming a medically branded model removes the need for governance.
Cloud Healthcare API FHIR concepts
Official Google Cloud guidance on managed FHIR stores and healthcare-specific data handling.
Review Cloud Healthcare API FHIRGoogle models on Vertex AI
Official model catalog page stating that MedLM is deprecated and pointing users toward current Vertex AI model options.
Review current Vertex AI model guidanceGoogle Cloud Healthcare API projects, datasets, and data stores
Official Google Cloud page explaining the dataset and store hierarchy behind FHIR, HL7v2, and DICOM isolation.
Review the resource hierarchyVertex AI Search for Healthcare is a grounded retrieval product with explicit boundaries
Google’s healthcare search experience is valuable because it keeps FHIR ingestion and grounded retrieval explicit. But its documentation also states the product boundary unusually clearly: it is not intended for direct diagnosis or treatment use, outputs are draft rather than final, and the service currently runs in the US multi-region for healthcare search workloads.
Important current boundary
As of March 12, 2026, Google documents that Vertex AI Search for Healthcare provides search services only in the US multi-region (us). That matters immediately for Australian residency-sensitive clinical data strategies.
That does not make the product unusable. It means the safe use cases are primarily assistive retrieval, draft summarization, and administrative support where the data-location and intended-use rules are acceptable. If a program needs Australian patient-data residency or direct clinical decision support, this boundary becomes a design veto unless another approved architecture is chosen.
Reviewed GCP healthcare search pattern
Loading diagram...
The interface example is instructive because it visibly keeps retrieved records in view under the generated summary. That is the right mental model for healthcare search: draft synthesis paired with inspectable evidence, not hidden one-shot automation.
Create a healthcare search app
Official Google Cloud documentation covering intended-use restrictions, review warnings, and current `us` multi-region deployment for healthcare search.
Review healthcare search restrictionsGet search results for healthcare data
Google Cloud guidance reiterating intended-use restrictions for healthcare search outputs.
Review the search result workflowGoogle’s reference architecture separates RAG ingestion from serving
The newer Google Cloud RAG reference architecture is more useful for current healthcare design than the older jump starts because it draws a hard line between data ingestion and serving. That maps well to healthcare, where de-identification, chunking, embedding generation, and index refresh should happen on a controlled backend path while the clinical or operations-facing app stays on a narrower serving path with explicit review rules.
Healthcare-adapted GCP RAG topology
Loading diagram...
That split matters in healthcare because the ingestion path often needs broader backend permissions to read source systems, run de-identification, and refresh indexes, while the serving path should stay narrowly focused on query handling, retrieval, safety instructions, and reviewed presentation. If those concerns blur together, the user-facing application can inherit more power and more data exposure than it actually needs.
RAG infrastructure using Vertex AI and Vector Search
Google Cloud reference architecture describing the ingestion and serving split, plus the operational guidance around Cloud Run, Vector Search, and safety controls.
Review the RAG reference architectureGenerative AI architecture guides
Google Cloud overview page that links the RAG reference architectures together with reusable knowledge-base, customer-support, and document-summarization patterns.
Open the GenAI architecture guideUse de-identification and service perimeters to control derived-data exposure
Google Cloud’s de-identification and perimeter tooling is most useful when it supports a disciplined data-derivation strategy: keep an authoritative healthcare dataset under tight controls, create specific derived copies for search or model workflows, and protect those flows with VPC Service Controls and IAM instead of broad project-level trust.
- Create de-identified copies rather than modifying the source dataset in place
- Keep source and destination datasets in the same Google Cloud location
- Use
useRegionalDataProcessingwhen same-location processing matters - Apply VPC Service Controls to reduce exfiltration risk across projects and services
- Treat de-identification as a technical control that still needs legal and clinical review
Google’s healthcare-search guidance adds a second architectural rule that teams often miss: if you place Cloud Healthcare API data in one project and the healthcare search app in another, Google documents that the projects must be in the same VPC Service Controls perimeter whenever you are using perimeter controls. That means separate projects are not the same thing as separate security boundaries.
De-identifying sensitive data with Cloud Healthcare API
Official guidance for dataset, FHIR, and DICOM de-identification, including same-location rules and processing notes.
Review de-identification guidanceOverview of VPC Service Controls
Official Google Cloud overview of service perimeters and exfiltration-risk controls.
Review VPC Service ControlsKnowledge Check
Test your understanding with this quiz. You need to answer all questions correctly to mark this section as complete.