Genomics planning must start from the current service reality
The biggest mistake in Google Cloud life-sciences content is to assume the old execution stack is still current. It is not. Cloud Life Sciences shut down on July 8, 2025, so any new genomics architecture has to start from the supported present-day platform.
That does not mean Google Cloud stopped being relevant for genomics or biomedical research. It means the execution layer has changed. Architects should now think in terms of current compute, storage, analytics, and AI services rather than carrying retired product names into new designs.
That also means inherited runbooks need a careful review. Service accounts, job submission patterns, retry behavior, and observability assumptions that made sense in Cloud Life Sciences should not be copied forward without validating how they map onto the current Batch model.
Google now documents an explicit migration path from Cloud Life Sciences to Batch. That guidance is the right starting point when inherited genomics tooling or old dsub wrappers still mention the retired service name.
Exact date matters
Cloud Life Sciences shut down on July 8, 2025. If a document written after that date still treats it as an active default service, the architecture guidance is stale.
Batch release notes
Official Batch release notes including the July 17, 2023 entry that added migration guidance for former Cloud Life Sciences users.
Review the Batch release notesThe modern GCP genomics stack is pipeline-oriented
Genomics workloads typically look more like research or engineering pipelines than like transactional patient portals. Data lands in cloud storage, workflows run in batch compute, results flow into analytics stores, and governance decides which outputs can cross back into clinical or research systems.
Operationally, Batch is not a drop-in rename of the retired service. Jobs use the Batch service model and regional infrastructure, so IAM, networking, and allowed-location assumptions need to be revalidated when old genomics runbooks are modernized.
Two details matter in production. First, Batch jobs can declare retry behavior so transient task failures do not automatically become pipeline-ending incidents. Second, Google documents running dsub-based genomics pipelines on Batch, which matters for teams modernizing existing bioinformatics workflows rather than rewriting every pipeline definition at once.
Batch-era genomics reference stack
Loading diagram...
Representative Batch retry lifecycle for one genomics task
Loading diagram...
This lifecycle is why Batch status and event visibility matter. A failed shard in a variant-calling workflow should move through an explicit retry or terminal-failure path, not disappear into a black box that leaves operators guessing whether the cohort result is trustworthy.
How genomics workloads usually differ from transactional clinical apps
| Concern | Genomics bias | Clinical app bias |
|---|---|---|
| Execution model | Batch pipelines and long-running jobs | Interactive transactions and low-latency APIs |
| Primary data shape | Large files and derived results | Patient-centric resources and workflow events |
| Typical consumers | Researchers, bioinformaticians, and analytics teams | Clinicians, operations staff, and patient apps |
Automate task retries in Batch
Official Batch guide for retry-policy behavior in long-running task workloads.
Review Batch retry behaviorView job status and events in Batch
Official Batch guide for inspecting job states and status events during execution.
Review Batch job statesRun dsub pipelines using Batch
Official Batch guide for orchestrating dsub-based pipelines on the current execution platform.
Review dsub on BatchAI in life sciences is adjacent to, not identical with, clinical AI
Google for Health models such as TxGemma sit closer to therapeutic and life-sciences research than to patient-facing search alone. That distinction matters because research workloads often care about compound properties, experiment support, and discovery acceleration rather than bedside summarization or operational search.
This does not isolate life sciences from the rest of the healthcare platform. It means architects should be explicit about where research outputs live, how they are governed, and which parts of the workflow can move into clinical environments only after separate validation and review.
Agentic AI framework in life sciences for R&D
Official Google Cloud architecture that places TxGemma and related model steps inside a multi-phase discovery workflow with scientist review.
Review the life-sciences R&D architectureGoogle for Health AI models
Official model catalog that includes TxGemma and other current health-focused model families.
Review the AI model catalogKnowledge Check
Test your understanding with this quiz. You need to answer all questions correctly to mark this section as complete.