HealthOmics combines workflow execution with omics-specific storage
AWS HealthOmics is not a general-purpose batch service with a genomics label. It is a purpose-built service for bioinformatics workflows and omics data handling, with separate primitives for running workflows and for storing sequence, variant, and annotation data.
That split is the key design boundary. Workflows manage computation, while the omics stores manage durable research artifacts. Architectures that treat those as one thing usually lose provenance, access control clarity, or cost control.
As of March 10, 2026, AWS documents that variant stores and annotation stores are not open to new customers, although existing customers can continue to use them. That changes the teaching arc: new omics programs should think of HealthOmics first as a workflow and sequence-data platform, then make an explicit plan for how long-lived derived data and interpretation assets will be stored and governed.
HealthOmics service surface
Loading diagram...
Availability-aware HealthOmics planning
Loading diagram...
Availability status is now part of service selection
Older diagrams that treat all HealthOmics storage primitives as equally available can mislead new programs. Check the current customer-eligibility status before locking in the data architecture.
What is AWS HealthOmics?
Official overview covering workflow concepts, sequence stores, variant stores, annotation stores, and related service boundaries.
Read the HealthOmics overviewHealthOmics variant and annotation store availability change
Official HealthOmics page documenting that variant and annotation stores are closed to new customers and explaining migration options.
Review current availabilityPrivate workflows add versioning, run groups, and call caching to execution control
AWS documents private workflows as user-created workflows written in WDL, CWL, or Nextflow. Around those definitions, HealthOmics adds the operational controls that research and production programs actually need: immutable workflow versions, run groups to control compute consumption, and call caching to avoid unnecessary recomputation.
The StartRun API makes those controls concrete. Operators decide which workflow version is being invoked, which IAM role executes it, which run group governs compute, where outputs land, and how much temporary storage is allocated for the run. That turns workflow execution into an explicit operations surface instead of a black-box batch job.
HealthOmics workflow execution lifecycle
Loading diagram...
Representative StartRun request
Simplified HealthOmics workflow submission showing the control fields that matter for execution governance and output traceability.
Click on an annotation to highlight it in the JSON
Workflow control is a governance feature, not just a developer convenience
Versions, run groups, and call caching help teams answer which workflow ran, under which controls, and why a rerun consumed or avoided compute.
Private workflows in AWS HealthOmics
Official guide describing WDL, CWL, and Nextflow support plus run groups, workflow versions, and call caching.
Review workflow controlsStartRun API reference
Official API reference for submitting HealthOmics workflow runs, including runtime storage, IAM role, input, and output controls.
Read the run APIStore boundaries keep provenance and analytics honest
HealthOmics separates sequence storage from variant and annotation analysis for a reason. Sequence stores keep genomics reads and related files. Variant stores hold called variants for analytics. Annotation stores add contextual knowledge that is imported against the same coordinate system as the variant data.
What each HealthOmics store is for
| Store | Availability for new customers | Primary artifact | Typical use |
|---|---|---|---|
| Sequence store | Available | Read sets such as FASTQ, BAM, or CRAM-based genomics sequence data | Durable sequence ingestion and controlled access to source omics data |
| Variant store | Not open to new customers as of March 10, 2026 | Called variants and related analysis records | Query and analytics over detected genomic variation |
| Annotation store | Not open to new customers as of March 10, 2026 | Contextual annotation aligned to the same coordinate system | Interpretation support and enrichment of variant analysis |
Do not flatten omics outputs into one undifferentiated research bucket
Omics programs need to preserve which assets are raw sequence inputs, which are derived variants, and which are reference annotations. That distinction drives reproducibility and access design.
HealthOmics storage concepts
Official documentation for sequence stores, variant stores, and annotation stores within the HealthOmics service model.
Review storage conceptsKnowledge Check
Test your understanding with this quiz. You need to answer all questions correctly to mark this section as complete.