AWS HealthOmics

HealthOmics combines workflow execution with omics-specific storage

AWS HealthOmics is not a general-purpose batch service with a genomics label. It is a purpose-built service for bioinformatics workflows and omics data handling, with separate primitives for running workflows and for storing sequence, variant, and annotation data.

That split is the key design boundary. Workflows manage computation, while the omics stores manage durable research artifacts. Architectures that treat those as one thing usually lose provenance, access control clarity, or cost control.

As of March 10, 2026, AWS documents that variant stores and annotation stores are not open to new customers, although existing customers can continue to use them. That changes the teaching arc: new omics programs should think of HealthOmics first as a workflow and sequence-data platform, then make an explicit plan for how long-lived derived data and interpretation assets will be stored and governed.

HealthOmics service surface

100%drag to pan

Loading diagram...

Availability-aware HealthOmics planning

100%drag to pan

Loading diagram...

Availability status is now part of service selection

Older diagrams that treat all HealthOmics storage primitives as equally available can mislead new programs. Check the current customer-eligibility status before locking in the data architecture.

What is AWS HealthOmics?

Official overview covering workflow concepts, sequence stores, variant stores, annotation stores, and related service boundaries.

Read the HealthOmics overview

HealthOmics variant and annotation store availability change

Official HealthOmics page documenting that variant and annotation stores are closed to new customers and explaining migration options.

Review current availability

Private workflows add versioning, run groups, and call caching to execution control

AWS documents private workflows as user-created workflows written in WDL, CWL, or Nextflow. Around those definitions, HealthOmics adds the operational controls that research and production programs actually need: immutable workflow versions, run groups to control compute consumption, and call caching to avoid unnecessary recomputation.

The StartRun API makes those controls concrete. Operators decide which workflow version is being invoked, which IAM role executes it, which run group governs compute, where outputs land, and how much temporary storage is allocated for the run. That turns workflow execution into an explicit operations surface instead of a black-box batch job.

HealthOmics workflow execution lifecycle

100%drag to pan

Loading diagram...

Representative StartRun request

Simplified HealthOmics workflow submission showing the control fields that matter for execution governance and output traceability.

JSON Message

Expand:

{

"workflowId": "wf-1234567890",

"workflowType": "PRIVATE",

"name": "somatic-panel-20260310",

"roleArn": "arn:aws:iam::123456789012:role/HealthOmicsRunRole",

"inputUri": "s3://omics-input/somatic-panel-20260310/",

"outputUri": "s3://omics-output/somatic-panel-20260310/",

"runGroupId": "rg-production",

"storageCapacity": 2400

}

Annotations (5)

Click on an annotation to highlight it in the JSON

Workflow control is a governance feature, not just a developer convenience

Versions, run groups, and call caching help teams answer which workflow ran, under which controls, and why a rerun consumed or avoided compute.

Private workflows in AWS HealthOmics

Official guide describing WDL, CWL, and Nextflow support plus run groups, workflow versions, and call caching.

Review workflow controls

StartRun API reference

Official API reference for submitting HealthOmics workflow runs, including runtime storage, IAM role, input, and output controls.

Read the run API

Store boundaries keep provenance and analytics honest

HealthOmics separates sequence storage from variant and annotation analysis for a reason. Sequence stores keep genomics reads and related files. Variant stores hold called variants for analytics. Annotation stores add contextual knowledge that is imported against the same coordinate system as the variant data.

What each HealthOmics store is for

Store	Availability for new customers	Primary artifact	Typical use
Sequence store	Available	Read sets such as FASTQ, BAM, or CRAM-based genomics sequence data	Durable sequence ingestion and controlled access to source omics data
Variant store	Not open to new customers as of March 10, 2026	Called variants and related analysis records	Query and analytics over detected genomic variation
Annotation store	Not open to new customers as of March 10, 2026	Contextual annotation aligned to the same coordinate system	Interpretation support and enrichment of variant analysis

Do not flatten omics outputs into one undifferentiated research bucket

Omics programs need to preserve which assets are raw sequence inputs, which are derived variants, and which are reference annotations. That distinction drives reproducibility and access design.

HealthOmics storage concepts

Official documentation for sequence stores, variant stores, and annotation stores within the HealthOmics service model.

Review storage concepts

Knowledge Check

Test your understanding with this quiz. You need to answer all questions correctly to mark this section as complete.

Quiz Progress

Question 1 of 5

Which combination best describes the main HealthOmics surface area?

← PreviousAWS HealthImaging Next →Clinical Documentation & Point-of-Care AI on AWS

Store

Availability for new customers

Primary artifact

Typical use

Sequence store

Available

Read sets such as FASTQ, BAM, or CRAM-based genomics sequence data

Durable sequence ingestion and controlled access to source omics data

Variant store

Not open to new customers as of March 10, 2026

Called variants and related analysis records

Query and analytics over detected genomic variation

Annotation store

Not open to new customers as of March 10, 2026

Contextual annotation aligned to the same coordinate system

Interpretation support and enrichment of variant analysis

Knowledge Check

Test your understanding with this quiz. You need to answer all questions correctly to mark this section as complete.

Quiz Progress

Question 1 of 5

AWS HealthOmics

Knowledge Tree

HealthOmics combines workflow execution with omics-specific storage

HealthOmics service surface

Availability-aware HealthOmics planning

Availability status is now part of service selection

What is AWS HealthOmics?

HealthOmics variant and annotation store availability change

Private workflows add versioning, run groups, and call caching to execution control

HealthOmics workflow execution lifecycle

Representative StartRun request

Workflow control is a governance feature, not just a developer convenience

Private workflows in AWS HealthOmics

StartRun API reference

Store boundaries keep provenance and analytics honest

Do not flatten omics outputs into one undifferentiated research bucket

HealthOmics storage concepts

Knowledge Check

Which combination best describes the main HealthOmics surface area?

AWS HealthOmics - Knowledge Tree

AWS HealthOmics

Knowledge Tree

HealthOmics combines workflow execution with omics-specific storage

HealthOmics service surface

Availability-aware HealthOmics planning

Availability status is now part of service selection

What is AWS HealthOmics?

HealthOmics variant and annotation store availability change

Private workflows add versioning, run groups, and call caching to execution control

HealthOmics workflow execution lifecycle

Representative StartRun request

Workflow control is a governance feature, not just a developer convenience

Private workflows in AWS HealthOmics

StartRun API reference

Store boundaries keep provenance and analytics honest

Do not flatten omics outputs into one undifferentiated research bucket

HealthOmics storage concepts

Knowledge Check

Which combination best describes the main HealthOmics surface area?

AWS HealthOmics - Knowledge Tree