HL7 v2.x Message Syntax | Delimiters, Encoding, Parsing

Encoding Characters and Delimiters

HL7 v2.x messages use a delimiter hierarchy defined in MSH. Correct handling of these characters is required for reliable parsing and interoperability.

Delimiter hierarchy

Character	Name	MSH position	Purpose
\|	Field separator	MSH-1	Separates fields within each segment
^	Component separator	MSH-2[0]	Splits fields into components
~	Repetition separator	MSH-2[1]	Supports repeated values in one field
\	Escape character	MSH-2[2]	Escapes delimiter characters in literal data
&	Subcomponent separator	MSH-2[3]	Further splits component values

Visual hierarchy: Segment -> Field (|) -> Component (^) -> Subcomponent (&). Repetitions (~) can appear at the field level.

Official HL7 v2 message construction flowchart showing the loop through segments, fields, repetitions, and the final segment terminator. — Official HL7 v2 message-construction flowchart from the Control chapter, showing how segment, field, and repetition delimiters are applied when a message is serialized.

Source: HL7 v2.8.2 Control chapter message construction rulesLast verified: 2026-03-11

This flowchart explains why HL7 syntax errors can cascade so badly: delimiters are not cosmetic separators added at the end. They are the actual construction rules for the message, so one malformed field or missing terminator shifts the parse boundary for everything that follows.

Delimiter Usage Example

Message showing delimiter usage, components, repetitions, and encoded structure from MSH.

HL7 Message ExampleHover or click fields for details

MSH|^~\&|SENDING_APP|SENDING_FACILITY|RECEIVING_APP|RECEIVING_FACILITY|20240301120000|—|ADT^A08|123456789|P|2.5

EVN|A08|20240301120000

PID|1|—|12345678^^^MRN^MR|—|DOE^JOHN^MICHAEL|—|19800115|M|—|—|123 MAIN ST^^ANYTOWN^ST^12345^USA|—|(555)123-4567|—|—|S|—|12345678|—|—|—|—|—|—|—|—|—|—|—|—|—|—|—|—|—|—|—|—

PV1|1|I|ICU^101^A|—|—|—|—|—|—|—|—|—|—|—|—|—|—|—|12345678

Annotations (4)

Click on a segment above to highlight its fields

Hover or click any field value for detailed explanation

Escape Sequences Reference

Common HL7 escapes

Escape	Represents
\F\	Field separator \|
\S\	Component separator ^
\T\	Subcomponent separator &
\R\	Repetition separator ~
\E\	Escape character \
\.br\	Line break
\Xdd\	Hexadecimal data
\Zxx\	Custom escape sequence

Example: 123\F\456 renders as 123|456. Unescaped delimiters in payload data are a common parsing failure source.

HL7 v2+ Chapter 2 (Control)

Official HL7 v2+ specification for encoding characters and delimiters

HL7 v2+ Control

NIST IGAMT Validation Tool

Free HL7 conformance profile validation

NIST IGAMT

Z-Segments

Z-segments are custom extensions outside core HL7 standards. They are powerful for local requirements but increase interface coupling and mapping complexity.

Naming Convention

All Z-segments begin with Z and include two additional characters (for example ZPD, ZPV, ZIN).
They are not part of standard HL7 segment definitions.
Each implementation should document Z-segment structure in interface specifications.
Trading partners should version Z-segment contracts explicitly.

Common Z-segment examples

Segment	Domain	Typical fields
ZPD	Patient demographics extension	Language, religion, VIP status, interpreter requirements
ZPV	Visit extension	Attending detail, consult requests, accommodation metadata
ZIN	Insurance extension	Authorization, referral, payer-specific codes
ZEF	Encounter financial extension	Charge master, revenue code, DRG-related data

Z-Segment Example

Message with standard ORU structure extended by ZPD and ZPV custom segments.

HL7 Message ExampleHover or click fields for details

MSH|^~\&|EPIC|HOSPITAL|LAB|HOSPITAL|20240301120000|—|ORU^R01|123456789|P|2.5

PID|1|—|12345678^^^MRN^MR|—|DOE^JOHN|—|19800115|M

OBR|1|LAB123|LAB123|CBC^COMPLETE BLOOD COUNT|—|—|20240301110000

OBX|1|NM|WBC^WHITE BLOOD COUNT|—|7.5|10*3/uL|4.0-11.0|N|—|—|F

ZPD|1|ENGLISH|PROTESTANT|N|Y|2024-01-15|VIP|PATIENT_PREFERS_MORNING_VISITS

ZPV|1|ATTENDING^SMITH^JANE^MD|20240301|CONSULT_REQUESTED^CARDIOLOGY

Annotations (6)

Click on a segment above to highlight its fields

Hover or click any field value for detailed explanation

Integration Challenges and Solutions

Unknown Z-Segments

Receivers may not recognize custom segments. Use configurable handling (ignore/store/reject), document contracts, and preserve raw payloads for replay.

Vendor-Specific Formats

Different vendors encode similar custom data differently. Mitigate with vendor-specific mappings and a normalized internal canonical model.

Version Conflicts

Custom segment definitions may change over time. Version the schema and provide backward compatibility layers where migration windows overlap.

Regex Parsing Patterns

Regex is useful for validation and extraction, but production parsers should still use deterministic segment/field models with strict error handling.

Hierarchical parser execution path

100%drag to pan

Loading diagram...

hl7_parser.py

Python

import re
from dataclasses import dataclass
from typing import List

@dataclass
class Segment:
    name: str
    fields: List[str]

def parse_message(raw: str) -> List[Segment]:
    normalized = raw.replace("\r\n", "\n").replace("\r", "\n")
    lines = [ln.strip() for ln in normalized.split("\n") if ln.strip()]
    segments: List[Segment] = []

    for line in lines:
        fields = line.split("|")
        segments.append(Segment(name=fields[0], fields=fields[1:]))

    return segments

Regex-based extraction flow

100%drag to pan

Loading diagram...

hl7_regex_patterns.py

Python

import re

SEGMENT_PATTERN = re.compile(r'^([A-Z][A-Z0-9]{2})\|(.*)$', re.MULTILINE)
ESCAPE_PATTERN = re.compile(r'\\(F|S|T|E|R|\.br)\\')

def find_segments(message: str):
    return SEGMENT_PATTERN.findall(message)

def replace_escapes(value: str):
    mapping = {
        'F': '|',
        'S': '^',
        'T': '&',
        'R': '~',
        'E': '\\',
    }

    def _replace(match):
        token = match.group(1)
        return mapping.get(token, match.group(0))

    return ESCAPE_PATTERN.sub(_replace, value)

Regex pattern quick reference

Pattern	Purpose
^([A-Z][A-Z0-9]{2})\\|	Match three-character segment ID followed by field separator
(?<!\\)\\|	Split on non-escaped field separators
\\(F\|S\|T\|E\|R)\\	Match HL7 delimiter escape sequences
PID\\|[^\n]\\|[^\\|]\\|([^\^\\|]+)	Extract a primary patient identifier from PID-3

Error Handling and Recovery

Production HL7 interfaces need severity-based failure handling. Not every parse defect should trigger the same response.

References: HL7 v2 Control chapter and ACK semantics (AA, AE, AR). Recommended behavior aligns with common interface engines such as Mirth, Rhapsody, and Camel MLLP.

Warning Flow (Recoverable)

Warning path

100%drag to pan

Loading diagram...

Error Flow (Configurable)

Strict vs lenient path

100%drag to pan

Loading diagram...

Fatal Flow (Unrecoverable)

Fatal reject path

100%drag to pan

Loading diagram...

hl7_error_handler.py

Python

class ParseError(Exception):
    def __init__(self, message: str, segment: str | None = None, field: int | None = None):
        self.segment = segment
        self.field = field
        super().__init__(message)

def require_msh(segments: list[str]) -> None:
    if not segments or not segments[0].startswith("MSH|"):
        raise ParseError("Missing or invalid MSH segment", segment="MSH")

def classify_error(err: ParseError) -> str:
    if err.segment == "MSH":
        return "fatal"
    if err.field is not None:
        return "error"
    return "warning"

Common Parsing Failure Scenarios

Unescaped delimiters in data

Bad: OBX||ST|NOTE|Patient has | diabetes. Good: OBX||ST|NOTE|Patient has \F\ diabetes.

Missing or corrupted MSH

Validate first segment, check for BOM artifacts, and ensure MSH-1/MSH-2 are present before parsing.

Line ending mismatches

Normalize CRLF, LF, and CR line endings to a single internal delimiter before segmentation.

Encoding character mismatch

Always read separators from MSH, never hardcode assumptions about delimiters.

Truncated messages

Use size checks, required-segment validation, and transport framing integrity checks before ACKing.

Parsing Best Practices

Input Validation

Validate message size limits and reject suspicious payloads.
Require MSH as first segment and verify delimiter metadata.
Check version compatibility and required segment sets.
Sanitize untrusted input before downstream processing.

Parser Design

Use streaming parsers for large messages.
Support strict and lenient modes by trading partner.
Handle all HL7 escape sequences deterministically.
Cache delimiter definitions from MSH for hot-path parsing.

Error Management

Classify failures as warning, error, or fatal.
Include segment and field context in error reports.
Store raw payload for replay and forensic debugging.
Use dead-letter queues for unresolved failures.

Performance

Precompile regex patterns and reuse parser objects safely.
Minimize string copying in high-throughput paths.
Profile parsing hotspots before optimization.
Scale consumers independently from transport listeners.

Encoding Examples - Real-World Patterns

Understanding encoding patterns through concrete examples accelerates HL7 v2.x proficiency. This section provides annotated examples showing proper delimiter usage, escape sequences, and common encoding scenarios.

Example 1: Patient Name with Special Characters

When patient names contain delimiter characters, proper escaping is required:

PID with escaped pipe in name

Text

PID|1||123456^^^MRN^MR||O'NEILL^PATRICK^M||19850615|M
PID|1||789012^^^MRN^MR||SMITH-JONES^MARY^ANN||19900220|F
PID|1||345678^^^MRN^MR||VAN DER BERG^JAN||19880310|M

O'NEILL: Apostrophe is allowed without escaping (not a delimiter).
SMITH-JONES: Hyphen is allowed without escaping.
VAN DER BERG: Spaces are allowed in name components.
If name contained pipe (|), it would be: SMITH\F\JONES^MARY.

Example 2: Multi-Component Address

HL7 addresses use XPN data type with multiple components:

PID with structured address

Text

PID|1||123456^^^MRN^MR||DOE^JOHN||19800101|M|||123 MAIN ST^APT 4B^ANYTOWN^CA^90210^USA^HOME^^123456789
PID|1||789012^^^MRN^MR||SMITH^JANE||19850515|F|||456 OAK AVE^^SPRINGFIELD^IL^62701^USA^^^987654321

XAD address component breakdown

Position	Component	Example Value
1	Street Address	123 MAIN ST^APT 4B
2	Other Designation	(empty)
3	City	ANYTOWN
4	State/Province	CA
5	Postal Code	90210
6	Country	USA

Example 3: Repeating Fields and Components

HL7 supports repeating values using the tilde (~) separator:

PID with multiple phone numbers and aliases

Text

PID|1||123456^^^MRN^MR||DOE^JOHN||19800101|M|||123 MAIN ST^^ANYTOWN^CA^90210
PID|1||123456^^^MRN^MR~987654^^^SSN^SS||DOE^JOHN^M||19800101|M|||(555)123-4567^HOME^(555)987-6543^WORK

PID-3: Two patient identifiers (MRN and SSN) separated by ~.
PID-13: Two phone numbers (home and work) with type codes.
Each phone: (555)123-4567^HOME means value=phone, type=HOME.

Example 4: Coded Fields with CWE Data Type

Coded fields use CWE (Coded with Exceptions) for standardized values:

OBX with coded result

Text

OBX|1|CWE|718-7^HEMOGLOBIN^LN||LA6568-5^NORMAL^LN|g/dL|12.0-16.0|N|||F
OBX|2|CWE|33747-0^SARS-COV-2^LN||260373001^DETECTED^SCT|||A|||F

CWE component structure

Component	Meaning	Example
1	Identifier	LA6568-5
2	Text	NORMAL
3	Coding System	LN (LOINC)
4-6	Alternate coding	(optional)

HL7 Data Types Reference

Official HL7 v2.x data type specifications

HL7 Data Types

Validation Patterns - Production-Ready Checks

Robust HL7 validation prevents data corruption and ensures interoperability. This section covers validation patterns from basic syntax checks to conformance profile validation.

Validation Levels

Progressive validation strategy

Level	Checks	Action on Failure
1 - Syntax	MSH present, delimiters valid, segments parseable	Reject with AE (error) ACK
2 - Structure	Required segments present, field counts valid	Reject with AR (reject) ACK
3 - Data Type	TS format valid, NM is numeric, CWE codes valid	Warning or reject based on configuration
4 - Business Rules	PV1-2 matches patient class, DOB is reasonable	Warning with AA ACK, flag for review
5 - Conformance Profile	IGAMT/NIST profile validation	Reject non-conformant messages

Required Field Validation

TypeScript validation example

TypeScript

interface ValidationResult {
  isValid: boolean;
  errors: string[];
  warnings: string[];
}

function validateMSH(segment: string): ValidationResult {
  const errors: string[] = [];
  const warnings: string[] = [];
  const fields = segment.split('|');
  
  // MSH-1: Must be field separator
  if (fields[0] !== '^~\\&') {
    errors.push('MSH-1: Invalid encoding characters');
  }
  
  // MSH-9: Message type required
  if (!fields[8] || !fields[8].includes('^')) {
    errors.push('MSH-9: Message type required (format: TYPE^EVENT)');
  }
  
  // MSH-10: Control ID required
  if (!fields[9]) {
    errors.push('MSH-10: Message control ID required');
  }
  
  // MSH-12: Version ID recommended
  if (!fields[11]) {
    warnings.push('MSH-12: Version ID not specified');
  }
  
  return {
    isValid: errors.length === 0,
    errors,
    warnings,
  };
}

Timestamp Validation (TS Data Type)

HL7 timestamps follow strict format: YYYYMMDDHHMMSS[.SSSS][+/-ZZZZ]:

Timestamp validation regex

TypeScript

const TS_PATTERN = /^\d{4}(0[1-9]|1[0-2])(0[1-9]|[12]\d|3[01])([01]\d|2[0-3])([0-5]\d)([0-5]\d)(\.\d{1,4})?([+-][01]\d[0-5]\d)?$/;

function validateTimestamp(ts: string): boolean {
  if (!ts) return true; // Empty is allowed for optional fields
  return TS_PATTERN.test(ts);
}

// Examples:
// Valid: 20240315143022, 20240315143022.1234, 20240315143022+0500
// Invalid: 2024-03-15, 03/15/2024, 20240315

Conformance Profile Validation

NIST IGAMT allows creation of implementation guides with custom validation rules:

Define required segments and fields per message type.
Specify value sets and allowed codes for coded fields.
Set cardinality constraints (e.g., PID-5 is required, repeat 1).
Add conditional rules (e.g., if PV1-2=I, then PV1-3 is required).
Export profiles for automated validation in interface engines.

NIST IGAMT Tool

The NIST IGAMT (Implementation Guide and Message Tool) provides free profile creation and validation. Use it to define your organization's HL7 implementation guide and validate messages against it.

HL7 v2 Data Type Specification

Official HL7 v2.x data type definitions

HL7 Data Types

AWS Healthcare Data Validation

AWS patterns for HL7 message validation

AWS Data Validation

NIST IGAMT Tool

Free tool for HL7 conformance profile creation and validation

NIST IGAMT

AWS Healthcare Data Validation

AWS patterns for HL7 message validation

AWS Data Validation

HL7 v2 Data Type Specification

Official HL7 v2.x data type definitions

HL7 Data Types

Knowledge Check

Test your understanding with this quiz. You need to answer all questions correctly to mark this section as complete.

Quiz Progress

Question 1 of 22

What character is used as the field separator in HL7 v2.x messages?

← PreviousTransport Methods Next →Trigger Events

Character

Name

MSH position

Purpose

Field separator

MSH-1

Separates fields within each segment

Component separator

MSH-2[0]

Splits fields into components

Repetition separator

MSH-2[1]

Supports repeated values in one field

Escape character

MSH-2[2]

Escapes delimiter characters in literal data

Subcomponent separator

MSH-2[3]

Further splits component values

Escape

Represents

\F\

Field separator |

\S\

Component separator ^

\T\

Subcomponent separator &

\R\

Repetition separator ~

\E\

Escape character \

\.br\

Line break

\Xdd\

Hexadecimal data

\Zxx\

Custom escape sequence

Segment

Domain

Typical fields

ZPD

Patient demographics extension

Language, religion, VIP status, interpreter requirements

ZPV

Visit extension

Attending detail, consult requests, accommodation metadata

ZIN

Insurance extension

Authorization, referral, payer-specific codes

ZEF

Encounter financial extension

Charge master, revenue code, DRG-related data

Pattern

Purpose

^([A-Z][A-Z0-9]{2})\|

Match three-character segment ID followed by field separator

(?<!\\)\|

Split on non-escaped field separators

\\(F|S|T|E|R)\\

Match HL7 delimiter escape sequences

PID\|[^\n]*\|[^\|]*\|([^\^\|]+)

Extract a primary patient identifier from PID-3

Position

Component

Example Value

Street Address

123 MAIN ST^APT 4B

Other Designation

(empty)

City

ANYTOWN

State/Province

Postal Code

90210

Country

USA

Component

Meaning

Example

Identifier

LA6568-5

Text

NORMAL

Coding System

LN (LOINC)

4-6

Alternate coding

(optional)

Level

Checks

Action on Failure

1 - Syntax

MSH present, delimiters valid, segments parseable

Reject with AE (error) ACK

2 - Structure

Required segments present, field counts valid

Reject with AR (reject) ACK

3 - Data Type

TS format valid, NM is numeric, CWE codes valid

Warning or reject based on configuration

4 - Business Rules

PV1-2 matches patient class, DOB is reasonable

Warning with AA ACK, flag for review

5 - Conformance Profile

IGAMT/NIST profile validation

Reject non-conformant messages

Knowledge Tree

Encoding Characters and Delimiters

Delimiter Usage Example

Escape Sequences Reference

HL7 v2+ Chapter 2 (Control)

NIST IGAMT Validation Tool

Z-Segments

Naming Convention

Z-Segment Example

Integration Challenges and Solutions

Unknown Z-Segments

Vendor-Specific Formats

Version Conflicts

Regex Parsing Patterns

Hierarchical parser execution path

Regex-based extraction flow

Error Handling and Recovery

Warning Flow (Recoverable)

Warning path

Error Flow (Configurable)

Strict vs lenient path

Fatal Flow (Unrecoverable)

Fatal reject path

Common Parsing Failure Scenarios

Unescaped delimiters in data

Missing or corrupted MSH

Line ending mismatches

Encoding character mismatch

Truncated messages

Parsing Best Practices

Input Validation

Parser Design

Error Management

Performance

Encoding Examples - Real-World Patterns

Example 1: Patient Name with Special Characters

Example 2: Multi-Component Address

Example 3: Repeating Fields and Components

Example 4: Coded Fields with CWE Data Type

HL7 Data Types Reference

Validation Patterns - Production-Ready Checks

Validation Levels

Required Field Validation

Timestamp Validation (TS Data Type)

Conformance Profile Validation

NIST IGAMT Tool

HL7 v2 Data Type Specification

AWS Healthcare Data Validation

NIST IGAMT Tool

AWS Healthcare Data Validation

HL7 v2 Data Type Specification

Further Reading

HL7 v2+ Chapter 2 (Control)

HAPI HL7 v2 (Java library)

HL7 v2 Control chapter reference

Knowledge Check

What character is used as the field separator in HL7 v2.x messages?

Knowledge Tree

Encoding Characters and Delimiters

Delimiter Usage Example

Escape Sequences Reference

HL7 v2+ Chapter 2 (Control)

NIST IGAMT Validation Tool

Z-Segments

Naming Convention

Z-Segment Example

Integration Challenges and Solutions

Unknown Z-Segments

Vendor-Specific Formats

Version Conflicts

Regex Parsing Patterns

Hierarchical parser execution path

Regex-based extraction flow

Error Handling and Recovery

Warning Flow (Recoverable)

Warning path

Error Flow (Configurable)

Strict vs lenient path

Fatal Flow (Unrecoverable)

Fatal reject path