Understanding YAML: The Flexible Data Serialization Format

Atul KaintyuraAtul Kaintyura
3 min read

YAML, originally an acronym for Yet Another Markup Language, has evolved into YAML Ain't Markup Language, reflecting its departure from being a traditional markup language. It is a versatile data serialization format widely used for configuration files, data exchange, and more.

What is YAML?

YAML is akin to formats like JSON and XML but focuses solely on data storage, not commands. It is widely used for applications like configuration files in Docker, Kubernetes, and for managing logs, caches, and more.


What is Data Serialization?

Data serialization refers to converting structured data into a format that can be easily transmitted or stored. YAML excels in this domain, providing a human-readable and easily editable approach.


Benefits of YAML

  1. Simplicity: YAML’s syntax is clean and easy to read.

  2. Strict Syntax: Indentation is crucial in YAML, ensuring clarity and structure.

  3. Interoperability: Easily convertible to JSON, XML, and other formats.

  4. Wide Adoption: Supported by most programming languages.

  5. Expressiveness: Better at representing complex data structures.

  6. Tool Compatibility: Works well with parsers and other tools, making parsing straightforward.


How to Use YAML

YAML organizes data using key-value pairs and lists. Here’s a brief overview:

Key-Value Pairs

yamlCopy codeapple: "I am a red fruit"

Lists

yamlCopy codefruits:
  - apple
  - mango
  - banana
  - Apple # YAML is case-sensitive

Separators and Comments

  • Separate sections with --- and end with ....

  • Comments start with #.


Datatypes in YAML

  1. Strings: Enclosed in ", ', or left plain.

     yamlCopy codename: Atul
     greeting: "Hello, World!"
    
  2. Multiline Strings:

    • Use | for preserving line breaks.

    • Use > to fold text into a single line.

    yamlCopy codemessage: |
      This is a
      multiline string.

    foldedMessage: >
      This will
      appear as a single line.
  1. Numbers: Supports integers, floats, and binary numbers.

     yamlCopy codeage: !!int 25
     binaryNumber: !! 0b11001
    
  2. Boolean Values:

     yamlCopy codeisEnabled: true
    

Advanced Data Structures

  1. Sequences:

     yamlCopy codecities: 
       - Delhi
       - Uttarakhand
    
  2. Nested Sequences:

     yamlCopy code- 
       - first item
       - second item
    
  3. Mappings (Key-Value Pairs):

     yamlCopy codeuser:
       name: Atul
       age: 20
    
  4. Nested Maps:

     yamlCopy coderole:
       age: 78
       job: student
    
  5. Sets: Unique values only.

     yamlCopy codenames: !!set
       ? Atul
       ? Genius
    
  6. Ordered Maps:

     yamlCopy codepeople: !!omap
       - Atul:
           name: AK
           age: 20
    

Anchors and Reusability

YAML supports anchors (&) and aliases (*) for reusing properties:

yamlCopy codelikings: &likes
  favorite_fruit: mango
  dislikes: grapes

person:
  name: Joe
  <<: *likes

Validation and Tools

Validating YAML files is crucial, especially for Kubernetes configurations. Here’s a guide to validate Kubernetes YAML files.

Tools for YAML Creation

  • Datree

  • Monokle by Kubeshop

  • Lens: The Kubernetes platform


Conclusion

YAML is an indispensable tool for developers and system administrators, providing a balance of simplicity and power. Its ability to represent complex data structures while remaining human-readable makes it ideal for numerous use cases. Whether you're managing configurations or exchanging data, YAML is a format worth mastering!

10
Subscribe to my newsletter

Read articles from Atul Kaintyura directly inside your inbox. Subscribe to the newsletter, and don't miss out.

Written by

Atul Kaintyura
Atul Kaintyura