Click here to Skip to main content
15,860,972 members
Articles / Programming Languages / C#

YAML Parser in C#

Rate me:
Please Sign up or sign in to vote.
4.83/5 (26 votes)
17 Feb 2011CPOL2 min read 328.3K   4.5K   70   39
An almost feature complete YAML parser.

Image 1

Image 2

Introduction

YAML is a human-friendly, cross language, Unicode based data serialization language designed around the common native data types of agile programming languages. It is broadly useful for programming needs ranging from configuration files to Internet messaging to object persistence to data auditing.

Visit the official YAML website for more information.

YAML Basics

A YAML file may contain zero or more YAML documents, separated by document markers. A YAML document contains one root DataItem. There are three types of DataItems: Scalar, Sequence, and Mapping. DataItems may be nested to form structured data.

Each DataItem type has several formatting styles for good human readability.

yamlDataItem.png

Some rules:

  • A block style item can be nested to a block style item but not a flow style item.
  • A flow style item can be nested to either a block style or a flow style item.
  • Block structure is denoted by indentation.
  • All indentations only use space char, tab is not allowed.

Here are some examples:

Block Scalar

Literal Text
|
The text using
literal style.
|-
The text using
literal style.
"The text using\nliteral style.\n""The text using\nliteral style."
Folded Text
>
The text using
folded style.
>-
The text using
folded style.
"The text using folded style.\n""The text using folded style."
Flow Scalar
Plain Text
  • Can not start with ,[]{}#&*!|>'\"%@`
  • Can start with -?: followed by non space char
  • ": " and " #" cannot appear in between
'Single Quoted Text'
  • Line breaks are folded
  • "'" is escaped with "''"
"Double Quoted Text"
  • Line breaks are folded
  • Escape sequences can be used
Sequence
Block Sequence
- Item one
- Item two
- Item three
Flow Sequence
[Item one, Item two,
Item three]
Mapping
Block Mapping
Key1: Item one
Key2: Item two
? Key3
: Item three 
Flow Mapping
{Key1: Item one, Key2: Item two, 
Key3: Item three} 

Other

Anchor and Alias
Key1: &items
  A: Item A
  B: Item B
Key2: *items
Key1:
  A: Item A
  B: Item B
Key2:
  A: Item A
  B: Item B
Comment
# whole line comment
Data Item # inline comment

Background

There is already a Yaml Library for .NET project, but the features supported are limited.

Using the Code

The parser code is generated using a homemade tool based on grammar specified in the YAML.PEG.txt file. This grammar is not completely equal to the official YAML specification. Here are some differences:

A separator “,” is not allowed following the last entry of Sequence or Mapping in this parser. The 32-bit Unicode escape sequence “U” (ns-hex-digit × 8) is not supported.

The parser can be used like this:

C#
YamlParser parser = new YamlParser();
TextInput input = new TextInput(File.ReadAllText(yamlFilePath));
bool success;
YamlStream yamlStream = parser.ParseYamlStream(input, out success);
if (success)
{
    foreach (YamlDocument doc in yamlStream.Documents)
    {
        // access DataItem by doc.Root
    }
}
else
{
    MessageBox.Show(parser.GetEorrorMessages());
}

Or:

C#
YamlStream yamlStream = YamlParser.Load(yamlFilePath);

Points of Interest

The main shortcoming of this parser is that error messages are not intuitive. You are welcome to give suggestions.

History

  • 2008-08-21: Article submitted.
  • 2011-02-16: Redesigned the UI, fixed some parser bugs.

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


Written By
Architect YunCheDa Hangzhou
China China
This member has not yet provided a Biography. Assume it's interesting and varied, and probably something to do with programming.

Comments and Discussions

 
GeneralNew to YAML Pin
rko10020-Aug-10 2:42
rko10020-Aug-10 2:42 
GeneralNested sequences not working Pin
Member 293651324-Mar-10 12:35
Member 293651324-Mar-10 12:35 
GeneralRe: Nested sequences not working Pin
Liu Junfeng20-Jul-10 0:44
Liu Junfeng20-Jul-10 0:44 
GeneralWell done. Pin
CooperWu30-Jul-09 19:50
CooperWu30-Jul-09 19:50 
GeneralDoesn't work with non-indent list Pin
sgwong15-Jul-09 17:38
sgwong15-Jul-09 17:38 
GeneralRe: Doesn't work with non-indent list Pin
Liu Junfeng19-Jul-09 21:59
Liu Junfeng19-Jul-09 21:59 
GeneralRe: Doesn't work with non-indent list Pin
sgwong21-Jul-09 17:25
sgwong21-Jul-09 17:25 
GeneralRe: Doesn't work with non-indent list Pin
iamcharliemouse23-May-11 5:55
iamcharliemouse23-May-11 5:55 
Yes, this is still an issue. According to the Yaml spec 8.22, this should be accepted:

sequence: !!seq
- entry
- !!seq
 - nested
mapping: !!map
 foo: bar


This library will only accept the sequence indented, i.e.:

sequence: !!seq
 - entry
 - !!seq
  - nested
mapping: !!map
 foo: bar


A small flaw, but painful to work around.

By the way, this is a superb library, so thank you very much - it's extremely useful. Thanks!
GeneralThe mapping for YAML should be a dictionary instead of a List Pin
sgwong29-Jun-09 1:19
sgwong29-Jun-09 1:19 
GeneralRe: The mapping for YAML should be a dictionary instead of a List Pin
Liu Junfeng2-Jul-09 18:13
Liu Junfeng2-Jul-09 18:13 
GeneralRe: The mapping for YAML should be a dictionary instead of a List Pin
sgwong15-Jul-09 17:41
sgwong15-Jul-09 17:41 
GeneralAccessing Data Items Pin
anandamd3-Jun-09 20:21
anandamd3-Jun-09 20:21 
GeneralRe: Accessing Data Items Pin
Liu Junfeng7-Jun-09 19:27
Liu Junfeng7-Jun-09 19:27 
QuestionVery Nice Pin
Duddy9-Mar-09 17:45
Duddy9-Mar-09 17:45 
GeneralYaml for .Net Pin
briviere25-Nov-08 9:50
briviere25-Nov-08 9:50 
GeneralGreat Job Pin
daaharper20-Sep-08 22:34
daaharper20-Sep-08 22:34 
RantI hate recursive acronyms Pin
chaiguy133725-Aug-08 12:52
chaiguy133725-Aug-08 12:52 
GeneralGreat work.. Pin
breakthrough622-Aug-08 12:23
breakthrough622-Aug-08 12:23 
GeneralRe: Great work.. Pin
Liu Junfeng24-Aug-08 20:27
Liu Junfeng24-Aug-08 20:27 
GeneralVery nice Pin
Jonathan C Dickinson21-Aug-08 1:29
Jonathan C Dickinson21-Aug-08 1:29 

General General    News News    Suggestion Suggestion    Question Question    Bug Bug    Answer Answer    Joke Joke    Praise Praise    Rant Rant    Admin Admin   

Use Ctrl+Left/Right to switch messages, Ctrl+Up/Down to switch threads, Ctrl+Shift+Left/Right to switch pages.