Parsing Expression Grammar Support for C# 3.0 Part 1 - PEG Lib and Parser Generator

Martin.Holzherr

7 Oct 2008, CPOL, 40 min read


Introduction to the parsing method PEG with library and parser generator

peg_grammarexplorer.zip
- PEG_GrammarExplorer
  - Documents
    - PEG_GrammarExplorer_fromCP.htm
    - PEG_GrammarExplorer_fromCP-Dateien
      - CodeProject.css
    - PEG_GrammerExplorer.jpg
  - mssccprj.scc
  - PEG Explorer
    - mssccprj.scc
    - PEG Explorer.csproj
    - PEG Explorer.csproj.user
    - PegExplorer.cs
    - PegExplorer.Designer.cs
    - PegExplorer.Designer_in_err.cs
    - PegExplorer.resx
    - Program.cs
    - Properties
      - AssemblyInfo.cs
      - Resources.Designer.cs
      - Resources.resx
      - Settings.Designer.cs
      - Settings.settings
      - vssver2.scc
    - vssver2.scc
  - PEG_GrammarExplorer.sln
  - PegBase
    - mssccprj.scc
    - PEG Base.csproj
    - PegBase.cs
    - Properties
      - AssemblyInfo.cs
      - vssver2.scc
    - vssver2.scc
  - PegSamples
    - BasicEncodingRules
      - BER.cs
      - BER_peg.txt
      - BERConvert.cs
      - BERTree.cs
      - BERTree_peg.txt
      - input
        - CDURKR2URKR125195
        - DefiniteLengthForm
        - CDURKR2URKR125195
        - hello
        - TDAUTPTEUR0100011.tap3
        - vssver2.scc
        - hello
        - IndefiniteLengthForm
        - DefiniteLengthForm
        - TDAUTPTEUR0100011.tap3
        - TDAUTPTEUR0100011.tap3
        - TDAUTPTEUR0100011.tap3
        - TDAUTPTEUR0100011_withError.tap3
        - vssver2.scc
      - vssver2.scc
    - C_KernighanRitchie2
      - C_KernighanRitchie2.cs
      - C_KernighanRitchie2_peg.txt
      - input
        - approx_preprocessed.c
        - bignum1_preprocessed.c
        - bignum2_preprocessed.c
        - binsrch_preprocessed.c
        - bitarray_preprocessed.c
        - bitcnt_preprocessed.c
        - bitstrng_preprocessed.c
        - deque_preprocessed.c
        - vssver2.scc
      - vssver2.scc
    - calc0_direct
      - calc0_direct.cs
      - calc0_direct_peg.txt
      - input
        - calc0_direct_01.txt
        - vssver2.scc
      - vssver2.scc
    - calc0_tree
      - calc0_tree.cs
      - calc0_tree_eval.cs
      - calc0_tree_peg.txt
      - input
        - calc0_tree_01.txt
        - calc0_tree_02.txt
        - calc0_tree_03.txt
        - calc0_tree_04.txt
        - vssver2.scc
      - vssver2.scc
    - CSharp3
    - EMail
    - Json
    - mssccprj.scc
    - PEG Samples.csproj
    - PEG Samples.csproj.user
    - PegGenerator
      - input
        - BER_peg.txt
        - BERTree_peg.txt
        - C#
        - BER.cs
        - BERTree.cs
        - calc0_direct.cs
        - calc0_tree.cs
        - IntSum.cs
        - python_2_5_2_i.cs
        - TestPeg.cs
        - C_KernighanRitchie2_peg.txt
        - calc0_direct_peg.txt
        - calc0_tree_peg.txt
        - CSharp3_peg.txt
        - CSharp3Fast_peg.txt
        - email_rfc2821_peg.txt
        - json_check_peg.txt
        - json_fast_peg.txt
        - json_tree_peg.txt
        - python_2_5_2_peg.txt
        - TestCases
        - C#
        - .cs
        - calc0 direct.cs
        - calc0_direct .cs
        - calc0_direct.cs
        - vssver2.scc
        - calc0_direct_GrammarNameMissing_peg.txt
        - calc0_direct_NameAttributeHasLowerCase_peg.txt
        - calc0_direct_NameAttributeHasWrongCase_peg.txt
        - calc0_direct_test01_peg.txt
        - vssver2.scc
        - TestPeg.txt
        - vssver2.scc
      - PegCSharpGenerator.cs
      - PegGrammarParser.cs
      - PegParserGenerator.cs
      - vssver2.scc
    - Properties
      - AssemblyInfo.cs
      - vssver2.scc
    - python_2_5_2
      - input
        - adwords
        - awapi_python_samples_1.0.0
        - src
        - add_ad_group.py
        - add_campaign.py
        - add_image_ad.py
        - add_keyword.py
        - add_local_business_ad.py
        - add_text_ad.py
        - add_website.py
        - check_keyword_traffic.py
        - delete_text_ad.py
        - estimate_keyword.py
        - get_account_info.py
        - get_all_ad_groups.py
        - get_all_ads.py
        - get_all_campaigns.py
        - get_all_criteria.py
        - get_api_usage.py
        - get_client_accounts.py
        - get_keyword_variations.py
        - get_sites_by_topics.py
        - schedule_report.py
        - decoratorators_01
        - fibo.py
        - fortune.txt
        - indent_test_01.txt
        - interactive_01.py
        - interactive_02.py
        - interactive_03.py
        - interactive_05.py
        - interactive_06.py
        - interactive_07.py
        - interactive_08.py
        - interactive_09.py
        - interactive_10.py
        - interactive_11.py
        - interactive_12.py
        - interactive_13.py
        - interactive_14.py
        - interactive_15.py
        - interactive_16.py
        - interactive_17.py
        - interactive_18.py
        - interactive_19.py
        - literal_01.py
        - python_class.py
        - python_func.py
        - simple_baby.py
        - simple_celsius.py
        - simple_fortune.py
        - simple_letters.py
        - simple_pyprint.py
        - simple_strfile.py
        - simple_tabview.py
        - string_concat.py
        - string_test_01.txt
      - python_2_5_2_i.cs
      - python_2_5_2_peg.txt
    - SamplesTester.cs
    - vssver2.scc
  - Problems
    - problem_list.txt
  - Sample PEG Console Parser
    - PEG Console Parser
      - input
        - json_ex01.json.txt
      - json_tree.cs
      - Properties
        - AssemblyInfo.cs
      - Sample PEG Console Parser.csproj
      - Sample PEG Console Parser.csproj.user
      - SamplePegConsoleParser.cs
    - Sample PEG Console Parser.sln
  - vssver2.scc
peg_grammarexplorer_submission.zip
- PEG_GrammarExplorer_Submission
  - PEG_GrammerExplorer.htm
  - PEG_GrammerExplorer.jpg
  - PEG_GrammerExplorer.zip

import whrandom, string

# Logic game
# From a program by Judith Haris, John Swets, and Wallace Feurzeig
# Reference: The Secret Guide to Computers, by Russ Walter, 18th ed 1993.
# Written in Python by A.M. Kuchling (amk@amk.ca)

# For each letter, we need the various characteristics:
# (curves, loose ends, obliques, horizontals, verticals).
# There should really be a sample character set for the user to look
# at; otherwise, there are ambiguities.  For example, does B have
# horizontals?  Does D?  How about P and R?

# There's a bug lurking in this data!  Can you catch it?
# (See the bottom of the program for the answer.)

letter_stats={'a': (0, 2, 2, 1, 0), 'b':(2, 0, 0, 3, 1),
		  'c': (1, 2, 0, 0, 0), 'd':(1, 0, 0, 0, 1),
		  'e': (0, 3, 0, 3, 1), 'f':(0, 3, 0, 2, 1),
		  'g': (1, 2, 0, 1, 1), 'h':(0, 4, 0, 1, 2),
		  'i': (0, 2, 0, 0, 1), 'j':(1, 2, 0, 0, 1),
		  'k': (0, 4, 2, 0, 1), 'l':(0, 2, 0, 1, 1),
		  'm': (0, 2, 2, 0, 2), 'n':(0, 2, 1, 0, 2),
		  'o': (1, 0, 0, 0, 0), 'p':(1, 1, 0, 2, 1),
		  'q': (1, 2, 1, 0, 0), 'r':(1, 2, 1, 0, 1),
		  's': (1, 2, 0, 0, 0), 't':(0, 3, 0, 1, 1),
		  'u': (1, 2, 0, 0, 2), 'v':(0, 2, 2, 0, 0),
		  'w': (0, 2, 4, 0, 0), 'x':(0, 4, 2, 0, 0),
		  'y': (0, 3, 2, 0, 1), 'z':(0, 2, 1, 2, 0)}

# We'll define constants for the various statistics; each constant is
# equal to the position of the statistic in the tuples in
# letter_stats.
CURVES=0 ; LOOSE_ENDS=1 ; OBLIQUES=2 ; HORIZONTALS=3 ; VERTICALS=4

# This dictionary is used to map questions to corresponding
# statistics.  Note that different keys can map to the same value;
# for example, 'obliques' and 'diagonals' both map to the OBLIQUES constant.  
questions={'curves':CURVES, 'looseends':LOOSE_ENDS,
	   'obliques':OBLIQUES, 'diagonals':OBLIQUES,
	   'horizontals':HORIZONTALS, 'verticals':VERTICALS}

# Play a single game

def play_once():
    # Choose a random integer between 0 and 25, inclusive.
    choice=int(26*whrandom.random())
    # Convert the numeric choice to a letter: 0->a, 1->b, etc.
    choice=chr(ord('a')+choice)
    
    #choice=raw_input("What should I choose?")		# (for debugging)
    
    # We'll track how many possibilities the user still has available.
    # Start with all of the letters.
    possibilities=string.lower("ABCDEFGHIJKLMNOPQRSTUVWXYZ")
    # We'll also track which questions have been asked, and chide the
    # user when he repeats a question.
    asked=[]

    # Loop forever; the play_once() function will exit by hitting a
    # 'return' statement inside the loop.
    while (1):
	try:
		#print possibilities						# (for debugging)
		
		# Get input from the user
		query=raw_input('Next? ')
		# Convert the input to lowercase
		query=string.lower(query)
		# Remove all non-letter characters
		query=filter(lambda x: x in string.lowercase, query)
		# Remove whitespace 
		query=string.strip(query)
		
	except (EOFError, KeyboardInterrupt):
		# End-Of-File or Ctrl-C: the user wants to quit
		print '\nOK; give up if you like.'
		return
	
	if len(query)==1:
		# The query is one character long, so it's a guess
		if query not in possibilities:
			print ("Wrong!  That guess is inconsistent "
			      "with the information you've been given.\n"
			      "I think you made that guess just to see "
			      "what I would say.")
		elif len(possibilities)>1:
			print "You don't have enough information yet."
			# Temporarily remove the user's guess from
			# possibilities, and pick a random letter.
			temp=filter(lambda x, query=query: x!=query, possibilities)
			r=int(whrandom.random()*len(temp))
			print "How do you know it isn't", temp[r]+',',
			print "for example?"
		else:
			# query is in possibilities, and
			# len(possibilities)==1, so the user is right.
			print "Yes, you've done it.  Good work!" ; return
	elif questions.has_key(query):
		# Get the field of the letter_stats tuple to compare.
		field=questions[query]
		# Determine the answer for the computer's letter
		result=letter_stats[choice][field]
		original_length=len(possibilities)

		# Exclude possibilities that don't match those of the
		# mystery letter.
		# filter(func, sequence) calls func() on each element in
		# the sequence, and returns a new sequence object
		# containing only elements for which func() returned true.
		# For strings, each character is an element.  Instead of
		# defining a formal function, a lambda is used to create
		# an anonymous function (one without a name).
		# Various other things required by the function are set
		# as default arguments, so they're accessible inside the
		# scope of the anonymous function.
		possibilities=filter(lambda letter, letter_stats=letter_stats,
						 field=field, result=result:
						 letter_stats[letter][field]==result,
						 possibilities)
		new_length=len(possibilities)
		if field in asked:
			print "You asked me that already."
			print "The answer is the same as before:",
		else: asked.append(field)  # Note that this question was asked.
		print str(result)+'.'
		if (original_length==new_length):
			print 'That was a wasted question; it did not exclude any possibilities.'
		elif (new_length<original_length/2 or new_length==1):
			print "Good question."
	else:
		print "I don't understand the question."

# Print the instructions
print """This is a guessing game about capital letters.
You can ask various questions about the features of the letter:
curves, loose ends, obliques (or diagonals), horizontals, verticals.
To make a guess, just enter the letter of your choice.

Sample transcript:
	Next? curves?
	1.
	Good question.
	Next? c
	You don't have enough information yet.
	How do you know it isn't s, for example?
	Next? horizontals?
	0.
	Next? s
	You don't have enough information yet.
	How do you know it isn't c, for example?
"""

# Play a single game
play_once()
raw_input("Press Return>")

# The solution to the bug-hunt is below...











# It's not a bug that the Python interpreter can catch; instead, it's
# a specification bug:
#
# 'C' and 'S' both have the same stats: 1 curve, 2 loose ends,
# and no obliques, horizontals, or verticals.  If either C or S is
# chosen as the computer's letter, the user can never get the right
# answer, because he/she can't narrow down the possibilities to just
# one!  To fix this, you'd have to add another statistic, like 
# number of intersections or number of closed loops.  However, the
# statistic would have to be *different* for 'C' and 'S', and neither
# of those two suggestions qualifies.  Can you think of a property to
# distinguish between the two letters?
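
The specification bug described in the closing comment can also be found mechanically. The following is a minimal sketch, not part of the original sample: it is written for Python 3 (the listing above targets Python 2.5), it copies the letter_stats table from the listing, and the helper name find_collisions is a hypothetical name chosen for illustration. It groups letters by their feature tuple and reports every group that holds more than one letter.

```python
from collections import defaultdict

# Copy of the table from the listing above:
# (curves, loose ends, obliques, horizontals, verticals).
letter_stats = {
    'a': (0, 2, 2, 1, 0), 'b': (2, 0, 0, 3, 1),
    'c': (1, 2, 0, 0, 0), 'd': (1, 0, 0, 0, 1),
    'e': (0, 3, 0, 3, 1), 'f': (0, 3, 0, 2, 1),
    'g': (1, 2, 0, 1, 1), 'h': (0, 4, 0, 1, 2),
    'i': (0, 2, 0, 0, 1), 'j': (1, 2, 0, 0, 1),
    'k': (0, 4, 2, 0, 1), 'l': (0, 2, 0, 1, 1),
    'm': (0, 2, 2, 0, 2), 'n': (0, 2, 1, 0, 2),
    'o': (1, 0, 0, 0, 0), 'p': (1, 1, 0, 2, 1),
    'q': (1, 2, 1, 0, 0), 'r': (1, 2, 1, 0, 1),
    's': (1, 2, 0, 0, 0), 't': (0, 3, 0, 1, 1),
    'u': (1, 2, 0, 0, 2), 'v': (0, 2, 2, 0, 0),
    'w': (0, 2, 4, 0, 0), 'x': (0, 4, 2, 0, 0),
    'y': (0, 3, 2, 0, 1), 'z': (0, 2, 1, 2, 0),
}


def find_collisions(stats):
    """Return sorted groups of letters that share an identical stat tuple.

    (Hypothetical helper, not part of the original program.)
    """
    groups = defaultdict(list)
    # Visit letters in alphabetical order so each group is already sorted.
    for letter, features in sorted(stats.items()):
        groups[features].append(letter)
    # Only groups with two or more letters are indistinguishable in the game.
    return sorted(group for group in groups.values() if len(group) > 1)


print(find_collisions(letter_stats))  # prints [['c', 's']]
```

Any extra statistic added to fix the game could be validated the same way: after extending each tuple, the function should return an empty list.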

License

This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Written By

Martin.Holzherr

Switzerland
