Arithmetic Expressions, lexer and parser

Another cut at a Go algebraic-order-of-operations arithmetic
expression evaluator.
Compare it to my earlier incarnation,
which was more-or-less a Go transliteration of a C program.
This version does share most of the parse tree implementation
with that earlier incarnation,
but it does the arithmetic evaluation in more idiomatic Go.
Build
$ cd $GOPATH/src
$ git clone https://github.com/bediger4000/arithmetic-parser.git
$ cd arithmetic-parser
$ go build arithmetic-parser
arithmetic-parser
parses and prints a single arithmetic expression,
passed to arithmetic-parser
as a command line argument:
$ ./arithmetic-parser '1 + 3*4'
Reconstituted expression: "1 + (3 * 4)"
/* 13 */
With a -g
command line flag,
arithmetic-parser
prints a GraphViz dot
format
representation of the parse tree,
evaluates the parse tree and prints the value.
You would do something like this:
$ ./arithmetic-parser -g '1 + 3*4' > x.dot
$ dot -Tpng -o x.png x.dot
$ feh x.png
Lexer
I did the lexer based on a Rob Pike talk.
The lexer runs in its own goroutine,
using a channel to give tokens and token types to the parser.
Using a lexer struct looks like some object-oriented
garbage, but under the hood, it's asynchronous with the parser.
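Here's a minimal sketch of that shape, with hypothetical type and
method names; the repo's actual package lexer differs in detail:

type TokenType int

// Token kinds for this sketch; the real lexer has more.
const (
	TokenNumber TokenType = iota
	TokenOperator
	TokenEOF
)

// Token pairs a token type with its lexeme.
type Token struct {
	Type   TokenType
	Lexeme string
}

// Lexer owns the input string and the channel the parser reads.
type Lexer struct {
	input  string
	pos    int
	tokens chan Token
}

// NewLexer starts scanning in a separate goroutine;
// the parser just receives Tokens from the channel.
func NewLexer(input string) *Lexer {
	l := &Lexer{input: input, tokens: make(chan Token)}
	go l.run()
	return l
}

func (l *Lexer) run() {
	for l.pos < len(l.input) {
		switch c := l.input[l.pos]; {
		case c == ' ':
			l.pos++
		case c >= '0' && c <= '9':
			start := l.pos
			for l.pos < len(l.input) && l.input[l.pos] >= '0' && l.input[l.pos] <= '9' {
				l.pos++
			}
			l.tokens <- Token{TokenNumber, l.input[start:l.pos]}
		default: // single-character operators and parentheses
			l.tokens <- Token{TokenOperator, string(c)}
			l.pos++
		}
	}
	l.tokens <- Token{TokenEOF, ""}
	close(l.tokens)
}

Each receive from l.tokens blocks the parser until the scanning
goroutine has the next token ready; that's the asynchrony hiding
under the struct.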
Following along with Rob Pike and understanding his lexer design
was my motivation for this project.
Parser
I did a recursive descent parser,
using this Ohio State class handout
as a guide.
The grammar looks like this:
expr -> term {add-op term}
term -> spork {mult-op spork}
spork -> factor {exp-op factor}
factor -> '(' expr ')' | add-op factor | NUMBER
add-op -> '+'|'-'
mult-op -> '*'|'/'|'%'
exp-op -> '^'
Punctuation (parentheses), operation signs and numbers
are terminal symbols.
I use "spork" as a name to get an extra level of precedence.
There's got to be an official name for this precedence.
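For example, in 1 + 3*4 the multiplication can only derive inside
a single term, which is exactly what gives it higher precedence
than the addition:

expr => term + term
     => 1 + term            (left term => spork => factor => NUMBER)
     => 1 + spork * spork   (right term => spork mult-op spork)
     => 1 + 3 * 4           (each spork => factor => NUMBER)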
I added "%" (for remainder/modulo),
'^' (for exponentiation)
and allowed unary positive and negative operators,
to customize the exercise.
"CFG" below abbreviates "context free grammar".
I was able to follow their rules to write the code:
- One parse method per non-terminal symbol
- A non-terminal symbol on the right-hand side of a rewrite rule leads
to a call to the parse method for that non-terminal
- A terminal symbol on the right-hand side of a rewrite rule leads to
"consuming" that token from the input token string
- | in the CFG leads to "if-else" in the parser
- {...} in the CFG leads to a while loop in the parser
The grammar has to be correct for this sort of semi-mechanical
coding to work.
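Here's a minimal sketch of how the first two rules come out in Go,
with hypothetical names throughout (the spork level is elided since
it follows the same pattern as parseTerm, and the peek/next helpers
are sketched after the next paragraph):

// Node is a minimal parse tree node for this sketch.
type Node struct {
	op          Token
	left, right *Node
}

// expr -> term {add-op term}
// One parse method per non-terminal; {...} becomes a for loop;
// the terminal add-op is consumed with next().
func (p *Parser) parseExpr() *Node {
	n := p.parseTerm()
	for p.peek().Lexeme == "+" || p.peek().Lexeme == "-" {
		op := p.next() // consume the terminal symbol
		n = &Node{op: op, left: n, right: p.parseTerm()}
	}
	return n
}

// term -> spork {mult-op spork}, with spork elided here;
// the real chain has one more level of this for '^'.
func (p *Parser) parseTerm() *Node {
	n := p.parseFactor()
	for p.peek().Lexeme == "*" || p.peek().Lexeme == "/" || p.peek().Lexeme == "%" {
		op := p.next()
		n = &Node{op: op, left: n, right: p.parseFactor()}
	}
	return n
}

// factor -> '(' expr ')' | add-op factor | NUMBER
// The | alternatives become if-else.
func (p *Parser) parseFactor() *Node {
	if p.peek().Lexeme == "(" {
		p.next() // consume '('
		n := p.parseExpr()
		p.next() // consume ')'
		return n
	}
	if p.peek().Lexeme == "+" || p.peek().Lexeme == "-" {
		return &Node{op: p.next(), left: p.parseFactor()} // unary +/-
	}
	return &Node{op: p.next()} // NUMBER
}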
The tricky part was realizing that the parser needed to see what
type of token it had,
but not "use up" that token every time it looked at the type
or the lexeme itself.
The "consuming" note in the Ohio State handout
didn't make sense until I realized that.
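That realization turns into one token of buffered lookahead,
something like this (reusing the sketch's Token type; names are
illustrative):

// The parser can't re-read a channel, so it buffers one token.
// peek() inspects a token's type or lexeme without using it up;
// next() actually consumes it.
type Parser struct {
	tokens    chan Token
	lookahead *Token
}

func (p *Parser) peek() Token {
	if p.lookahead == nil {
		t := <-p.tokens
		p.lookahead = &t
	}
	return *p.lookahead
}

func (p *Parser) next() Token {
	t := p.peek()
	p.lookahead = nil
	return t
}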
Expression evaluation
I borrowed an interface from a 2010 Google I/O talk
by Rob Pike and Russ Cox
to do the actual arithmetic.
The interface looks like this:
type Value interface {
	BinaryOp(op lexer.TokenType, y Value) Value
	String() string
}
It has an integer arithmetic implementation,
and an error holder implementation.
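A sketch of what those two implementations could look like; the
lexer token constants and the import path are guesses, not the
repo's actual names:

package value

import (
	"strconv"

	"arithmetic-parser/lexer" // assumed path, per the GOPATH layout above
)

// Int is the integer arithmetic implementation of Value.
type Int int

// Error holds a run-time problem, like divide-by-zero, and
// propagates it through the rest of the evaluation.
type Error string

func (i Int) String() string   { return strconv.Itoa(int(i)) }
func (e Error) String() string { return string(e) }

// Any operation involving an Error just yields that Error.
func (e Error) BinaryOp(op lexer.TokenType, y Value) Value { return e }

func (i Int) BinaryOp(op lexer.TokenType, y Value) Value {
	j, ok := y.(Int)
	if !ok {
		return y // the other operand is an Error; pass it along
	}
	switch op {
	case lexer.Plus: // hypothetical token constant names
		return i + j
	case lexer.Minus:
		return i - j
	case lexer.Mult:
		return i * j
	case lexer.Div:
		if j == 0 {
			return Error("divide by zero")
		}
		return i / j
	}
	return Error("unknown operator")
}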
Package tree
creates new Value
instances and
calls BinaryOp()
on them.
This simplifies tree.Node.Eval()
immensely,
and separates the arithmetic from the parse tree.
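A sketch of that shape, with guessed field and constructor names,
and unary operators left out:

// Leaves turn their lexemes into Value instances; interior nodes
// delegate all arithmetic to BinaryOp(), so Eval() never switches
// on the operator itself.
func (n *Node) Eval() value.Value {
	if n.left == nil && n.right == nil {
		return value.NewInt(n.lexeme) // hypothetical constructor
	}
	return n.left.Eval().BinaryOp(n.op, n.right.Eval())
}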
Because there's an error holder implementation,
reporting run-time problems like divide-by-zero
becomes much easier,
at the cost of moving that code into package value.