# Adding OOP to the Egg Parser
# Introduction
In this lab, we want to increase the expressiveness of our language.
The following example shows some of the extensions that we want to introduce:
cat examples/object-colon-selector.egg
do (
def(x, { # object literals!
c: [1, 2, 3], # array literals!
gc: fun(
element(self, "c") # old way works
),
sc: fun(value, # look at the left side of the assignment!
=(self.c[0], value)
),
inc: fun(
=(self.c[0], +(self.c[0], 1))
)
}),
print(x),
print(x.gc()), # [1, 2, 3]
x.sc(4),
print(x.gc()), # [4,2,3]
x.inc(),
print(x.gc()), # [5,2,3]
print(x.c.pop()), # 3
print(x.c) # [5,2]
)
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
Take a look at some of the features introduced:
- Added braces
{}
to refer to object literals:def(x, { ... })
- Note the appearance of the colon
:
token to separate the attribute name from the value in an object - Added brackets
[]
to refer to array literals[1, 2, 3]
- It is possible to access the properties of an object using the dot as in
x.c
- In this version of the Egg language,
self
denotes the object. it is likethis
in JS - It is possible to access the properties of any object using square brackets as in
self.c[0]
# Download evm with OOP extensions
During the development of this lab, you can execute the ASTs generated by your parser using one of the interpreters in this release:
download the version you need for the development of this lab, make a symbolic link to have it at hand:
✗ ln -s ~/campus-virtual/shared/egg/oop-evm-releases/evm-2122-macos ./evm
and try with some example:
✗ ./evm examples/object-colon-selector.json
{"c":[1,2,3]}
[1,2,3]
[4,2,3]
[5,2,3]
3
[5,2]
2
3
4
5
6
7
# Multiple Attribute Indexation
You can make multiple indexation of an object so that
a[0,2]
means a[0][2]
:
✗ cat examples/multiple-properties.egg
do(
def(a, [[4,5,6], 1,2,3]),
def(b, a[0,2]),
print(b) # 6
)%
✗ bin/eggc.js examples/multiple-properties.egg
✗ npx evm examples/multiple-properties.json
6
2
3
4
5
6
7
8
9
Same for objects a["p", "q", "r"]
means a.p.q.r
or a["p"]["q"]["r"]
:
✗ cat examples/multiple-properties-object-dot.egg
do(
def(a, { p : { q : { r : 1 } } }),
def(b, a["p", "q", "r"]),
print(b), # 1
print(a.p.q.r) # Same
)
✗ bin/eggc.js examples/multiple-properties-object-dot.egg
✗ npx evm examples/multiple-properties-object-dot.json
1
1
2
3
4
5
6
7
8
9
10
11
This is the section of the grammar that allows the use of property indexation:
expression -> ...
| %WORD applies
applies -> calls
| properties
| null
properties -> bracketExp applies
bracketExp -> "[" commaExp "]"
commaExp -> null
| expression ("," expression):*
2
3
4
5
6
7
8
9
10
11
12
# Property indexation and commaExp is nullable
Notice that commaExp
is nullable, and thus it fits with an empty indexation expression like a[]
which initially makes nonsense, (but read the next section). To fix the problem, we can change the grammar introducing a new category nonEmptyBracketExp
so that we can protest if the index list is empty:
➜ egg-oop-parser-solution git:(master) cat examples/empty-bracket.egg
do(
def(a, [1,2,3]),
print(a[])
)
2
3
4
5
➜ egg-oop-parser-solution git:(master) bin/eggc.js examples/empty-bracket.egg
There was an error: Syntax error accesing property at line 3 col 12.
Specify at least one property.
2
3
# The Syntactically Correct, Semantically Absurd Language Design Pattern
The "Syntactically Correct, Semantically Absurd" Language Design Pattern
Whenever a phrase is syntactically correct and it seems semantically absurd like is the case of x[]
,
I usually stop for a moment and consider 🤔 if there is some not obvious meaning we can give to it.
(opens new window) Noam Chomsky. 1957 Syntactic Structures
For instance all arrays, objects and maps have in common the length
property.
2
- May be we can give to
x[]
the meaning "to return a deep copy ofx
"? 👍 (See structuredClone (opens new window) node v17+ or npm package realistic-structured-clone (opens new window)) - May be we can give to
x[]
the meaning "to returnx.length
"? 👎
See also Syntactically correct, semantically incorrect sentence (opens new window)
# Currying in Egg
When the argument used to index a function object is not an attribute of the function
someFun[arg1, ... ] # and "arg1" is not a property of "someFun"
then we want arg1, ...
to be interpreted as arguments for someFun
and the expression returns the currying of the function (opens new window) on arg1, ...
.
For instance:
✗ cat examples/curry-no-method.egg
print(+[4](2))
2
In Egg +
is a function that takes an arbritrary number of numbers:
and returns its sum. The curried
Execution:
✗ bin/eggc.js examples/curry-no-method.egg
✗ npx evm examples/curry-no-method.json
6
2
3
However, if the attribute exists we want an ordinary property evaluation, as in this example:
➜ egg-oop-parser-solution git:(master) cat examples/function-length-property.egg
do(
def(f, fun(x, y, +(x,y))),
print(f["numParams"]) # JS length property is not supported
)
➜ egg-oop-parser-solution git:(master) ✗ bin/egg examples/function-length-property
2
2
3
4
5
6
7
We have added an attribute numParams
to the Egg Function objects that returns the number of parameters in its declaration.
The dot operator for objects a.b
is defined in such a way that a.b
and a["b"]
are the same thing. This is why the former program can be rewritten this way:
➜ egg-oop-parser-solution git:(master) ✗ cat examples/curry-no-method-dot.egg
print(+.4(2))
➜ egg-oop-parser-solution git:(master) ✗ bin/egg examples/curry-no-method-dot
6
2
This is another example of currying in Egg:
➜ egg-oop-parser-solution git:(master) cat examples/curry-method.egg
do (
print(4["+"][5](3)),
print(4.+[5](3)), # Same thing 12
print(4["*"][5](3)), # 4["*"](5, 3) # 60
print(6["/"][2](3)), # 6["/"](2, 3) # 1
print(6["-"][2](3)) # 6["/"](2, 3) # 1
)
2
3
4
5
6
7
8
The ambiguities that arise in the expression 4.+
are discussed in section The Dot Ambiguity: Property dot or Mantissa dot?.
➜ egg-oop-parser-solution git:(master) ✗ bin/egg examples/curry-method
12
12
60
1
1
2
3
4
5
6
Design Consideration
The decision of overloading the meaning of the property access for functions is a risky one ⚠️
but has few consequences over the grammar design other than the ambiguity that arise in the expression 4.+
(See section Property dot or Mantissa dot?).
The decision of overloading the meaning of the property access for functions has consequences during the interpretation phase.
In this case the idea behind the proposal is that
Any potential argument of a function can be viewed as a property of such function whose value is the function curried for that argument
which makes the design proposal consistent with the idea of property
# Selectors: the Dot Operator
Most OOP languages allow the use of the notation x.y
as a synonym of x["y"]
.
To add it to Egg we add the production properties -> selector applies
to the grammar.
Lines 8-10 show the rules for the new syntactic variable selector
:
applies -> calls
| properties
| null
properties -> bracketExp applies
| selector applies
selector ->
"." %WORD
| "." %NUMBER
2
3
4
5
6
7
8
9
10
We want to allow programs like the following:
➜ egg-oop-parser-solution git:(master) ✗ cat examples/dot-chain.egg
print([1,4,5].join("-").length) # Same as array(1,4,5)["join"]("-")["length"]
➜ egg-oop-parser-solution git:(master) ✗ bin/egg examples/dot-chain
5
2
3
4
same thing with object literals:
➜ egg-oop-parser-solution git:(master) ✗ cat examples/dot-obj-literal.egg
print({x : 3}.x) # 3
➜ egg-oop-parser-solution git:(master) ✗ bin/egg examples/dot-obj-literal
3
2
3
4
and also:
➜ egg-oop-parser-solution git:(master) ✗ cat examples/dot-num.egg
print(c)
➜ egg-oop-parser-solution git:(master) ✗ bin/egg examples/dot-num
4.30
2
3
4
and even program like this one:
➜ egg-oop-parser-solution git:(master) ✗ cat examples/array-dot.egg
do(
def(a, [[1,2],3]),
print(a.0.1)
)
➜ egg-oop-parser-solution git:(master) ✗ bin/egg examples/array-dot
2
2
3
4
5
6
7
8
Think on the sub-expression above a.0.1
from the lexical analysis point of view. A naif approach will lead to the token's flow [WORD{a}, DOT, NUMBER{0.1}]
# Extended ASTs Tree Grammar
# Introduction
Consider the following Egg program:
✗ cat examples/dot-num.egg
print(4.3.toFixed(2))
2
The AST generated has a new type of node called property
to represent object property access:
✗ cat examples/dot-num.json
{
"type": "apply",
"operator": { "type": "word", "name": "print" },
"args": [
{
"type": "apply",
"operator": {
"type": "property",
"operator": { "type": "value", "value": 4.3, },
"args": [ { "type": "value", "value": "toFixed", } ]
},
"args": [ { "type": "value", "value": 2, } ]
}
]
}
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
- The
type
in line 9 isproperty
, which tell us that this AST node correspond to the operation of accesing the attributes of the object in itsoperator
child. - The
operator
in line 10 refers to the AST of the Egg object being described( 4.3
). - The
args
in line 11 refers to the ASTs of the properties.- The first element of
args
is the AST of a direct property of the object in the operand ( toFixed
). - The second
is a property of the object - The third
is a property of the object - ... and so on
- The first element of
# AST Grammar
Our parser should therefore produce an AST conforming to this tree grammar:
ast: VALUE
| WORD
| APPLY( operator: ast args:[ ast * ]))
| PROPERTY(operator: ast args:[ ast * ]))
2
3
4
Los nodos
APPLY
tienen dos atributosoperator
yargs
El atributo
operator
de un nodoAPPLY
contiene información sobre la función que lo interpreta (if
,while
,print
,+
, etc.)El atributo
args
de un nodoAPPLY
es unARRAY
conteniendo los ASTs que se corresponden con los argumentos para la función asociada conoperator
.Los nodos
PROPERTY
tienen dos atributosoperator
yargs
El atributo
operator
de un nodoPROPERTY
contiene información sobre el objeto (por ejemplo en[1,2,3][0]
el operator sería el AST de[1, 2, 3]
, En{a: 1, b:2}.a
sería el AST de{a: 1, b:2}
)El atributo
args
de un nodoPROPERTY
es unARRAY
conteniendo los ASTs que se corresponden con los atributos/propiedades delobjeto
que está enoperator
. Véase la sección The Shape ofLos nodos
WORD
son nodos hoja y tienen al menos el atributoname
.Los nodos
VALUE
tienen al menos el atributovalue
.
# Example 4.3.toFixed(2)
For example, the AST for 4.3.toFixed(2)
could be described by this term:
APPLY(
operator:PROPERTY(
operator:VALUE{value:4.3},
args:VALUE{value:"toFixed"}
),
args:VALUE{value:2}
)
2
3
4
5
6
7
To the right of the node type and between curly braces we write the attribute: value
pairs that we want to highlight.
If you have difficulties review the section Anatomy of ASTs for Egg
# The Shape of Property ASTs
The final shape of property-type generated ASTs depends on how you implement the functions in the src/build-ast.js
library.
Consider the following input:
➜ egg-oop-parser-solution git:(master) cat examples/ast-shape-for-property-nodes.egg
[[1,2]][0,1]
2
What will be the AST of your compiler for such input?. Here is a simplified notation for the AST generated by my implementation of the parser:
PROPERTY(
op: APPLY(
op: WORD{array},
args: [
APPLY(
op: WORD{array}
args: [ VALUE{1}, VALUE{2}]
)
]
),
args: [VALUE{0}, VALUE{1}]
)
2
3
4
5
6
7
8
9
10
11
12
Notice that the property node args
array has two elements.
Here is the actual JSON:
{
"type": "property",
"operator": {
"type": "apply",
"operator": {
"type": "word",
"length": 5,
"name": "array"
},
"args": [
{
"type": "apply",
"operator": {
"type": "word",
"line": 1,
"col": 3,
"length": 5,
"name": "array"
},
"args": [
{
"type": "value",
"value": 1,
"line": 1,
"col": 3,
"length": 1
},
{
"type": "value",
"value": 2,
"line": 1,
"col": 5,
"length": 1
}
]
}
]
},
"args": [
{
"type": "value",
"value": 0,
"line": 1,
"col": 9,
"length": 1
},
{
"type": "value",
"value": 1,
"line": 1,
"col": 11,
"length": 1
}
]
}
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
# More Examples
Other examples of what args
contains for different property ASTs:
- For the expression
[[1,2],3][0,1]
it would be the ASTs of[0, 1]
or - For
[[1,2],3]["length"]
would be the AST of["length"]
- For
{a: [1, 2], b:2}["a", 0]
would be the ASTs of["a", 0]
)
# The Dot Ambiguity: Property dot or Mantissa dot?
Al introducir el dot para seleccionar la propiedad del objeto se produce una ambiguedad con el punto en los flotantes:
✗ cat test/examples/dot-num.egg
print(4.3.toFixed(2))
2
Propuesta
Se propone que la ambiguedad se resuelva dando prioridad a la interpretación como punto de número si el punto va seguido de un dígito, en otro caso estamos accediendo a la propiedad del número
Ejemplo:
bin/eggc.js test/examples/dot-num.egg
✗ npx evm test/examples/dot-num.json
4.30
2
3
Solution
So, inside the lexical analyzer, we have to force that for a dot to be interpreted as numeric the dot has to be followed by a digit:
const NUMBER = /(?<NUMBER>[-+]?\d+(\.\d+)?(?:[eE][-+]?\d+)?)/; // \d+ to resolve ambiguity
const tokens = [ SPACE, NUMBER, ... DOT, ... ];
2
Esto es diferente de lo que hace JS que no permite usar el punto como selector de atributo:
➜ pl2122apuntes git:(main) node
Welcome to Node.js v16.0.0.
Type ".help" for more information.
> 4.toFixed(2)
4.toFixed(2)
^^
Uncaught SyntaxError: Invalid or unexpected token
2
3
4
5
6
7
En JS la ambiguedad se resuelve parentizando el número:
> 4.toFixed(2)
4.toFixed(2)
^^
Uncaught SyntaxError: Invalid or unexpected token
> (4).toFixed(2)
'4.00'
2
3
4
5
6
# Lexical Transformations
To facilitate the task of doing this lab, it is convenient that we return to the lexer-generator module and modify its API a bit, providing it with the ability to add lexical transformations.
To do this, the nearleyLexer
function will now receive an additional parameter of an object with options:
let lexer = nearleyLexer(tokens, { transform: transformerFun});
The only option we are going to add is transform
. When specified, it applies the transformerFun
function to each of the tokens
of the lexer object generated by nearleyLexer
.
We can have more than one lexical transformations to apply. Thus, we allow the transform
property to be an array, so that the builder nearleyLexer
can be called this way:
let lexer = nearleyLexer(tokens, { transform: [colonTransformer, NumberToDotsTransformer] });
Solution
To achieve the goal we have to modify the reset
method of our nearley compatible object:
reset: function(data, info) {
this.buffer = data || '';
this.currentPos = 0;
let line = info ? info.line : 1;
this.tokens = lexer(data, line);
if (options && options.transform) {
if (typeof options.transform === 'function') {
debugger;
this.tokens = options.transform(this.tokens);
} else if (Array.isArray(options.transform)) {
options.transform.forEach(trans => this.tokens = trans(this.tokens))
}
}
return this;
}
2
3
4
5
6
7
8
9
10
11
12
13
14
15
See the code for the nearley lexer at section La función nearleyLexer of the lab Lexer Generator
# The Lexical Word Colon Transformation
We want to add the colon as syntactic sugar to our language.
We want to transform all the pair subsequences WORD, COLON
into STRING, COMMA
sequences so that phrases like x: 4
are interpreted as "x", 4"
.
In this way we can write a program like this:
✗ cat examples/colon.egg
do(
def(b, [a:4]), # The : is a "lexical" operator
print(b)
)
2
3
4
5
so that when compiled and executed produces:
✗ bin/eggc.js examples/colon.egg
✗ npx evm examples/colon.json
["a",4]
2
3
Proposal
The idea is that inside our lexer we write a lexical transformation function:
function colonTransformer(tokens) {
// ... s/WORD COLON/STRING COMMA/g
return tokens;
}
2
3
4
This transformation is what allow us the elegant syntax to describe the object in the example examples/object-colon-selector.egg
in section introduction
def(x, {
c: [1, 2, 3],
gc: fun(element(self, "c")),
sc: fun(value, =(self.c[0], value)),
inc: fun(=(self.c[0], +(self.c[0], 1)))
})
2
3
4
5
6
# Full Grammar
The following grammar is a NearleyJS non ambiguous grammar that allows the requested features and extends the previous Egg grammar we introduced in lab egg-parser:
program -> expression %EOF
expression ->
%STRING optProperties
| %NUMBER optProperties
| bracketExp optProperties
| curlyExp optProperties
| %WORD applies
applies -> calls
| properties
| null
calls -> parenExp applies
properties -> bracketExp applies
| selector applies
parenExp -> "(" commaExp ")"
bracketExp -> "[" commaExp "]"
curlyExp -> "{" commaExp "}"
selector ->
"." %WORD
| "." %NUMBER
commaExp -> null
| expression ("," expression):*
optProperties -> null
| properties
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
See also the Syntax Diagram/Railroad Diagram
# A new Ambiguity: Number Dot Number
Just for fun and to go beyond what any other programming language allows we want the dot to work with numbers as property selector. This is something, to my knowledge, no language allows. For instance, in JS:
➜ src git:(main) ✗ node
Welcome to Node.js v16.0.0.
Type ".help" for more information.
> a = [[1,2],3,4]
[ [ 1, 2 ], 3, 4 ]
> a[0][0]
1
> a.0.0
a.0.0
^^
Uncaught SyntaxError: Unexpected number
2
3
4
5
6
7
8
9
10
11
You can not use the notation a.0.0
to select the a[0][0]
element since allowing this notation
confuses the interpreter.
Even if the JS designers would take a decision as the one we took in section The Dot Ambiguity: Property dot or Mantissa dot? it will not suffice: The lexer will interpret the 0.0
in a.0.0
as a word a
followed by floating point 0.0
!.
This goal (the dot to work with numbers as property selector) is the reason I introduced the "." %NUMBER
production in the selector
rule:
selector ->
"." %WORD
| "." %NUMBER
2
3
this, if correctly implemented, will allow us to write programs like this one:
✗ cat examples/array-dot.egg
do(
def(a, [[1,2],3]),
print(a.0.1)
)
2
3
4
5
that will produce this output:
✗ bin/eggc.js examples/array-dot.egg
✗ npx evm examples/array-dot.json
2
2
3
the key observation here is that
Disambiguation Rule
In an Egg program a number token corresponding to a floating point as 0.1
or 0.0
can not be preceded by a dot token.
Notice that before a dot token not necessarily comes a word, but it can be a complex expression like in this other example (Observe the first dot at line 4):
✗ cat examples/function-returning-array-dot-number.egg
do(
def(f, fun([[0,Math.PI],2])), # A function that returns an array
print(f().0.1)
)
2
3
4
5
When executed we obtain:
✗ bin/eggc.js examples/function-returning-array-dot-number.egg
✗ npx evm examples/function-returning-array-dot-number.json
3.141592653589793
2
3
Proposal
The proposed solution is to write another lexical transformation:
// Substitute DOT NUMBER{4.3} by DOT NUMBER{4} DOT NUMBER{3}
function NumberToDotsTransformer(tokens) {
/* ... fill the code ... */
return tokens;
}
2
3
4
5
# Array Literals
Let us study now the support for Array Literals. The involved rules are:
expression -> ...
| bracketExp optProperties
bracketExp -> "[" commaExp "]"
optProperties -> null
| properties
2
3
4
5
6
TIP
The idea is that the transformer associated to the bracketExp
rule builds an apply node like
APPLY(operator:(WORD{name:array}, args: commaexp)
where commaexp
is the AST forest associated with the appearance of commaExp
in the production bracketExp -> "[" commaExp "]"
.
# Object Literals
The production rules for object literals are:
expression -> ...
| curlyExp optProperties
curlyExp -> "{" commaExp "}"
optProperties -> null
| properties
2
3
4
5
6
TIP
As for array literals, the idea is that the transformer associated to the curlyExp
rule builds an apply node like
APPLY(operator:(WORD{name:object}, args: commaexp)
# The Begin End Something Language Design Pattern
The solution we have used to solve the two previous sections Array Literals and Object Literals follows a pattern I will call the Begin-End-Something Pattern:
The "Begin End Something" Language Design Pattern
- Add a couple of tokens to the language to signal the beginning and the end of the new specialized category of expressions: for instance add
[
to begin array literals and]
to end array literals- Introduce the new tokens in the lexer (be carefull with conflicts, specially with "expansive" tokens. Don't trample on existing "reserved words")
- Modify the grammar adding the new rule(s) for the new kind of expression
- Build an AST for the the new category by adding a function
buildCategory
to yourbuild-ast.js
library.- The function
buildCategory
returns in fact a specialized case of an already existent kind of AST - Remember to export the new function and import the new function in your grammar file
- The function
Following these instructions it is trivial to extend Egg with a family of constructs as
(
...)
as a synonym ofdo( ...)
: See an example in the branchdoendo
of the solution repo➜ egg-oop-parser-solution git:(doendo) ✗ cat examples/do-endo.egg ( def(a,4), print(a) ) ➜ egg-oop-parser-solution git:(doendo) ✗ bin/egg examples/do-endo 4
1
2
3
4
5
6
7loop
...end loop
orWhile
...end While
as a synonym ofwhile(...)
. Do not usewhile
...end while
for the delimiter tokens or you will trample with the already existing wordwhile
etc.
# Error Management
The errors produced by Nearley.JS are quite verbose:
➜ egg-oop-parser-solution git:(b2bc2de) cat test/errors/unexpected-token.egg
+{2,3}
2
➜ egg-oop-parser-solution git:(b2bc2de) bin/eggc.js test/errors/unexpected-token.egg
There was an error: Error near "{" in line 1
Unexpected LCB token: "{". Instead, I was expecting to see one of the following:
A "(" based on:
parenExp → ● "(" commaExp ")"
calls → ● parenExp applies
applies → ● calls
expression → %WORD ● applies
program → ● expression %EOF
A "[" based on:
bracketExp → ● "[" commaExp "]"
properties → ● bracketExp applies
applies → ● properties
expression → %WORD ● applies
program → ● expression %EOF
A "." based on:
selector → ● "." %WORD
properties → ● selector applies
applies → ● properties
expression → %WORD ● applies
program → ● expression %EOF
A "." based on:
selector → ● "." %NUMBER
properties → ● selector applies
applies → ● properties
expression → %WORD ● applies
program → ● expression %EOF
A EOF token based on:
program → expression ● %EOF
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
In version 2.20.1
of Nearley, the Error object has an attribute token
than can be used to simplify the error message.
In the example below we make use of a RegExp to traverse the message
attribute of the error and add to the message the expected tokens. In Nearley JS error message you can see many repetitions of the A "<something>" based on:
pattern that for named tokens changes to A <something> token based on:
function parseFromFile(origin) {
try {
const parser = new nearley.Parser(nearley.Grammar.fromCompiled(grammar));
const source = fs.readFileSync(origin, 'utf8');
parser.feed(source);
let results = parser.results;
if (results.length > 1) throw new Error(`Language Design Error: Ambiguous Grammar! Generated ${results.length}) ASTs`);
if (results.length == 0) {
console.error("Unexpected end of Input error. Incomplete Egg program. Expected more input");
process.exit(1);
}
const ast = results[0];
return ast;
}
catch(e) {
let token = e.token;
let message = e.message;
let expected = message.match(/(?<=A ).*(?= based on:)/g).map(s => s.replace(/\s+token/i,''));
let newMessage = `Unexpected ${token.type} token "${token.value}" `+
`at line ${token.line} col ${token.col}.`;
if (expected && expected.length) newMessage += ` Tokens expected: ${[...new Set(expected)]}`;
throw new Error(newMessage)
}
}
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
When executed with an erroneous input the message is simplified to:
➜ egg-oop-parser-solution git:(master) ✗ bin/eggc.js test/errors/unexpected-token.egg
Unexpected LCB token "{" at line 1 col 2. Tokens expected: "(","[",".",EOF
2
Another related idea with error management is to introduce in your Grammar production rules for specific error situations with an associated semantic action that deals with the error. For instance, the rule at line 8 expression -> %EOF
is added to control when in the middle of the parsing an unexpected end of file occurs:
expression ->
%STRING optProperties {% buildStringValue %}
| %NUMBER optProperties {% buildNumberValue %}
| bracketExp optProperties {% buildArray %}
| curlyExp optProperties {% buildObject %}
| "(" commaExp ")" {% buildDo %}
| %WORD applies {% buildWordApplies %}
| %EOF {% dealWithError %}
2
3
4
5
6
7
8
➜ egg-oop-parser-solution git:(master) ✗ bin/eggc.js test/errors/unexpected-eof.egg
Unexpected EOF token near line 1, col 4. Found EOF
2
# Resources
- GitHub Actions. The Secrets context
- Installing Private Packages in a GitHub Action
- latest-version-cli (opens new window) Get the latest version of an npm package
- Egg Virtual Machine with OOP extensions for Windows/Linux/Mac OS (opens new window)
- Eloquent JS: Chapter 11. Project: A Programming Language (opens new window)
- Vídeo Programando un bucle REPL para el lenguaje Egg (opens new window)
- El módulo @ull-esit-pl/example2test (opens new window)
Grading Rubric#
#Labs
- Task GitHub-AluXXXX Form
- Lab GitHub Campus Expert
- Lab GitHub Project Board
- Lab GitPod and Visual Studio Code
- Lab IAAS
- Lab Espree Logging
- Lab Hello Compilers
- Lab Constant Folding
- Lab ast-types
- Lab egg-parser
- Lab Lexer Generator
- Lab The Egg Interpreter
- Lab Adding OOP to the Egg Parser
- Lab Extending the Egg Interpreter
- Lab TFA: Final Project PL
- Task Training Exam for PL