PA3: Code Generator

PA3: Code Generator Due Monday, 14 April 2025, 11:59PM AoE.

You may complete this assignment in any language listed on the languages page in buckets 1, 2, or 3. You are required to complete this assignment and PA4 in the same language (you are expected to modify you PA3 assignment for PA4).

You may work in a team of two people for this assignment. If you worked with a teammate on PA2, you do not need to work with the same teammate for this assignment. You do not need to keep the same teammate between this assignment and PA4, but if you choose not to work with your PA3 teammate again on PA4, you must work alone on PA4. The course staff are not responsible for finding you a willing teammate. You are always permitted to work alone, if you choose.

Goal

For this assignment you will write a code generator. Among other things, this involves implementing the operational semantics specification of Cool. You will track enough information to generate legitimate run-time errors (e.g., dispatch on void). You do not have to worry about “malformed input” because the semantic analyzer (from PA2) has already ruled out bad programs. You will also write additional code to unserialize the class map, implementation map, parent map, and annotated AST produced by the semantic analyzer.

Summary of Checkpoints

You need to make four submissions for this assignment:

PA3c1 requires you to write test cases (i.e., Cool programs) that find injected bugs in our reference compiler. This checkpoint ensures that you have a robust test suite for your own compiler before you begin implementing it in earnest. This checkpoint is relatively straightforward (you just need to write small Cool programs), so it due early: on Friday, March 7.
PA3c2 requires you to produce a simple intermediate representation for (some) Cool programs (“three-address code” (TAC), which we’ll discuss in class). While it’s up to you how much you want to use TAC for the rest of PA3, we think it’s a useful intermediate representation, and we want to force you to make some progress early. And, it’s hard to break the compiler up into sensible chunks, so this is the best that we can do. Since producing TAC is pretty straightforward compared to the rest of the compiler implementation, this checkpoint is due relatively quickly after PA3c1: on Monday, March 17.
PA3c3 requires that your compiler generates assembly instructions for certain simple Cool programs. This is harder than it sounds: once you have this part working, getting to the full compiler is usually easier than getting to this point. For that reason, we give you a few weeks to make it to this point: it is due on Wednesday, April 2.
PA3 (full) requires that your compiler handle all of Cool. It is due on Monday, April 14.

Specification

You will eventually create three artifacts:

A program that takes a single command-line argument (e.g., file.cl-type). That argument will be an ASCII text Cool class map, implementation map, parent map and annotated AST file (as described in PA2). Your program must emit x86-64 Assembly Language (file.s). Your program will consist of a number of source files in a single language.
- Compiling file.s with gcc -no-pie -static on a 64-bit x86 Linux system (specifically, Ubuntu 22.04) must produce an executable that, when run, produces the correct output for file.cl according to Cool’s operational semantics.
- You will only be given .cl-type files from programs that pass the semantic analysis phase of the reference compiler. You are not responsible for correctly handling (1+"hello") programs.
A plain ASCII text file called readme.txt describing your design decisions and choice of test cases. See the grading rubric. A few paragraphs should suffice.
Testcases test1.cl, test2.cl, test3.cl and test4.cl. The testcases should exercise compiler and run-time error corner cases. See the description of PA3c1, below.

Grading

Out of 100 points:

60 points - for autograder tests
- Each missed test removes points, to a minimum of 0/70, even if there are more or fewer tests than total points.
4 points - for a correct PA3c1 submission. This score will be pro-rated based on your autograder score (e.g., if you discover 12/15 = 80% of the bugs, you will receive 80% of 4 or 3.2 points).
4 points - for your PA3c2 submission. This score will be pro-rated based on your autograder score.
8 points - for your PA3c3 submission. This score will be pro-rated based on your autograder score.
8 points - for a clear description in your README
- 8 - thorough discussion of design decisions (e.g., the handling of let and new and dispatch) and choice of test cases; a few paragraphs of coherent English sentences should be fine
- 4 - vague or hard to understand; omits important details
- 0 - little to no effort, or submitted an RTF/DOC/PDF file instead of plain TXT
8 points - for valid test1.cl, test2.cl, test3.cl and test4.cl files
- 8 - wide range of test cases added, stressing most Cool features and some error conditions
- 4 - added some tests, but the scope not sufficiently broad
- 0 - little to no effort
8 points - for code cleanliness
- 8 - code is mostly clean and well-commented
- 4 - code is sloppy and/or poorly commented in places
- 0 - little to no effort to organize and document code

PA3c1: Testing the Reference Compiler

PA3c1 is a preliminary testing exercise that introduces a form of test-driven development or mutation testing into our software development process and requires you to construct a high-quality test suite.

The goal of PA3c1 is to leave you with a high-quality test suite of Cool programs that you can use to evaluate your own PA3 and PA4 code generators. Writing a code generator requires you to consider many corner cases when reading the formal and informal semantics in the Cool Reference Manual. While you you can check for correct “positive” behavior by executing your code generator’s output against the reference compiler on existing “good” Cool programs, it is comparatively harder to check for “negative” behavior (i.e., run-time errors, strange corner cases).

If you fail to construct a rich test suite of semantically-valid programs you will face a frustrating series of “you fail held-out negative test x” reports for PA3c3 and PA3 proper, which can turn into unproductive guessing games. Because students often report that this is frustrating (even though it is, shall we say, infinitely more realistic than making all of the post-deployment tests visible in advance), the PA3c1 preliminary testing exercise provides a structured means to help you get started with the construction of a rich test suite.

The course staff have produced 21 variants of the reference compiler, each with a secret intentionally-introduced defect related to code generation. A high-quality test suite is one that reveals each introduced defect by showing a difference between the behavior of the true reference compiler and the corresponding buggy version. You desire a high-quality test suite to help you gain confidence in your own PA3 (and, eventually, PA4) submission.

For PA3c1, you must produce semantically-valid Cool programs (test cases). There are 21 separate held-out seeded code generator bugs waiting on the grading server. For each bug, if one of your tests causes the reference and the buggy version to produce difference output, you win: that test has revealed that bug. For full credit your tests must reveal at least 15 of the 21 unknown defects.

The secret defects that we have injected into the reference compiler correspond to common defects made by students in PA3. Thus, if you make a rich test suite for PA3c1 that reveals many defects, you can use it on your own PA3 submission to reveal and fix your own bugs!

For PA3c1 you should turn in (electronically):

A set of .cl files: Cool code generator testcases.
- Each testcase you submit must be syntactically semantically valid (i.e., must pass cool --type).
- Each testcase you submit may have a corresponding input file. For example, if you submit wes.cl you may also submit wes.cl-input (if you do, you must follow this naming convention or it will be ignored).
- Each testcase you submit must be at most 2048 characters (i.e., wc -c yourtest.cl says 2048 or less). You want each of your testcases to be meaningful so that it helps you narrow down a bug later (cf. Delta Debugging).
  - You, A Mischevious Student: Ha ha! I will get around this edict by removing comments and compressing whitespace and variable names.
  - Me, A Former Mischevious Student: Remember, the point of PA3c1 is not to make me happy, but to provide you with a high quality test suite. You’re only hurting yourself later by making low-quality tests now. When your compiler fails one of your own tests later you’ll end up shrinking the test down as part of debugging, so you’re not actually saving yourself any time.
- No testcase should be named bug… or ref… because the testing server uses those prefices internally. If you submit a test case with such a name it will be ignored. (Limited course resources can either be spent making better lectures, grading your assignments, etc., or ironing out wrinkles such as these from grading scripts. We have chosen to focus on pedagogy.)

Hint: because you can find “positive” bugs in your code generator more easily (e.g., by running your code generator on the known-good Cool programs from cool-examples.zip), this exercise is somewhat biased toward “negative” or “tricky” bugs.

PA3c2: Three-Address Code Generator

For this checkpoint you will write a program that converts the abtract syntax tree for a method into three-address code. While your full compiler is not required to use three-address code as one of its intermediate representations, we strongly encourage you to do so. The first checkpoint for PA4 will also use the cl-tac three-address code format, so it behooves you to maintain your implementation.

Your program will take a .cl-type program (like those you generated for PA2) as input, and then generate a .cl-tac program in the format described below.

Three-Address Code Example

Three-address code is a simplified representation focusing on assignments to variables and unary or binary operators.

The code below computes (1+3)*7 prints out the result, 28:

a <- int 1
b <- int 3
c <- + a b 
d <- int 7
e <- * c d 
retval <- call out_int e 
return retval 

In three address code the expression (1+3)*7 must be broken down so that the addition and multiplication operations are each handled separately. (The exact format is described below.)

As a second example, the code below reads in an integer and computes its absolute value (by checking to see if it is negative and subtracting it from zero if so):

x <- call in_int
z <- int 0
b <- < x z 
bt b is_negative
output <- x
jmp do_the_printing
label is_negative
output <- - z x 
label do_the_printing
retval <- call out_int output 
return retval

Note the use of labels, conditional branches and unconditional jumps. You can use the cool reference compiler to evaluate .cl-tac files:

$ echo -87 | ./cool absval.cl-tac
87

Expressions to Three-Address Code

Three-address code is a simplified representation focusing on assignments to variables and unary or binary operators.

The traditional approach to converting expressions to three-address code involves a recursive descent traversal of the abstract syntax tree. The recursive descent traversal returns both a three-address code instruction as well as a list of additional instructions that should be prepended to the output.

Consider the following pseudocode:

let rec convert (a : ast) : (tac_instr list * tac_expr) =
                match a with
                | AST_Variable(v) -> [], TAC_Variable(v)
                | AST_Int(i) -> 
                        let new_var = fresh_variable () in
                        [TAC_Assign_Int(new_var, i)], (TAC_Variable(new_var)
                | AST_Plus(a1,a2) ->
                        let i1, ta1 = convert a1 in 
                        let i2, ta2 = convert a2 in 
                        let new_var = fresh_variable () in
                        let to_output = TAC_Assign_Plus(new_var, ta1, ta2) in
                        (i1 @ i2 @ [to_output]), (TAC_Variable(new_var))
                | ... 

On the input (1+3)+5, this code first calls itself recursively on (1+3). To convert (1+3), it calls itself recursively on 1 and 3. To convert 1, it finds a fresh variable x, outputs x <- 1, and returns x. Similarly, to convert 3, it finds a fresh variable y, outputs y <- 3, and returns y. Now it can convert 1+3 by finding a fresh variable z and outputting z <- x + y, returning z. The final output would look like:

temp1 <- int 1
temp2 <- int 3
temp3 <- + temp1 temp2
temp4 <- int 5
temp5 <- + temp3 temp4

Control-Flow to Three-Address Code

The traditional approach to converting control-flow statements to three-address code involves a recursive descent traversal of the abstract syntax tree. The recursive descent traversal returns a list of three-address code instructions.

For example, an instruction of the form if COND then THEN_BRANCH else ELSE_BRANCH typically becomes:

... code to evaluate COND
bt COND then_label
... code to evaluate ELSE_BRANCH
jmp end_label
label then_label
... code to evaluate THEN_BRACH
label end_label

You must consider other control-flow instructions (e.g., while loops, short-circuit boolean evaluation, etc.) and similarly convert them.

`.cl-tac` Format

The format of .cl-tac files is designed to be easy to serialize and deserialize — you can process it in just a few lines, without needing special parsing tools.

A .cl-tac file is text-based and line-based. Your program must support Unix, Mac and Windows text files — regardless of where you, personally, developed your program. Note that lines can be delineated by carriage returns, new lines, or both (e.g., the regular expression [\r\n]+ may be helpful).

Possible line contents are (stand-ins for lexemes in italics):

Arithmetic

x <- + y z
x <- - y z
x <- * y z
x <- / y z

Comparisons

x <- < y z
x <- <= y z
x <- = y z

Constants:

x <- int integer
x <- bool boolean
x <- string\n string-on-next-line \n (Note that the string constant will appear on the next line. The string constant assignment is the only three-address form that takes two lines.)

Boolean Negation:

x <- not y

Arithmetic Negation:

x <- ~ y

Object Allocation:

x <- new type (Relevant types are Int, String, Bool and Object. See the Cool Reference Manual.)

Object Default Value:

x <- default type

Null Check:

x <- isvoid y

Function Calls:

x <- call out_int y
x <- call out_string y
x <- call in_int
x <- call in_string

Branches, Labels and Control Flow:

jmp label
label label
return x
comment …text until end of line…
bt x label (The bt instruction branches if the variable x is True (x must be a Bool); otherwise control falls through to the next instruction.)

Commentary

You can use cool --type to obtain a .cl-type file.

You can use cool --opt --cfg to obtain control-flow graph images that can be processed by GraphViz.

You can use cool --tac file.cl to produce file.cl-tac for the first method in that Cool file. Thus you can use the Cool reference compiler to produce three address code for you automatically, given Cool source code. You should do this to test your PA3c2 code.

The input program will introduce only one class, Main, with only one method, main. So, you can either match the reference compiler’s behavior (and print out the TAC for only the first method) or print out TAC for all methods - we will not be able to tell the difference on the grading server.

What to Turn In for PA3c2

You must turn in these files:

source_files — your implementation, including exactly one of the following files:
- main.c
- main.py
- main.cpp
- Main.java
- main.kt
- main.rs
- Main.scala
- main.ml
- main.hs

PA3c3: Code Generation For a Restricted Subset of Cool

For this checkpoint you will write a program that generates assembly instructions for certain simple Cool programs. This program should be the basis for your full compiler.

Your goal for this checkpoint is to write a correct code generator for a subset of Cool. Among other things, this involves implementing the operational semantics specification of Cool. You do not have to track information to generate run-time errors (e.g., division by zero, dispatch on void) for this checkpoint. You also do not have to worry about “malformed input” because the semantic analyzer (from PA2) will rule out bad programs.

You will also write additional code to unserialize the class map, implementation map, parent map, and annotated AST produced by the semantic analyzer.

No Error Reporting

For this checkpoint you do not have to generate assembly code that checks for or reports run-time errors. (You’ll do that for the full compiler.) You are guaranteed that the test cases on the grading server for PA3c3 do not trigger run-time errors in the reference compiler (your compiler, of course, might still produce incorrect code…). However, you can save yourself from subsequent regret by remembering that you will eventually have to do so (e.g., don’t throw away that line-number information!).

Restricted Subset of Cool

For this checkpoint, you only have to handle simple Cool programs that use a subset of the possible expressions. The input program will introduce only one class, Main, with only one method, main. The following AST or language elements will not appear:

attribute_*
self_dispatch — except IO (e.g., in_out, out_int)
static_dispatch
dynamic_dispatch
internal functions
while
isvoid
case
new
let_binding_init
any novel class/method — except “Main.main”
run-time errors (e.g., division by zero)
strings — programs will largely involve integers and booleans

This results in single-class, single-method Cool programs focusing on integer and boolean manipulation.

Commentary

I strongly recommend that you convert to a control-flow graph where the basic blocks are three-address code (e.g., using the TAC representation from the previous checkpoint…) and convert that three-address code into assembly.

You will have to handle certain internal functions (e.g., IO.out_string) that are defined in the CRM. You might want to look at the assembly code that the reference compiler emits for them.

Whitespace and newlines do not matter in your file.s assembly code. However, whitespace and newlines do matter for the output of your generated assembly! This is because you are specifically being asked to implement IO (and, later, substring) functions.

You should implement all of the relevant operational semantics rules in the Reference Manual. You will also have to implement all of the relevant built-in functions on the IO class.

What to Turn In for PA3c3

You must turn in these files:

source_files — your implementation, including exactly one of the following files:
- main.c
- main.py
- main.cpp
- Main.java
- main.kt
- main.rs
- Main.scala
- main.ml
- main.hs

PA3: The Full Compiler

For the full compiler, you need to implement a code generator that emits correct code for all valid Cool programs. Moreover, your generated assembly must abort gracefully if the input Cool program encounters a run-time error while executing: that is, your generated assembly may not segfault when we execute it.

Error Reporting

You are guaranteed that the file.cl-type input file will be correctly formed and will correspond to a well-typed Cool program. Thus, there will not be any direct static errors in the input. Those were all caught by the semantic analyzer earlier.

However, you must generate file.s so that it checks for and reports run-time errors. When your file.s program detects an error, it should use Cool’s built-ins (e.g., IO.out_string)via assembly instructions to cause an error string to be printed to the screen.

To report an error, write the string ERROR: line_number: Exception: message and terminate the program. You may generate your file.s so that it writes whatever you want in the message, but it should be fairly indicative. Example erroneous input:

class Main inherits IO {
  my_void_io : IO ; -- no initializer => void value
  main() : Object {
    my_void_io.out_string("Hello, world.\n")
  } ;
} ;

For such an input, you must generate a well-formed file.s assmebly language file. However, when that file is executed on an x86-64 machine, it will produce output such as:

ERROR: 4: Exception: dispatch on void

To put this another way, rather than actually checking for errors directly, you must generate assembly code that will later check for and report errors. You probably want to examine how the reference compiler does this to get an idea of what to do.

Line Numbers for Error Reporting

The typing rules do not directly specify the line numbers on which errors are to be reported. As of v1.40, the Cool reference compiler uses these guidelines:

Stack overflow: undefined (not your responsibility)
Division by zero: location of the division expression
Substring out of range: location of the internal expression (i.e., 0)
Dispatch on void: location of dispatch expression
case-related errors: location of case expression

Note that this list specifically excludes stack overflows. It is not your responsibility to handle them, and code you generate may segfault when it encounters a stack overflow. The reference compiler does this.

Note that the reference interpreter uses different line numbers in some cases; you must match the reference compiler (which you invoke by passing the --x86 argument to cool).

Commentary

You will have to handle all of the internal functions (e.g., IO.out_string) that are defined in the Cool Reference Manual.

You can do basic testing as follows:

cool --x86 file.cl-type
gcc -no-pie -static file.s
./a.out &> reference-output
my-code-generator file.cl-type
gcc -no-pie -static file.s
./a.out &> my-output
diff my-output reference-output

Whitespace and newlines do not matter in your file.s assembly code. However, whitespace and newlines do matter for the output of your assembly code. This is because you are specifically being asked to implement IO and string functions.

You should implement all of the operational semantics rules in the Reference Manual. You will also have to implement all of the built-in functions on the five Basic Classes.

What to Turn In for PA3 (full)

You must turn in these files:

source_files — your implementation, including exactly one of the following files:
- main.c
- main.py
- main.cpp
- Main.java
- main.kt
- main.rs
- Main.scala
- main.ml
- main.hs
readme.txt - your README file
test1.cl, test2.cl, test3.cl, and test4.cl - novel or otherwise interesting test cases for the code generator or any of its checkpoints. Describe why each is novel or interesting in your readme.txt.