The
simple Sethi–Ullman algorithm works as follows (for a
load/store architecture): • Traverse the
abstract syntax tree in pre- or postorder • For every leaf node, if it is a non-constant left-child, assign a 1 (i.e. 1 register is needed to hold the variable/field/etc.), otherwise assign a 0 (it is a non-constant right child or constant leaf node (RHS of an operation – literals, values)). • For every non-leaf node, if the left and right subtrees respectively need different numbers of registers
l and
r, then assign max(
l,
r), otherwise assign
r + 1. • To emit code, if the subtrees need different numbers of registers, evaluate the subtree needing the most registers first (since the register needed to save the result of one subtree may make the other one
spill), otherwise the order is irrelevant.
Example For an arithmetic expression a = (b + c + f * g)*(d+3), the
abstract syntax tree looks like this: = / \ a * / \ / \ + + / \ / \ / \ d 3 + * / \ / \ b c f g To continue with the algorithm, we need only to examine the arithmetic expression (b + c + f * g) * (d + 3), i.e. we only have to look at the right subtree of the assignment '=': * / \ / \ + + / \ / \ / \ d 3 + * / \ / \ b c f g Now we start traversing the tree (in preorder for now), assigning the number of registers needed to evaluate each subtree (note that the last summand in the expression (b + c + f * g) * (d + 3) is a constant): *
2 / \ / \ +
2 +
1 / \ / \ / \ d
1 3
0 +
1 *
1 / \ / \ b
1 c
0f
1 g
0 From this tree it can be seen that we need 2 registers to compute the left subtree of the '*', but only 1 register to compute the right subtree. Nodes 'c' and 'g' do not need registers for the following reasons: If T is a tree leaf, then the number of registers to evaluate T is either 1 or 0 depending whether T is a left or a right subtree (since an operation such as add R1, A can handle the right component A directly without storing it into a register). Therefore we shall start to emit code for the left subtree first, because we might run into the situation that we only have 2 registers left to compute the whole expression. If we now computed the right subtree first (which needs only 1 register), we would then need a register to hold the result of the right subtree while computing the left subtree (which would still need 2 registers), therefore needing 3 registers concurrently. Computing the left subtree first needs 2 registers, but the result can be stored in 1, and since the right subtree needs only 1 register to compute, the evaluation of the expression can do with only 2 registers left. ==Advanced Sethi–Ullman algorithm==