The SA-110 was the first microprocessor in the StrongARM family. The first versions, operating at 100, 160, and 200 MHz, were announced on 5 February 1996. Samples of these versions were available at announcement, with volume production slated for December 1996. Throughout 1996, the SA-110 was the highest performing microprocessor for portable devices. Towards the end of 1996 it was a leading CPU for internet/intranet appliances and
thin client systems. The SA-110's first design win was the
Apple MessagePad 2000. It was also used in a number of products including the
Acorn Computers Risc PC and
Eidos Optima video editing system. The SA-110's lead designers were
Daniel W. Dobberpuhl, Gregory W. Hoeppner, Liam Madden, and Richard T. Witek.
Description The SA-110 had a simple
microarchitecture. It was a
scalar design that executed instructions
in-order with a five-stage
classic RISC pipeline. The microprocessor was partitioned into several blocks, the IBOX, EBOX, IMMU, DMMU, BIU, WB and PLL. The IBOX contained hardware that operated in the first two stages of the pipeline such as the
program counter. It fetched, decoded and issued instructions. Instruction fetch occurs during the first stage, decode and issue during the second. The IBOX decodes the more complex instructions in the ARM instruction set by translating them into sequences of simpler instructions. The IBOX also handled branch instructions. The SA-110 did not have
branch prediction hardware, but had mechanisms for their speedy processing. Execution starts at stage three. The hardware that operates during this stage is contained in the EBOX, which comprises the
register file,
arithmetic logic unit (ALU),
barrel shifter,
multiplier and condition code logic. The register file had three read ports and two write ports. The ALU and barrel shifter executed instructions in a single cycle. The multiplier is not pipelined and has a latency of multiple cycles. The IMMU and DMMU are
memory management units for instructions and data, respectively. Each MMU contained a 32-entry
fully associative translation lookaside buffer (TLB) that can map 4 KB, 64 KB or 1 MB
pages. The write buffer (WB) has eight 16-byte entries. It enables the pipelining of stores. The bus interface unit (BIU) provided the SA-110 with an external interface. The
PLL generates the internal
clock signal from an external 3.68 MHz clock signal. It was not designed by DEC, but was contracted to the Centre Suisse d'Electronique et de Microtechnique (CSEM) located in
Neuchâtel,
Switzerland. The instruction
cache and data cache each have a capacity of 16 KB and are 32-way
set-associative and virtually addressed. The SA-110 was designed to be used with slow (and therefore low-cost) memory and therefore the high set associativity allows a higher hit rate than competing designs, and the use of virtual addresses allows memory to be simultaneously cached and uncached. The caches are responsible for most of the transistor count and they take up half the die area. The SA-110 contained 2.5 million transistors and is 7.8 mm by 6.4 mm large (49.92 mm2). It was fabricated by DEC in its proprietary CMOS-6 process at its Fab 6
fab in Hudson, Massachusetts. CMOS-6 was DEC's sixth-generation
complementary metal–oxide–semiconductor (CMOS) process. CMOS-6 has a 0.35 μm feature size, a 0.25 μm effective channel length but for use with the SA-110, only three levels of
aluminium interconnect. It used a power supply with a variable voltage of 1.2 to 2.2
volts (V) to enable designs to find a balance between power consumption and performance (higher voltages enable higher clock rates). The SA-110 was packaged in a 144-pin
thin quad flat pack (TQFP). ==SA-1100==