ATI had held the lead for a while with the
Radeon 8500 but
Nvidia retook the performance crown with the launch of the
GeForce 4 Ti line. A new high-end refresh part, the 8500XT (R250) was supposedly in the works, ready to compete against NVIDIA's high-end offerings, particularly the top line Ti 4600. Pre-release information listed a 300
MHz core and RAM clock speed for the
R250 chip. ATI, perhaps mindful of what had happened to
3dfx when they took focus off their
Rampage processor, abandoned it in favor of finishing off their next-generation R300 card. This proved to be a wise move, as it enabled ATI to take the lead in development for the first time instead of trailing NVIDIA. The R300, with its next-generation architecture giving it unprecedented features and performance, would have been superior to any R250 refresh. The R3xx chip was designed by ATI's West Coast team (formerly
ArtX Inc.), and the first product to use it was the Radeon 9700 PRO (internal ATI code name: R300; internal ArtX codename: Khan), launched in August 2002. The architecture of R300 was quite different from its predecessor, Radeon 8500 (
R200), in nearly every way. The core of 9700 PRO was manufactured on a 150 nm
chip fabrication process, similar to the Radeon 8500. However, refined design and manufacturing techniques enabled a doubling of transistor count
and a significant clock speed gain. One major change with the manufacturing of the core was the use of the
flip-chip packaging, a technology not used previously on
video cards. Flip chip packaging allows far better cooling of the die by flipping it and exposing it directly to the
cooling solution. ATI thus could achieve higher clock speeds. Radeon 9700 PRO was launched clocked at 325 MHz, ahead of the originally projected 300 MHz. With a transistor count of 110 million, it was the largest and most complex GPU of the time. A slower chip, the 9700, was launched a few months later, differing only by lower core and memory speeds. Despite that, the Radeon 9700 PRO was clocked significantly higher than the
Matrox Parhelia 512, a card released but months before R300 and considered to be the pinnacle of graphics chip manufacturing (with 80 million
transistors at 220 MHz), up until R300's arrival.
Architecture The chip adopted an architecture consisting of 8 pixel pipelines, each with 1
texture mapping unit (an 8x1 design). While this differed from the older chips using 2 (or 3 for the original Radeon) texture units per pipeline, this did not mean
R300 could not perform multi-texturing as efficiently as older chips. Its texture units could perform a new
loopback operation which allowed them to sample up to 16 textures per geometry pass. The textures can be any combination of one, two, or three dimensions with
bilinear,
trilinear, or
anisotropic filtering. This was part of the new DirectX 9 specification, along with more flexible floating-point-based Shader Model 2.0+
pixel shaders and
vertex shaders. Equipped with 4 vertex shader units,
R300 possessed over twice the
geometry processing capability of the preceding Radeon 8500 and the
GeForce4 Ti 4600, in addition to the greater feature-set offered compared to DirectX 8 shaders. ATI demonstrated part of what was capable with pixel shader PS2.0 with their
Rendering with Natural Light demo. The demo was a real-time implementation of noted 3D graphics researcher
Paul Debevec's paper on the topic of
high dynamic range rendering. A noteworthy limitation is that all R300-generation chips were designed for a maximum
floating point precision of 96-bit, or
FP24, instead of DirectX 9's maximum of 128-bit
FP32. DirectX 9.0 specified FP24 as a minimum level for conforming to the specification for full precision. This trade-off in precision offered the best combination of transistor usage and image quality for the manufacturing process at the time. It did cause a usually visibly imperceptible loss of quality when doing heavy blending. ATI's Radeon chips did not go above FP24 until
R520. The R300 was the first board to truly take advantage of a 256-bit memory bus.
Matrox had released their Parhelia 512 several months earlier, but this board did not show great gains with its 256-bit bus. ATI, however, had not only doubled their bus to 256-bit, but also integrated an advanced crossbar memory controller, somewhat similar to
NVIDIA's memory technology. Utilizing four individual load-balanced 64-bit memory controllers, ATI's memory implementation was quite capable of achieving high bandwidth efficiency by maintaining adequate granularity of memory transactions and thus working around memory latency limitations. "R300" was also given the latest refinement of ATI's innovative
HyperZ memory bandwidth and fillrate saving technology,
HyperZ III. The demands of the 8x1 architecture required more bandwidth than the 128-bit bus designs of the previous generation due to having double the texture and pixel fillrate. Radeon 9700 introduced ATI's multi-sample
gamma-corrected anti-aliasing scheme. The chip offered sparse-sampling in modes including 2×, 4×, and 6×. Multi-sampling offered vastly superior performance over the
supersampling method on older Radeons, and superior image quality compared to NVIDIA's offerings at the time. Anti-aliasing was, for the first time, a fully usable option even in the newest and most demanding titles of the day. The R300 also offered advanced anisotropic filtering which incurred a much smaller performance hit than the anisotropic solution of the GeForce4 and other competitors' cards, while offering significantly improved quality over Radeon 8500's anisotropic filtering implementation which was highly angle dependent. On March 14, 2008, AMD released the 3D Register Reference for R3xx.
Performance Radeon 9700's architecture was very efficient and much more advanced compared to its peers of 2002. Under normal conditions, the Radeon 9700 Pro outperforms the
GeForce4 Ti 4600, the previous top-end card, by 4-101%. and up to 278%, when anti-aliasing (AA) and/or anisotropic filtering (AF) was enabled. At the time, this was quite special, and resulted in the widespread acceptance of AA and AF as truly usable features. Besides advanced architecture, reviewers also took note of ATI's change in strategy. The 9700 would be the second of ATI's chips (after the 8500) to be shipped to third-party manufacturers instead of ATI producing all of its graphics cards, though ATI would still produce cards off of its highest-end chips. This freed up engineering resources that were channeled towards
driver improvements, and the 9700 performed phenomenally well at launch because of this.
id Software technical director
John Carmack had the Radeon 9700 run the
E3 Doom 3 demonstration. The performance and quality increases offered by the R300 GPU are considered to be one of the greatest in the history of 3D graphics, alongside the achievements
GeForce 256 and
Voodoo Graphics. Furthermore, NVIDIA's response in the form of the
GeForce FX 5800 was both late to market and somewhat unimpressive, especially when pixel shading was used. R300 would become one of the GPUs with the longest useful lifetime in history, allowing playable performance in new games at least 3 years after its launch. ==Further releases==