Graphics Core Next 1 The GCN 1 microarchitecture was used in several
Radeon HD 7000 series graphics cards. • support for 64-bit addressing (
x86-64 address space) with unified address space for CPU and GPU • GPU sends
interrupt requests to CPU on various events (such as
page faults) • support for Partially Resident Textures, which enable virtual memory support through
DirectX and
OpenGL extensions •
AMD PowerTune support, which dynamically adjusts performance to stay within a specific TDP • support for
Mantle (API),
Vulkan and OpenGL 4.6 There are Asynchronous Compute Engines controlling computation and dispatching.
ZeroCore Power ZeroCore Power is a long idle power saving technology, shutting off functional units of the GPU when not in use. AMD ZeroCore Power technology supplements
AMD PowerTune.
Chips Discrete GPUs (Southern Islands family): • Hainan • Oland • Cape Verde • Pitcairn • Tahiti
Graphics Core Next 2 "Bonaire" The 2nd generation of GCN was introduced with the
Radeon HD 7790 and is also found in the
Radeon HD 8770,
R7 260/260X, R9 290/290X, R9 295X2,
R7 360, and R9 390/390X, as well as
Steamroller-based
desktop "Kaveri" APUs and
mobile "Kaveri" APUs and in the
Puma-based
"Beema" and "Mullins" APUs. It has multiple advantages over the original GCN, including
FreeSync support,
AMD TrueAudio and a revised version of
AMD PowerTune technology. GCN 2nd generation introduced an entity called "Shader Engine" (SE). A Shader Engine comprises one geometry processor, up to 44 CUs (Hawaii chip), rasterizers,
ROPs, and L1 cache. Not part of a Shader Engine is the Graphics Command Processor, the 8 ACEs, the L2 cache and memory controllers as well as the audio and video accelerators, the display controllers, the 2
DMA controllers and the
PCIe interface. The
A10-7850K "Kaveri" contains 8 CUs (compute units) and 8 Asynchronous Compute Engines for independent scheduling and work item dispatching. At AMD Developer Summit (APU) in November 2013 Michael Mantor presented the
Radeon R9 290X.
Chips Discrete GPUs (Sea Islands family): • Bonaire • Hawaii integrated into APUs: • Temash • Kabini • Liverpool (i.e. the APU found in the PlayStation 4) • Durango (i.e. the APU found in the Xbox One and Xbox One S) • Kaveri • Godavari • Mullins • Beema • Carrizo-L
Graphics Core Next 3 GCN 3rd generation was introduced in 2014 with the
Radeon R9 285 and R9 M295X, which have the "Tonga" GPU. It features improved tessellation performance, lossless delta color compression to reduce memory bandwidth usage, an updated and more efficient instruction set, a new high quality scaler for video, HEVC encoding (VCE 3.0) and HEVC decoding (UVD 6.0), and a new multimedia engine (video encoder/decoder). Delta color compression is supported in Mesa. However, its double precision performance is worse compared to previous generation.
Chips discrete GPUs: • Tonga (Volcanic Islands family), comes with
UVD 5.0 (Unified Video Decoder) • Fiji (Pirate Islands family), comes with UVD 6.0 and
High Bandwidth Memory (HBM 1) integrated into APUs: • Carrizo, comes with UVD 6.0 • Bristol Ridge • Stoney Ridge But Polaris feature a newer Display Controller engine, UVD version 6.3, etc. All Polaris-based chips other than the Polaris 30 are produced on the
14 nm FinFET process, developed by
Samsung Electronics and licensed to
GlobalFoundries. The slightly newer refreshed Polaris 30 is built on the
12 nm LP FinFET process node, developed by Samsung and GlobalFoundries. The fourth generation GCN instruction set architecture is compatible with the third generation. It is an optimization for 14 nm FinFET process enabling higher GPU clock speeds than with the 3rd GCN generation. Architectural improvements include new hardware schedulers, a new primitive discard accelerator, a new display controller, and an updated UVD that can decode HEVC at 4K resolutions at 60 frames per second with 10 bits per color channel.
Chips discrete GPUs: • Polaris 10 (also codenamed
Ellesmere) found on "Radeon RX 470" and "Radeon RX 480"-branded graphics cards • Polaris 11 (also codenamed
Baffin) found on "Radeon RX 460"-branded graphics cards (also Radeon RX 560
D) • Polaris 12 (also codenamed Lexa) found on "Radeon RX 550" and "Radeon RX 540"-branded graphics cards • Polaris 20, which is a refreshed (
14 nm LPP
Samsung/
GloFo FinFET process) Polaris 10 with higher clocks, used for "Radeon RX 570" and "Radeon RX 580"-branded graphics cards • Polaris 21, which is a refreshed (14 nm LPP Samsung/GloFo FinFET process) Polaris 11, used for "Radeon RX 560"-branded graphics cards • Polaris 22, found on "Radeon RX Vega M GH" and "Radeon RX Vega M GL"-branded graphics cards (as part of
Kaby Lake-G) • Polaris 23, which is a refreshed (14 nm LPP Samsung/GloFo FinFET process) Polaris 12, used for "Radeon Pro WX 3200" and "Radeon RX 540X"-branded graphics cards (also Radeon RX 640) • Polaris 30, which is a refreshed (
12 nm LP GloFo FinFET process) Polaris 20 with higher clocks, used for "Radeon RX 590"-branded graphics cards In addition to dedicated GPUs, Polaris is utilized in the APUs of the PlayStation 4 Pro and Xbox One X, titled "Neo" and "Scorpio", respectively.
Precision Performance FP64 performance of all GCN 4th generation GPUs is 1/16 of FP32 performance.
Graphics Core Next 5 AMD began releasing details of their next generation of GCN Architecture, termed the 'Next-Generation Compute Unit', in January 2017. The new design was expected to increase
instructions per clock, higher
clock speeds, support for
HBM2, a larger memory
address space. The discrete graphics chipsets also include "HBCC (High Bandwidth Cache Controller)", but not when integrated into APUs. Additionally, the new chips were expected to include improvements in the
Rasterisation and
Render output units. The
stream processors are heavily modified from the previous generations to support packed math Rapid Pack Math technology for 8-bit, 16-bit, and 32-bit numbers. With this there is a significant performance advantage when lower precision is acceptable (for example: processing two
half-precision numbers at the same rate as a single
single precision number). Nvidia introduced tile-based rasterization and binning with
Maxwell, and this was a big reason for Maxwell's efficiency increase. In January,
AnandTech assumed that Vega would finally catch up with Nvidia regarding energy efficiency optimizations due to the new "DSBR (Draw Stream Binning Rasterizer)" to be introduced with Vega. It also added support for a new
shader stage – Primitive Shaders. Primitive shaders provide more flexible geometry processing and replace the
vertex and
geometry shaders in a rendering pipeline. As of December 2018, the Primitive shaders can't be used because required API changes are yet to be done. Vega 10 and Vega 12 use the
14 nm FinFET process, developed by
Samsung Electronics and licensed to
GlobalFoundries. Vega 20 uses the
7 nm FinFET process developed by
TSMC.
Chips discrete GPUs: • Vega 10 (
14 nm Samsung/
GloFo FinFET process) (also codenamed
Greenland) found on "Radeon RX Vega 64", "Radeon RX Vega 56", "Radeon Vega Frontier Edition", "Radeon Pro V340", Radeon Pro WX 9100, and Radeon Pro WX 8200 graphics cards • Vega 12 (14 nm Samsung/GloFo FinFET process) found on "Radeon Pro Vega 20" and "Radeon Pro Vega 16"-branded mobile graphics cards • Vega 20 (
7 nm TSMC FinFET process) found on "Radeon Instinct MI50" and "Radeon Instinct MI60"-branded accelerator cards, "Radeon Pro Vega II", and "Radeon VII"-branded graphics cards. integrated into APUs: • Raven Ridge came with VCN 1 which supersedes VCE and UVD and allows full fixed-function VP9 decode. • Picasso • Renoir • Cezanne
Precision performance Double-precision floating-point (FP64) performance of all GCN 5th generation GPUs, except for Vega 20, is one-sixteenth of FP32 performance. For Vega 20 with Radeon Instinct this is half of FP32 performance. For Vega 20 with Radeon VII this is a quarter of FP32 performance. All GCN 5th generation GPUs support
half-precision floating-point (FP16) calculations which is twice of FP32 performance. == Comparison of GCN GPUs ==