AMD 10h

The AMD Family 10h, or K10, is a microprocessor microarchitecture by AMD based on the K8 microarchitecture. The first third-generation Opteron products for servers were launched on September 10, 2007, with the Phenom processors for desktops following and launching on November 11, 2007, as the immediate successors to the K8 series of processors.

Nomenclature

It appears that AMD has not used K-nomenclature (which originally stood for "Kryptonite" in the K5 processor) from the time after the use of the codename K8 for the AMD K8 or Athlon 64 processor family, since no K-nomenclature naming convention beyond K8 has appeared in official AMD documents and press releases after the beginning of 2005. The name "K8L" was first coined by Charlie Demerjian in 2005, at the time a writer at The Inquirer, and was used by the wider IT community as a convenient shorthand while according to AMD official documents, the processor family was termed "AMD Next Generation Processor Technology". The microarchitecture has also been referred to as Stars, as the codenames for desktop line of processors was named under stars or constellations (the initial Phenom models being codenamed Agena and Toliman). In a video interview, Giuseppe Amato confirmed that the codename is K10. It was revealed, by The Inquirer itself, that the codename "K8L" referred to a low-power version of the K8 family, later named Turion 64, and that K10 was the official codename for the microarchitecture. AMD refers to it as Family 10h Processors, as it is the successor of the Family 0Fh Processors (codename K8). 10h and 0Fh refer to the main result of the CPUID x86 processor instruction. In hexadecimal numbering, 0Fh (h represents hexadecimal numbering) equals the decimal number 15, and 10h equals decimal 16. (The "K10h" form that sometimes pops up is an improper hybrid of the "K" code and Family identifier number.) ==Schedule of launch and delivery==

Schedule of launch and delivery

Timeline Historical information In 2003, AMD outlined the features for upcoming generations of microprocessors after the K8 family of processors in various events and analyst meetings, including the Microprocessor Forum 2003. The outlined features to be deployed by the next-generation microprocessors are as follows: • Threaded architectures. • Chip level multiprocessing. • Huge scale MP (multi-processor) machines. • 10 GHz operation. • Much higher performance superscalar, out-of-order CPU core. • Huge caches. • Media/vector processing extensions. • Branch and memory hints. • Security and virtualization. • Enhanced Branch Predictors. • Static and dynamic power management. In June 2006, AMD executive vice president Henri Richard had an interview with DigiTimes commented on the upcoming processor developments: Live demonstrations On November 30, 2006, AMD live demonstrated the native quad core chip known as "Barcelona" for the first time in public, while running Windows Server 2003 64-bit Edition. AMD claims 70% scaling of performance in real world loads, and better performance than Intel Xeon 5355 processor codenamed Clovertown. On January 24, 2007, AMD Executive Vice President Randy Allen claimed that in live tests, in regard to a wide variety of workloads, "Barcelona" was able to demonstrate 40% performance advantage over the comparable Intel Xeon codenamed Clovertown dual-processor (2P) quad-core processors. The expected performance of floating point per core would be approximately 1.8 times that of the K8 family, at the same clock speed. On May 10, 2007, AMD held a private event demonstrating the upcoming processors codenamed Agena FX and chipsets, with one demonstrated system being AMD Quad FX platform with one Radeon HD 2900 XT graphics card on the upcoming RD790 chipset. The system was also demonstrated real-time converting a 720p video clip into another undisclosed format while all 8 cores were maxed at 100% by other tasks. Sister microarchitecture On the December 2006 analyst day, Executive vice president Marty Seyer announced a new mobile core codenamed Griffin launched in 2008 with inherited power optimizations technologies from the K10 microarchitecture, but based on a K8 design. TLB bug In November 2007, AMD stopped delivery of Barcelona processors after a bug in the translation lookaside buffer (TLB) of stepping B2 was discovered that could rarely lead to a race condition and thus a system lockup. A patch in BIOS or software worked around the bug by disabling cache for page tables, but it was connected to a 5 to 20% performance penalty. Kernel patches that would almost completely avoid this penalty were published for Linux. In April 2008, the new stepping B3 was brought to the market by AMD, including a fix for the bug plus other minor enhancements. ==Features==

Features

Fabrication technology AMD has introduced the microprocessors manufactured at 65 nm feature width using Silicon-on-insulator (SOI) technology, since the release of K10 coincides with the volume ramp of this manufacturing process. Supported DRAM standards The K8 family was known to be particularly sensitive to memory latency since its design gains performance by minimizing this through the use of an on-die memory controller (integrated into the CPU); increased latency in the external modules negates the usefulness of the feature. DDR2 RAM introduces some additional latency over DDR RAM since the DRAM is internally driven by a clock at one quarter of the external data frequency, as opposed to one half that of DDR. However, since the command clock rate in DDR2 is doubled relative to DDR and other latency-reducing features (e.g. additive latency) have been introduced, common comparisons based on CAS latency alone are not sufficient. For example, Socket AM2 processors are known to demonstrate similar performance using DDR2 SDRAM as Socket 939 processors that utilize DDR-400 SDRAM. K10 processors support DDR2 SDRAM rated up to DDR2-1066 (1066 MHz). While some desktop K10 processors are AM2+ supporting only DDR2, an AM3 K10 processor supports both DDR2 and DDR3. A few AM3 motherboards have both DDR2 and DDR3 slots (this does not mean that both types can be fitted at the same time), but for the most part they have only DDR3. Lynx desktop processors only support DDR3, as they use the FM1 socket. ==Microarchitecture characteristics==

Microarchitecture characteristics

Characteristics of the microarchitecture include the following: • Form factors • Socket AM2+ with DDR2 for the 65 nm Phenom and Athlon 7000 Series • Socket AM3 with either DDR2 or DDR3 for Semprons and the 45 nm Phenom II and Athlon II Series. They can also be used on AM3+ motherboards with DDR3. Note that, while all K10 Phenom Processors are backwards compatible with Socket AM2+ and Socket AM2, some 45 nm Phenom II Processors are only available for Socket AM2+. Lynx processors do not use either AM2+ nor AM3. • Socket FM1 with DDR3 for Lynx processors. • Socket F with DDR2, DDR3 with Shanghai and later Opteron processors • Instruction set additions and extensions • New bit-manipulation instructions ABM: Leading Zero Count (LZCNT) and Population Count (POPCNT) • New SSE instructions named as SSE4a: combined mask-shift instructions (EXTRQ/INSERTQ) and scalar streaming store instructions (MOVNTSD/MOVNTSS). These instructions are not found in Intel's SSE4 • Support for unaligned SSE load-operation instructions (which formerly required 16-byte alignment) • Execution pipeline enhancements • 128-bit wide SSE units • Wider L1 data cache interface allowing for two 128-bit loads per cycle (as opposed to two 64-bit loads per cycle with K8) • Lower integer divide latency • 512-entry indirect branch predictor and a larger return stack (size doubled from K8) and branch target buffer • Side-Band Stack Optimizer, dedicated to perform increment/decrement of register stack pointer • Fastpathed CALL and RET-Imm instructions (formerly microcoded) as well as MOVs from SIMD registers to general purpose registers • Integration of new technologies onto CPU die: • Four processor cores (Quad-core) • Split power planes for CPU core and memory controller/northbridge for more effective power management, first dubbed Dynamic Independent Core Engagement or D. I. C. E. by AMD and now known as Enhanced PowerNow! (also dubbed Independent Dynamic Core Technology), allowing the cores and northbridge (integrated memory controller) to scale power consumption up or down independently. • Shutting down portions of the circuits in core when not in load, named "CoolCore" Technology. • Improvements in the memory subsystem: • Improvements in access latency: • Support for re-ordering loads ahead of other loads and stores • More aggressive instruction prefetching, 32 bytes instruction prefetch as opposed to 16 bytes in K8 • DRAM prefetcher for buffering reads • Buffered burst writeback to RAM in order to reduce contention • Changes in memory hierarchy: • Prefetch directly into L1 cache as opposed to L2 cache with K8 family • 32-way set associative L3 victim cache sized at least 2 MB, shared between processing cores on a single die (each with 512 K of independent exclusive L2 cache), with a sharing-aware replacement policy. • Extensible L3 cache design, with 6 MB planned for 45 nm process node, with the chips codenamed Shanghai. • Changes in address space management: • Two 64-bit independent memory controllers, each with its own physical address space; this provides an opportunity to better utilize the available bandwidth in case of random memory accesses occurring in heavily multi-threaded environments. This approach is in contrast to the previous "interleaved" design, where the two 64-bit data channels were bounded to a single common address space. • Larger Tagged Lookaside Buffers; support for 1 GB page entries and a new 128-entry 2 MB page TLB • 48-bit memory addressing to allow for 256 TB memory subsystems • Memory mirroring (alternatively mapped DIMM addressing), data poisoning support and Enhanced RAS • AMD-V Nested Paging for improved MMU virtualization, claimed to have decreasing world switch time by 25%. • Improvements in system interconnect: • HyperTransport retry support • Support for HyperTransport 3.0, with HyperTransport Link unganging which creates 8 point-to-point links per socket. • Platform-level enhancements with additional functionality: • Five p-states allowing for automatic clock rate modulation • Increased clock gating • Official support for coprocessors via HTX slots and vacant CPU sockets through HyperTransport: Torrenza initiative. ==Feature tables==

Feature tables

CPUs APUs APU features table ==Desktop==

Desktop

Phenom models Agena (65 nm SOI, quad-core) • Four AMD K10 cores • L1 cache: 64 KB instruction and 64 KB data (data + instructions) per core • L2 cache: 512 KB per core, full-speed • L3 cache: 2 MB shared between all cores • Memory controller: dual channel DDR2-1066 MHz with unganging option • ISA extensions: ''MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, AMD64, Cool'n'Quiet, NX bit, AMD-V'' • Socket AM2+, HyperTransport with 1600 to 2000 MHz • Power consumption (TDP): 65, 95, 125 and 140 Watt • First release • November 19, 2007 (B2 Stepping) • March 27, 2008 (B3 Stepping) • Clock rate: 1800 to 2600 MHz • Models: Phenom X4 9100e - 9950 Toliman (65 nm SOI, tri-core) • Three AMD K10 cores • L1 cache: 64 KB instruction and 64 KB data cache per core • L2 cache: 512 KB per core, full-speed • L3 cache: 2 MB shared between all cores • Memory controller: dual channel DDR2-1066 MHz with unganging option • ISA extensions: ''MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, AMD64, Cool'n'Quiet, NX bit, AMD-V'' • Socket AM2+, HyperTransport with 1600 to 1800 MHz • Power consumption (TDP): 65 and 95 Watt • First release • March 27, 2008 (B2 Stepping) • April 23, 2008 (B3 Stepping) • Clock rate: 2100 to 2500 MHz • Models: Phenom X3 8250e - 8850 Phenom II models Thuban (45 nm SOI, hexa-core) • Six AMD K10 cores • L1 cache: 64 KB instructions and 64 KB data per core • L2 cache: 512 KB per core, full-speed • L3 cache: 6 MB shared between all cores. • Memory controller: dual channel DDR2-1066 MHz (AM2+), dual channel DDR3-1333 (AM3) with unganging option • ISA extensions: ''MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, AMD64, Cool'n'Quiet, NX bit, AMD-V'' • Socket AM2+, Socket AM3, HyperTransport with 1800 to 2000 MHz • Power consumption (TDP): 95 or 125 Watt • First release • 27 April 2010 (E0 Stepping) • Clock rate: 2.6 - 3.3 GHz; up to 3.7 GHz with Turbo Core • Models: Phenom II X6 1035T, 1045T, 1055T, 1065T, 1075T, 1090T and 1100T Zosma (45 nm SOI, quad-core) • Four AMD K10 cores harvested from Thuban with two cores disabled • Four AMD K10 cores • Models: Phenom II 42 TWKR Propus (45 nm SOI, quad-core) • Four AMD K10 cores harvested from Deneb with L3 cache disabled • ISA extensions: ''MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, Cool'n'Quiet, AMD-V'' • Models: Athlon X2 6500 - 7850 Regor/Deneb (45 nm SOI, dual-core) • Two AMD K10 cores. Some 5000 series processors are chip harvests from Propus or Deneb; All 5200 series chips are harvests, each has two cores disabled • L1 cache: 64 KB instructions and 64 KB data per core • L2 cache: 512 KB per core, full-speed • Memory controller: dual channel DDR2-1066 MHz (AM2+), dual channel DDR3-1333 (AM3) with unganging option • ISA extensions: ''MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, AMD64, Cool'n'Quiet, NX bit, AMD-V'' • Socket AM3, HyperTransport with 2000 MHz • Power consumption (TDP): 45 Watt or 95 Watt • First release • September 2009 (C2 Stepping) • Clock rate: 2200 - 3100 MHz • Models: Athlon II X4 600e - 650 Rana (45 nm SOI, tri-core) • Three AMD K10 cores chip harvested from Propus or Deneb with one core disabled • Power consumption (TDP): 45 Watts or 95 Watts • First release • October 2009 (Stepping C2) • Clock rate: 2.2–3.4 GHz • Models: Athlon II X3 400e - 460 Regor (45 nm SOI, dual-core) • Two AMD K10 cores • L1 cache: 64 KB instructions and 64 KB data per core • L2 cache: 1024 KB per core, full-speed • Memory controller: dual channel DDR2-1066 MHz (AM2+), dual channel DDR3-1333 (AM3) with unganging option • ISA extensions: ''MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, AMD64, Cool'n'Quiet, NX bit, AMD-V'' • Socket AM3, HyperTransport with 2000 MHz • Power consumption (TDP): 65 Watt • First release • June 2009 (C2 Stepping) • Clock rate: 1600 - 3600 MHz • Models: Athlon II X2 250u - 280 Sargas (45 nm SOI, single-core) • Single AMD K10 core harvest from Regor with one core disabled • AMD K10 cores with no L3 cache • GPU: TeraScale 2 • All A and E series models feature Redwood-class integrated graphics on die (BeaverCreek for the dual-core variants and WinterPark for the quad-core variants). Sempron and Athlon models exclude integrated graphics. • Support for up to four DIMMs of up to DDR3-1866 memory • 5 GT/s UMI • Integrated PCIe 2.0 controller • Select models support Turbo Core technology for faster CPU operation when the thermal specification permits • Select models support Hybrid Graphics technology to assist a discrete Radeon HD 6450, 6570, or 6670 discrete graphics card. This is similar to the current Hybrid CrossFireX technology available in the AMD 700 and 800 chipset series • ISA extensions: ''MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, Cool'n'Quiet, AMD-V'' • Models: Lynx desktop APUs and CPUs == Mobile ==

Mobile

Turion II (Ultra) models "Caspian" (45nm SOI, dual-core) • '''Tigris platform''' • Two AMD K10 cores • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Memory support: DDR2 SDRAM (Up to 800 MHz) • Models: Turion II Ultra M600 to M660 Turion II models "Caspian" (45nm SOI, dual-core) • '''Tigris platform''' • Two AMD K10 cores • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Memory support: DDR2 SDRAM (Up to 800 MHz) • Models: Turion II M500 TO M560 "Champlain" (45nm SOI, dual-core) • '''Danube platform''' • Two AMD K10 cores • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Memory support: DDR3 SDRAM, DDR3L SDRAM (Up to 1333 MHz) • Models: Turion II models Athlon II models "Caspian" (45nm SOI, dual-core) • '''Tigris platform''' • Two AMD K10 cores • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Memory support: DDR2 SDRAM (Up to 800 MHz) • Models: Athlon II M300 to M360 "Champlain" (45nm SOI, dual-core) • '''Danube platform''' • Two AMD K10 cores • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Memory support: DDR3 SDRAM, DDR3L SDRAM (Up to 1333 MHz) • Models: Athlon II models Sempron models "Caspian" (45nm SOI, single-core) • '''Tigris platform''' • Single AMD K10 core • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Memory support: DDR2 SDRAM (Up to 800 MHz) • Models: Sempron M100 to M140 Turion II Neo models "Geneva" (45nm SOI, dual-core) • '''Nile platform''' • Two AMD K10 cores • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Memory support: DDR3 SDRAM, DDR3L SDRAM (Up to 1066 MHz) • Models: Turion II Neo models Athlon II Neo models "Geneva" (45nm SOI, dual-core) • '''Nile platform''' • Two AMD K10 cores • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Memory support: DDR3 SDRAM, DDR3L SDRAM (Up to 1066 MHz) • Models: Athlon II Neo models "Geneva" (45nm SOI, single-core) • '''Nile platform''' • Single AMD K10 core • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Memory support: DDR3 SDRAM, DDR3L SDRAM (Up to 1066 MHz) • Models: Athlon II K125 and K145 V models "Geneva" (45nm SOI, single-core) • '''Nile platform''' • Single AMD K10 core • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Memory support: DDR3 SDRAM, DDR3L SDRAM (Up to 1066 MHz) • Models: V 105 "Champlain" (45nm SOI, single-core) • '''Danube platform''' • Single AMD K10 core • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Memory support: DDR3 SDRAM, DDR3L SDRAM (Up to 1333 MHz) • Models: V 120 to 160 Phenom II models "Champlain" (45nm SOI, quad-core) • '''Danube platform''' • Four AMD K10 cores • Unlike desktop models, mobile Phenom II models do not have L3 cache • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Memory support: DDR3 SDRAM, DDR3L SDRAM (Up to 1333 MHz) • Models: Phenom II models "Champlain" (45nm SOI, tri-core) • '''Danube platform''' • Three AMD K10 cores • Unlike desktop models, mobile Phenom II models do not have L3 cache • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Memory support: DDR3 SDRAM, DDR3L SDRAM (Up to 1333 MHz) • Models: Phenom II models "Champlain" (45nm SOI, dual-core) • '''Danube platform''' • Two AMD K10 cores • Unlike desktop models, mobile Phenom II models do not have L3 cache • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Memory support: DDR3 SDRAM, DDR3L SDRAM (Up to 1333 MHz) • Models: Phenom II models Llano APUs "Sabine" (32nm SOI, dual or quad-core) • Fabrication 32 nm on GlobalFoundries' SOI process • Socket FS1 • Two or four upgraded K10 cores codenamed Husky (K10.5) with no L3 cache, and with Redwood-class integrated graphics on die (WinterPark for the dual-core variants and BeaverCreek for the quad-core variants) • Integrated PCIe 2.0 controller • GPU: TeraScale 2 • Select models support Turbo Core technology for faster CPU operation when the thermal specification permits • 2.5 GT/s UMI • ISA extensions: MMX, Enhanced 3DNow!, SSE, SSE2, SSE3, SSE4a, ABM, NX bit, AMD64, AMD-V, PowerNow! • Support for 1.35 V DDR3L-1333 memory, in addition to regular 1.5 V DDR3 memory specified • Models: Sabine mobile APUs ==Server==

Server

There are two generations of K10-based processors for servers: Opteron 65 nm and 45 nm. ==Successor==

Successor

AMD discontinued further development of K10 based CPUs after Thuban, choosing to focus on Fusion products for mainstream desktops and laptops and Bulldozer based products for the performance market. However, within the Fusion product family, APUs such as the first generation A4, A6 and A8-series chips (Llano APUs) continued to use K10-derived CPU cores in conjunction with a Radeon graphics core. K10 and its derivatives were phased out of production by the introduction of Trinity-based APUs in 2012, which replaced the K10 cores in the APU with Bulldozer-derived cores. ==Family 11h and 12h derivatives==

Family 11h and 12h derivatives

Turion X2 Ultra Family 11h The Family 11h microarchitecture was a mixture of both K8 and K10 designs with lower power consumption for laptop that was marketed as Turion X2 Ultra and was later replaced by completely K10-based designs. • Both CPU and GPU were re-used to avoid complexity and risk • Distinct Software and Physical integration makes Fusion (APU) microarchitectures different • Power-saving improvements including clock gating • Improvements to hardware pre-fetcher • Redesigned memory controller • 1 MB L2 cache per core • No L3 cache • Two new buses for on-die GPU to access memory (called Onion and Garlic interfaces) • AMD Fusion Compute Link (Onion) – interfaces to CPU cache and coherent system memory (see cache coherence) • Radeon Memory Bus (Garlic) – dedicated non-coherent interface connected directly to memory ==Media discussions==

Media discussions

Note: These media discussions are listed in ascending date of publication. • • • • • • • • • • • • • • • • • • • • • • • • • • • ==See also==

Source: Wikipedia ↗

tickerdossier.com tickerdossier.substack.com