The incompatibility between Intel's FMA3 and AMD's FMA4 is due to both companies changing plans without coordinating coding details with each other. AMD changed their plans from FMA3 to FMA4 while Intel changed their plans from FMA4 to FMA3 almost at the same time. The history can be summarized as follows: •
August 2007: AMD announces the
SSE5 instruction set, which includes 3-operand FMA instructions. A new coding scheme (DREX) is introduced for allowing instructions to have three operands. •
April 2008: Intel announces their
AVX and FMA instruction sets, including 4-operand FMA instructions. The coding of these instructions uses the new
VEX coding scheme, which is more flexible than AMD's DREX scheme. •
December 2008: Intel changes the specification for their FMA instructions from 4-operand to 3-operand instructions. The VEX coding scheme is still used. •
May 2009: AMD changes the specification of their FMA instructions from the 3-operand DREX form to the 4-operand VEX form, compatible with the April 2008 Intel specification rather than the December 2008 Intel specification. •
October 2011: AMD
Bulldozer processor supports FMA4. •
January 2012: AMD announces FMA3 support in future processors codenamed Trinity and Vishera; they are based on the
Piledriver architecture. •
May 2012: AMD Piledriver processor supports both FMA3 and FMA4. •
February 2017: The first generation of AMD
Ryzen processors officially supports FMA3, but not FMA4 according to the
CPUID instruction. There has been confusion regarding whether FMA4 was implemented or not on this processor due to errata in the initial patch to the
GNU Binutils package that has since been rectified. One unconfirmed report of wrong results led to some doubt, but Mysticial (Alexander Yee, developer of y-cruncher) debunked it: FMA4 worked for bit-exact bignum calculations on his Zen 1 system for years, and the one report on Reddit never had any followup investigation to rule out mistakes in the testing software before being widely repeated. The initial Ryzen CPUs could be crashed by a particular sequence of FMA3 instructions, but updated CPU microcode fixes the problem. •
July 2019: AMD
Zen 2 and later Ryzen processors don't support FMA4 at all. They continue to support FMA3. Only Zen 1 and Zen+ have unofficial FMA4 support. ==Compiler and assembler support==