VFMADDxxxPS - Fused Multiply ADD xxx Packed Single
VFMSUBxxxPS - Fused Multiply SUBtract xxx Packed Single
VFMADDSUBxxxPS - Fused Multiply ADD SUBtract xxx Packed Single
VFMSUBADDxxxPS - Fused Multiply SUBtract ADD xxx Packed Single
VFNMADDxxxPS - Fused Negative Multiply ADD xxx Packed Single
VFNMSUBxxxPS - Fused Negative Multiply SUBtract xxx Packed Single




For each element, multiplies two of the three operands, adds or subtracts the remaining one, and stores the result in the first operand. (The intrinsic returns the result.)

For the instructions, which operands are multiplied and which one is added or subtracted depends on the order of the digits (1, 2, 3) in the instruction name. For the intrinsics, it depends on the order of the arguments.
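The digit order can be illustrated with a scalar model in C. This is only a sketch: the function names below are hypothetical and are not real intrinsics, op1, op2 and op3 stand for operands (1), (2) and (3), and plain float arithmetic is used, so the single rounding of a real fused operation is not reproduced.

```c
#include <stddef.h>

/* Hypothetical scalar models of the three VFMADDxxxPS operand orders.
   op1, op2, op3 stand for operands (1), (2), (3). The result overwrites
   op1, just as the instruction overwrites its first operand.
   Plain C arithmetic is used here; a real FMA rounds only once. */

void model_vfmadd132ps(float *op1, const float *op2, const float *op3, size_t n)
{
    for (size_t i = 0; i < n; i++)
        op1[i] = op1[i] * op3[i] + op2[i];   /* (1) * (3) + (2) */
}

void model_vfmadd213ps(float *op1, const float *op2, const float *op3, size_t n)
{
    for (size_t i = 0; i < n; i++)
        op1[i] = op2[i] * op1[i] + op3[i];   /* (2) * (1) + (3) */
}

void model_vfmadd231ps(float *op1, const float *op2, const float *op3, size_t n)
{
    for (size_t i = 0; i < n; i++)
        op1[i] = op2[i] * op3[i] + op1[i];   /* (2) * (3) + (1) */
}
```

All three forms compute the same kind of expression; they differ only in which input is overwritten by the result. With the intrinsic form, the source code states a * b + c and the compiler typically picks whichever encoding fits its register allocation.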

Operation  Instruction / intrinsic  Odd elements       Even elements
FMADD      VFMADD132PS              (1) * (3) + (2)    (1) * (3) + (2)
           VFMADD213PS              (2) * (1) + (3)    (2) * (1) + (3)
           VFMADD231PS              (2) * (3) + (1)    (2) * (3) + (1)
           _mm_fmadd_ps             a * b + c          a * b + c
           _mm256_fmadd_ps          a * b + c          a * b + c
           _mm512_fmadd_ps          a * b + c          a * b + c
FMSUB      VFMSUB132PS              (1) * (3) - (2)    (1) * (3) - (2)
           VFMSUB213PS              (2) * (1) - (3)    (2) * (1) - (3)
           VFMSUB231PS              (2) * (3) - (1)    (2) * (3) - (1)
           _mm_fmsub_ps             a * b - c          a * b - c
           _mm256_fmsub_ps          a * b - c          a * b - c
           _mm512_fmsub_ps          a * b - c          a * b - c
FMADDSUB   VFMADDSUB132PS           (1) * (3) + (2)    (1) * (3) - (2)
           VFMADDSUB213PS           (2) * (1) + (3)    (2) * (1) - (3)
           VFMADDSUB231PS           (2) * (3) + (1)    (2) * (3) - (1)
           _mm_fmaddsub_ps          a * b + c          a * b - c
           _mm256_fmaddsub_ps       a * b + c          a * b - c
           _mm512_fmaddsub_ps       a * b + c          a * b - c
FMSUBADD   VFMSUBADD132PS           (1) * (3) - (2)    (1) * (3) + (2)
           VFMSUBADD213PS           (2) * (1) - (3)    (2) * (1) + (3)
           VFMSUBADD231PS           (2) * (3) - (1)    (2) * (3) + (1)
           _mm_fmsubadd_ps          a * b - c          a * b + c
           _mm256_fmsubadd_ps       a * b - c          a * b + c
           _mm512_fmsubadd_ps       a * b - c          a * b + c
FNMADD     VFNMADD132PS             - (1) * (3) + (2)  - (1) * (3) + (2)
           VFNMADD213PS             - (2) * (1) + (3)  - (2) * (1) + (3)
           VFNMADD231PS             - (2) * (3) + (1)  - (2) * (3) + (1)
           _mm_fnmadd_ps            - a * b + c        - a * b + c
           _mm256_fnmadd_ps         - a * b + c        - a * b + c
           _mm512_fnmadd_ps         - a * b + c        - a * b + c
FNMSUB     VFNMSUB132PS             - (1) * (3) - (2)  - (1) * (3) - (2)
           VFNMSUB213PS             - (2) * (1) - (3)  - (2) * (1) - (3)
           VFNMSUB231PS             - (2) * (3) - (1)  - (2) * (3) - (1)
           _mm_fnmsub_ps            - a * b - c        - a * b - c
           _mm256_fnmsub_ps         - a * b - c        - a * b - c
           _mm512_fnmsub_ps         - a * b - c        - a * b - c
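The odd/even alternation and the negated forms can be sketched the same way in scalar C. The helper names are hypothetical, not real intrinsics, and the sketch assumes that element index 0 is the lowest element, so index 0 counts as even. Plain float arithmetic is used, so the single rounding of a fused operation is not modeled.

```c
#include <stddef.h>

/* Hypothetical scalar models; element 0 is the lowest element (even). */

/* FMADDSUB: odd elements a*b + c, even elements a*b - c. */
void model_fmaddsub(float *dst, const float *a, const float *b,
                    const float *c, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = (i & 1) ? a[i] * b[i] + c[i]
                         : a[i] * b[i] - c[i];
}

/* FMSUBADD: odd elements a*b - c, even elements a*b + c. */
void model_fmsubadd(float *dst, const float *a, const float *b,
                    const float *c, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = (i & 1) ? a[i] * b[i] - c[i]
                         : a[i] * b[i] + c[i];
}

/* FNMADD: the product is negated, then c is added. */
void model_fnmadd(float *dst, const float *a, const float *b,
                  const float *c, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = -(a[i] * b[i]) + c[i];
}

/* FNMSUB: the product is negated, then c is subtracted. */
void model_fnmsub(float *dst, const float *a, const float *b,
                  const float *c, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = -(a[i] * b[i]) - c[i];
}
```

Note that in the negated forms only the product is negated, not the whole expression.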

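_mask_   If a bit of k is 0, the corresponding element of a is copied.
_mask3_  If a bit of k is 0, the corresponding element of c is copied.
_maskz_  If a bit of k is 0, the corresponding element is set to zero.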
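The three masking variants differ only in what is written where the mask bit is 0. A minimal scalar sketch for the FMADD case follows. The helper names are hypothetical; their argument order is chosen to mirror _mm512_mask_fmadd_ps(a, k, b, c), _mm512_mask3_fmadd_ps(a, b, c, k) and _mm512_maskz_fmadd_ps(k, a, b, c), and bit i of k is assumed to control element i, with n at most 16.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical scalar models of the masked FMADD variants.
   Bit i of k controls element i (n must not exceed 16). */

/* _mask_ form: where the k bit is 0, the element of a is copied. */
void model_mask_fmadd(float *dst, const float *a, uint16_t k,
                      const float *b, const float *c, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = ((k >> i) & 1) ? a[i] * b[i] + c[i] : a[i];
}

/* _mask3_ form: where the k bit is 0, the element of c is copied. */
void model_mask3_fmadd(float *dst, const float *a, const float *b,
                       const float *c, uint16_t k, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = ((k >> i) & 1) ? a[i] * b[i] + c[i] : c[i];
}

/* _maskz_ form: where the k bit is 0, the element is set to zero. */
void model_maskz_fmadd(float *dst, uint16_t k, const float *a,
                       const float *b, const float *c, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = ((k >> i) & 1) ? a[i] * b[i] + c[i] : 0.0f;
}
```

For example, with k = 0x5 only elements 0 and 2 receive the computed value; elements 1 and 3 take the fallback of the chosen variant.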

