AMD's upcoming RDNA 5 GPUs might improve dual-issue execution & use shader units more efficiently — LLVM patch adds new FMA instruction to ease compiling

A new LLVM patch has added V_FMA_F32, a 3-operand fused multiply-add (FMA instruction and introduced the VOPD3 instruction format for RDNA 5. Both of these changes should make it easier for compilers to use dual issue execution, working around the strict pairing rules that would otherwise limit max FP32 throughput in certain workloads.