That link says "* With sparsity". For extremely sparse matrices you can get more than 989 TFLOPS on a CPU, if we're counting elided operations toward the TFLOPS figure.
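To make that concrete, here's a toy sketch in Python/SciPy (the matrix size and density are made up, and the printed figure depends entirely on your machine; the point is that it's bounded only by how sparse you make the matrix):

```python
import time

import numpy as np
import scipy.sparse as sp

# Toy numbers: a 1,000,000 x 1,000,000 matrix with ~10,000 nonzeros.
# Stored densely it would be 8 TB; as CSR it's a few hundred KB.
n = 1_000_000
A = sp.random(n, n, density=1e-8, format="csr", dtype=np.float64)
x = np.random.rand(n)

start = time.perf_counter()
y = A @ x  # the CPU only does work for the ~10k stored nonzeros
elapsed = time.perf_counter() - start

# ...but credit ourselves with the FULL dense operation count anyway.
dense_flops = 2.0 * n * n
print(f"'effective' rate: {dense_flops / elapsed / 1e12:,.0f} TFLOPS")
# Shrink `density` further and this "rate" grows without bound.
```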
That change checks out, then. They didn't see much need for FP16 outside of tensor cores, so they no longer run it at double the FP32 rate outside of the tensor cores (unless I'm mixing that up with AMD).
Other forms of sparsity are heavily used at training time now, like block compression in DeepSeek.
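For the curious, here's a rough sketch of the idea as I understand it from DeepSeek's NSA paper (mean-pooling stands in for their learned compression, and the block size and everything else here is illustrative, not their actual implementation):

```python
import numpy as np

rng = np.random.default_rng(0)

# Compress each block of L keys into one summary vector, then let the
# query score T/L block summaries instead of T individual tokens.
# (Mean-pooling is a stand-in for the learned compression step.)
T, d, L = 4096, 64, 64          # sequence length, head dim, block size
keys = rng.standard_normal((T, d))
query = rng.standard_normal(d)

summaries = keys.reshape(T // L, L, d).mean(axis=1)  # (T/L, d)
scores = summaries @ query                           # T/L dot products

top_blocks = np.argsort(scores)[-8:]  # attend only within the best blocks
print("selected blocks:", np.sort(top_blocks))
```

The compute saved by scoring block summaries (and then attending only inside the selected blocks) is what makes this kind of sparsity pay off at training time, not just at inference.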