I run ml algorithms like boosted trees (i.e xgboost) on data sets with 30k-1m ro...

		henrydark on June 10, 2023 \| parent \| context \| favorite \| on: A performance analysis of Intel x86-SIMD-sort (AVX... I run ml algorithms like boosted trees (i.e xgboost) on data sets with 30k-1m rows and 200-2k columns. Sorting is the bottleneck, it's what the algorithm does. I doubt I'm special, and I'm sure these size are common

IIRC the average qsort len is less than 20 according to debian code search.