We've been using the FLUERS eval and you can see comparisons to other models on ... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		jeffharris on March 20, 2025 \| parent \| context \| favorite \| on: OpenAI Audio Models We've been using the FLUERS eval and you can see comparisons to other models on the market in the post https://openai.com/index/introducing-our-next-generation-aud... Curious if there's a benchmark you trust most?

lern_too_spel on March 20, 2025 [–]

FLUERS and GP's Common Voice dataset focus on read speech. I've observed models that perform well on these datasets be completely useless on other distributions, like whispered speech or shouted speech or conversational speech between humans who aren't talking to a computer.

Consider applying for YC's Summer 2026 batch! Applications are open till May 4
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact