Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

When you query some APIs/scrape sites for personal use, it is unlikely you get throttled. Openai doing it at large scale for many users might have to go slower (they have tons of proxies for sure, but don't want to burn those IPs for user controlled traffic).

Similarly, their inference GPUs have some capacity. Spreading out the traffic helps keep high utilization.

But lastly, I think there is just a marketing and psychological aspect. Even if they can have the results in one minute, delaying it to two-five minutes won't impact user retention much, but will make people think they are getting a great value.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: