This is hard to say definitively. The new Nvidia Vera Rubin chips are 35-50x more efficient on a FLOPS/megawatt basis, and TPUs, ASICs, and AMD chips are making similar, if less dramatic, strides.
So a service run at a loss now could be high-margin on new chips in a year. We also don't really know that they're losing money on the $200/month subscriptions, just that they're compute constrained.
If prices increase, it might be because of a supply crunch rather than unit economics.
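To make the loss-to-margin point concrete, here's a back-of-envelope sketch. Every number in it is invented purely to show the arithmetic (FLOPs per token, power price, chip throughput are all hypothetical), not real figures for any vendor:

```python
# Hypothetical back-of-envelope: how a big FLOPS/megawatt jump changes
# the pure power cost of serving tokens. All numbers are made up.

def power_cost_per_1m_tokens(flops_per_token, flops_per_mw_second, mwh_price_usd):
    """Electricity cost of generating 1M tokens, ignoring capex, staff, etc."""
    mw_seconds = 1_000_000 * flops_per_token / flops_per_mw_second
    return mw_seconds / 3600 * mwh_price_usd  # convert MW-seconds to MWh

# Hypothetical workload: 1e12 FLOPs per token, $100/MWh electricity.
old = power_cost_per_1m_tokens(1e12, 1e15, 100)    # current-gen chips
new = power_cost_per_1m_tokens(1e12, 35e15, 100)   # 35x better FLOPS/MW

print(f"old: ${old:.2f}/1M tokens, new: ${new:.2f}/1M tokens")
```

Same revenue per token, 35x lower power cost, so a service that's underwater on today's hardware could be comfortably profitable on the next generation without touching prices.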
Given the massive costs of training, R&D, and infrastructure build-out, plus the fact that both Anthropic and OpenAI are burning money as quickly as they can raise it, the safe bet is on costs going up.
Honestly, some of this info is quite hard to parse. I think the efficiency gain is ~35x at the system level but ~10x at the hardware level; I think this is due to Nvidia bringing in Groq in addition to chip improvements.
Seems like the real costs and numbers are very hidden right now. These are all private companies, and how much anything costs, and whether anything is profitable, is closely guarded.
That's like saying driving for Uber is profitable if you only take gas mileage into consideration but ignore car maintenance, payments, insurance, and all the other costs of owning a car.
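The analogy in numbers (every figure here is made up for illustration; the point is marginal cost vs. full cost, not real Uber economics):

```python
# Toy illustration: a business can look profitable on marginal cost
# while losing money on full cost. All dollar figures are invented.

fare_per_mile = 1.50
gas_per_mile = 0.15          # the only cost the optimistic view counts
other_costs_per_mile = 1.45  # maintenance, depreciation, insurance, payments

marginal_profit = fare_per_mile - gas_per_mile           # looks healthy
true_profit = marginal_profit - other_costs_per_mile     # actually negative

print(f"marginal: ${marginal_profit:.2f}/mile, all-in: ${true_profit:.2f}/mile")
```

Same trap with AI services: inference compute is the "gas," while training runs, R&D, and datacenter build-out are the car payments that get left out of the per-token math.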
Not sure which exact model you're talking about, but I've run the 30B and the 3.5 32B models, and both can get some things done and can also waste tons of time getting some things completely wrong.
They're fun to mess around with to figure out what they can and can't do, but they're certainly not tools I can count on the way I can count on Codex.