More

victorbjorklund · 2026-04-24T14:11:54 1777039914

Not worse than having our stuff built there. Is it great to be relying on them? No, but at least more stable than US under Trump.

victorbjorklund · 2026-04-24T06:33:33 1777012413

You don’t think the Trump admin leaked any secrets at all? No chats on signal? Nothing like that?

victorbjorklund · 2026-04-22T22:05:34 1776895534

I theory I think dynamic pricing in grocery stores is allowed in EU but not sure if actually used.

victorbjorklund · 2026-04-22T21:54:42 1776894882

This is just epic. Really amazing.

victorbjorklund · 2026-04-22T21:52:28 1776894748

Also it decrease demand. Someone who is feeling ”eh maybe I’ll take an uber or the train” will be more likely to take the train.

esseph · 2026-04-23T01:04:46 1776906286

Not an option in much of the US. I'd argue it's mostly a Bay Area and DC corridor thing.

victorbjorklund · 2026-04-22T18:18:03 1776881883

Honestly, my accessibility on my apps/websites is much better now with AI because you can just tell AI to do it (and run automated tests to validate it worked) vs not doing it at all for a small side project with 2 users.

victorbjorklund · 2026-04-21T13:54:11 1776779651

5 billion now vs 10 billion per year in spend on compute that you had to buy anyways (not necessarily at aws)

victorbjorklund · 2026-04-21T07:07:28 1776755248

Anthropic is really trying to burn all that goodwill they worked up by raising prices, reducing limits and making it impossible to know what the actual policies are.

notarobot123 · 2026-04-21T07:12:45 1776755565

Boiling the frog is an art form. You've got to know when to turn up the heat and when to let it simmer.

Gigachad · 2026-04-21T09:58:39 1776765519

Don’t know, I feel like I’ve watched every tech company get through every controversy without consequence.

Google when they merged YouTube and Google+, Reddit multiple times, Facebook after countless scandals. Microsoft destroying windows and pushing ads.

At the end of the day a solid product and company can withstand online controversy.

conductr · 2026-04-21T17:14:49 1776791689

> a solid product and company can withstand online controversy

A product with a massive moat. Switching from Claude to another competitor is insanely easy and without much loss of quality. Until they’ve built their moat, burning goodwill is foolish.

What’s different is it’s probably required due to the cash that’s being burnt to operate. They can’t afford to keep offering so much for so little revenue.

nativeit · 2026-04-21T13:40:27 1776778827

At the end of the day, unenforced anti-competition regulations can bulldoze controversies.

sitkack · 2026-04-21T08:22:13 1776759733

Hormussy started it.

bandrami · 2026-04-21T08:25:42 1776759942

If you want LLMs to continue to be offered we have to get to a point where the providers are taking in more money than they are spending hosting them. And we still aren't there (or even close).

hobom · 2026-04-21T10:54:10 1776768850

They are taking in more than they are spending hosting them. However, the cost for training the next generation of models is not covered.

bandrami · 2026-04-21T11:18:40 1776770320

Nope. They're losing money on straight inference (you may be thinking of the interview where Dario described a hypothetical company that was positive margin). The only way they can make it look like they're making money on inference is by calling the ongoing reinforcement training of the currently-served model a capital rather than operational expense, which is both absurd and will absolutely not work for an IPO.

wild_egg · 2026-04-21T13:17:56 1776777476

Inference, in and of itself, can't be completely unprofitable. Unless you're purely talking about Anthropic?

But

> If you want LLMs to continue to be offered we have to get to a point where the providers are taking in more money than they are spending hosting them

Suggests you just mean in general, as a category, every provider is taking a loss. That seems implausible. Every provider on OpenRouter is giving away inference at a loss? For what purpose?

bandrami · 2026-04-22T02:25:51 1776824751

For the same reason that Amazon operated at a loss for two decades and Uber operated at a loss for a decade and a half. The problem is the free money hose isn't running anymore.

dgellow · 2026-04-21T12:25:37 1776774337

Do you have sources? I would be interested to read them

bandrami · 2026-04-21T12:35:06 1776774906

Probably the best roundup is Ed Zitron at https://wheresyoured.at

Half the articles are paywalled but the free ones outline the financial situation of the SOTA providers and he has receipts

victorbjorklund · 2026-04-22T20:50:45 1776891045

I really doubt that since prices are even higher than no-name hosts on open router etc charges.

quikoa · 2026-04-21T09:39:35 1776764375

The open models may not be as great but maybe these are good enough. AI users can switch when the prices rise before it becomes sustainable for (some) of the large LLM providers.

Gigachad · 2026-04-21T09:59:43 1776765583

Currently it costs so much more to host an open model than it costs to subscribe to a much better hosted model. Which suggests it’s being massively subsidised still.

finaard · 2026-04-21T11:14:45 1776770085

For a lot of tasks smaller models work fine, though. Nowadays the problem is less model quality/speed, but more that it's a bit annoying to mix it in one workflow, with easy switching.

I'm currently making an effort to switch to local for stuff that can be local - initially stand alone tasks, longer term a nice harness for mixing. One example would be OCR/image description - I have hooks from dired to throw an image to local translategemma 27b which extracts the text, translates it to english, as necessary, adds a picture description, and - if it feels like - extra context. Works perfectly fine on my macbook.

Another example would be generating documentation - local qwen3 coder with a 256k context window does a great job at going through a codebase to check what is and isn't documented, and prepare a draft. I still replace pretty much all of the text - but it's good at collecting the technical details.

pbronez · 2026-04-21T11:25:47 1776770747

I haven’t tried it yet, but Rapid MLX has a neat feature for automatic model switching. It runs a local model using Apple’s MLX framework, then “falls forward” to the cloud dynamically based on usage patterns:

> Smart Cloud Routing > > Large-context requests auto-route to a cloud LLM (GPT-5, Claude, etc.) when local prefill would be slow. Routing based on new tokens after cache hit. --cloud-model openai/gpt-5 --cloud-threshold 20000

https://github.com/raullenchai/Rapid-MLX

stingraycharles · 2026-04-21T10:32:21 1776767541

You can use open models through OpenRouter, but if you want good open models they’re actually pretty expensive fairly quickly as well.

layoric · 2026-04-21T10:43:55 1776768235

I've found MiniMax 2.7 pretty decent and even pay-as-you-go on OpenRouter, it's $0.30/mt in, and $1.20/mt out you can get some pretty heavy usage for between $5-$10. Their token subscription is heavily subsidized, but even if it goes up or away, its pretty decent. I'm pretty hopeful for these openweight models to become affordable at good enough performance.

stingraycharles · 2026-04-21T10:50:43 1776768643

It’s okay, but if you compare it to eg Sonnet it’s just way too far off the mark all the time that I cannot use it.

ericd · 2026-04-21T13:32:32 1776778352

Efficiency goes way up with concurrent requests, so not necessarily subsidy, could just be economy of scale.

JumpCrisscross · 2026-04-21T11:10:55 1776769855

If I drop $10k on a souped-up Mac Studio, can that run a competent open-source model for OpenClaw?

pbronez · 2026-04-21T11:36:15 1776771375

Rapid MLX team has done some interesting benchmarking that suggests Qwopus 27B is pretty solid. Their tool includes benchmarking features so you can evaluate your own setup.

They have a metric called Model-Harness Index:

MHI = 0.50 × ToolCalling + 0.30 × HumanEval + 0.20 × MMLU (scale 0-100)

https://github.com/raullenchai/Rapid-MLX

JumpCrisscross · 2026-04-21T11:38:35 1776771515

Pardon the silly question, but why do I need this tool versus running the model directly (and SSH’ing in when I’m away from home)?

Atotalnoob · 2026-04-21T12:06:34 1776773194

Qwen is probably your best bet…

Edit: I’d also consider waiting for WWDC, they are supposed to be launching the new Mac Studio, an even if you don’t get it, you might be able to snag older models for cheaper

JumpCrisscross · 2026-04-21T13:10:05 1776777005

> consider waiting for WWDC

100% agree. I’m just looking forward to setting something up in my electronic closet that I can remote to instead of having everything tracked.

storus · 2026-04-21T16:54:21 1776790461

Latest rumors are no Mac Studio until at least October.

Larrikin · 2026-04-21T13:52:59 1776779579

It is nobody's responsibility to ensure billion dollar companies are profitable. Use them until local models are good enough

lynx97 · 2026-04-21T08:36:58 1776760618

I see the current situation as a plus. I get SOTA models for dumping prices. And once the public providers go up with their pricing, I will be able to switch to local AI because open models have improved so much.

baruch · 2026-04-21T11:28:11 1776770891

If they started doing caching properly and using proper sunrooms for that they'd have a better chance with that

bandrami · 2026-04-21T12:37:29 1776775049

If my empty plate had a pizza on it it would be a good lunch

holoduke · 2026-04-21T15:52:50 1776786770

Like with all new products. It takes time to let the market do its work. See if from a positive side. The demand for more and faster and bigger hardware is finally back after 15 years of dormancy. Finally we can see 128gb default memory or 64gb videocards in 2 years from now.

carefree-bob · 2026-04-21T08:29:14 1776760154

I think this has to be done with technological advances that makes things cheaper, not charging more.

I understand why they have to charge more, but not many are gonna be able to afford even $100 a month, and that doesn't seem to be sufficient.

It has to come with some combination of better algorithms or better hardware.

bandrami · 2026-04-21T08:46:47 1776761207

Making it more affordable would be very bad news for Amazon, who are now counting on $100B in new spending from OpenAI over the next 10 years.

philipwhiuk · 2026-04-21T09:01:59 1776762119

Someone's going to get burned here that's for sure. This isn't going to end with every person on the planet paying $100 a month for an LLM.

LtWorf · 2026-04-21T10:00:36 1776765636

A guy from Meta interviewing at BBC a few years ago claimed that every school child in India was going to have the metaverse VR or they'd be left behind in their education, so every family was certainly going to pony up the money.

throwthrowuknow · 2026-04-21T09:07:32 1776762452

Somethings not adding up. Why is Amazon making financial plans for the next decade based on continued OpenAI spending but you’re saying AI providers like OpenAI and Anthropic aren’t even close to being profitable, so how can they last a decade or more?

Who’s wrong?

bandrami · 2026-04-21T09:17:48 1776763068

I take it you don't remember 2008

arcanemachiner · 2026-04-21T09:34:25 1776764065

Are we before or after the part where they start throwing money out of helicopters?

bandrami · 2026-04-21T10:13:59 1776766439

That's the interesting question, right? Because if this unwinds during a period of external inflation (say, because of a big war and energy shortage) then even the Bernanke would say helicopter money won't work

Gigachad · 2026-04-21T10:01:02 1776765662

They probably aren’t planning on making the money on consumer subscriptions. Any price is viable as long as the user can get more value out of it than they spend.

bandrami · 2026-04-21T10:09:40 1776766180

"Sell this for less than it cost us" was a viable business plan during the ZIRP era but is not now

vegnus · 2026-04-21T18:19:59 1776795599

I'll take local models over these corporate ones any day of the week. Hopefully it's only a matter of time

baq · 2026-04-21T07:22:40 1776756160

Would you please think of the shareholders

sofixa · 2026-04-21T07:36:46 1776757006

What shareholders, Anthropic is a money burning pit. Not to the same extent as OpenAI, but both will struggle hard to actually turn a profit some day, let alone make back the massive investments they've received.

Not that they don't bring value, I'm just not convinced they'll be able to sell their products in a sticky enough way to make up the prices they'll have to extract to make up for the absurd costs.

bruce511 · 2026-04-21T07:49:55 1776757795

>> both will struggle hard to actually turn a profit some day, let alone make back the massive investments they've received.

I'd agree with you, except I've heard this argument before. Amazon, Google, Facebook all burned lots of cash, and folks were convinced they would fail.

On the other hand plenty burned cash and did fail. So could go either way.

I expect, once the market consolidates to 2 big engines, they'll make bonkers money. There will be winners and losers. But I can't tell you which is which yet.

throwthrowuknow · 2026-04-21T09:15:08 1776762908

I’m not sure there will be consolidation. There’s too much room for specialization and even when the models are trained to do the same task they have very different qualities and their own strengths and weaknesses. You can’t just swap one for the other. If anything, as hardware improves I’d expect even more models and providers to become available. There’s already an ocean of fine tuned and merged models.

baq · 2026-04-21T07:47:59 1776757679

$20B ARR or so reported added in Q1 doesn’t sound particularly bad, they’ll raise effective prices some more while Claude diffuses into the economy, sounds like a money printer. The issue is they’re compute constrained on the supply side to grow faster…

sofixa · 2026-04-21T07:58:50 1776758330

> $20B ARR or so reported added in Q1 doesn’t sound particularly bad

Unless you compare with the reported cash burn or projected losses.

> they’ll raise effective prices some more while Claude diffuses into the economy, sounds like a money printer

But the problem is, they have no moat. Even if Claude diffuses into the economy (still to be seen how much it can effectively penetrate sectors other than engineering, spam, marketing/communications), there is no moat, all providers are interchangeable. If Antrhopic raise the prices too much, switch out to the OpenAI equivalent products.

baq · 2026-04-21T09:10:01 1776762601

> But the problem is, they have no moat

I disagree very strongly with this, both anecdotally and in the data - subscriptions are growing in all frontier providers; anecdata is right here in HN when you look around almost everyone is talking about CC, codex is a distant second, and completely anecdotally I personally strictly prefer GPT 5.3+ models for backend work and Opus for frontend; Gemini reviews everything that touches concurrency or SQL and finds issues the other models miss.

My general opinion is that models cannot be replaceable, because a model which can replace every other provider must excel at everything all specialist models excel at and that is impossible to serve at scale economically. IOW everyone will have at least two subscriptions to different frontier labs and more likely three.

sofixa · 2026-04-21T10:36:48 1776767808

You're actually reinforcing my point. Models are interchangable and easy to switch between to adjust based on needs and costs. That means that no individual model / model provider has any sort of serious moat.

If tomorrow Kimi release a model better at something, you'd switch to it.

WarmWash · 2026-04-21T13:43:07 1776778987

It's likely that Chinese models will get regulatory knee-capped at some point, and the domestic labs all have pretty common costs they need to make up. This creates an environment where they match each other as prices climb. Unless Google/Meta suffocates the startups since they have actual cash flow that is non-AI.

Sure you can go local, but lets be real, that would be <1% of users.

baq · 2026-04-21T11:42:24 1776771744

Yes, in that sense, technically correct.

I postulate in practice this won't matter since the space of use cases is so large if Kimi released the absolutely best model at everything they wouldn't be able to serve it (c.f. Mythos).

waysa · 2026-04-21T10:36:18 1776767778

It's almost like they want me to switch to the Chinese clones - which they consider malicious actors.

aurareturn · 2026-04-21T08:18:11 1776759491

Aren't they just doing what Hacker News was trying to tell them to do? That AI is useful but not sure if sustainable. Now they're increasing prices and decreasing tokens and you guys are pissed off.

freedomben · 2026-04-21T09:42:31 1776764551

I feel this has to be said constantly, though I hate doing it.

hn is not a monolith. People here routinely disagree with each other, and that's what makes it great

aurareturn · 2026-04-21T10:06:24 1776765984

I'm aware. When I say "Hacker News", I mean a very sizable portion of users who keep repeating the OpenAI collapse imminent opinion.

ex-aws-dude · 2026-04-21T19:24:52 1776799492

https://en.wiktionary.org/wiki/Goomba_fallacy

aurareturn · 2026-04-22T14:07:08 1776866828

https://en.wikipedia.org/wiki/Argument_from_fallacy

victorbjorklund · 2026-04-20T14:17:56 1776694676

Failed to build one but my advice would be to focus very narrow. In your case start with literally between two cities. Also, fake supply by either paying people to do the trip (in addition to the normal payment on the platform) or literally do it yourself once per week. Focus on supply. It will be way harder to get.

victorbjorklund · 2026-04-20T14:13:34 1776694414

Haha no, I been in 90c saunas many times. Can I stay there for a long time? Heck no. But some people can and it doesn’t kill you (maybe if you have some preexisting condition)

KeplerBoy · 2026-04-20T14:15:35 1776694535

the point is the humidity. hot saunas need to be relatively dry otherwise your sweat won't evaporate.