One thing I rarely see discussed is that AI cost is not just dollars per token.<p>There’s also latency, dependency on external infrastructure, privacy and compliance concerns, energy usage, and just the general predictability of the system itself.<p>My guess is that this will gradually push a lot of companies toward more hybrid architectures over time. Small or local models are probably good enough for things like filtering, routing or repetitive high volume tasks, while frontier models get reserved for the places where the quality jump actually justifies the added cost and complexity.<p>As useful as frontier models are, using them for absolutely everything sometimes reminds me of using a distributed system for problems that could have been solved locally with something much simpler.<p>I wouldn’t be surprised if, in many real world cases, a fast specialized system plus a smaller model ends up being the more practical and economical setup overall.
by Tony_Delco
|
May 8, 2026, 12:56:57 PM
I think it’s going to be like infrastructure —- eventually they will reach certain level, maybe like electricity.
by markus_zhang
|
May 8, 2026, 12:56:57 PM
Less people will use the frontline models and those who do will pay more. Progress will slow. OpenAI will sell your chat data. You will get an AI tax. Companies will use less of it.<p>Hopefully new ways to deliver similiar quality will be discovered.<p>Stock market will pop.<p>Prices will go up for people inside the moat
by ipaddr
|
May 8, 2026, 12:56:57 PM
What always happens. A market correction followed by going back to a reasonable state, until the next bubble of course.<p>In my opinion, LLMs are useful for many things but not anything and everything and definitely not in the way the boosters are claiming. This is not a popular opinion when you are inside the bubble or have something to gain by it. So when there there's a downturn, things will hopefully stabilize with LLMs being another tool that can be used to automate certain things. It feels crazy saying this these days and have been told I'm out of touch if I think this way and who knows, maybe that's true.
by scorpioxy
|
May 8, 2026, 12:56:57 PM
Sometimes I do wonder about this. Some companies might get people used to AI first and then raise prices later, which could put many of us in a difficult position. But I also think Linux came out in a similar kind of environment, and in the end the community will find a way through it.
by kaant
|
May 8, 2026, 12:56:57 PM
For a lot of companies, probably shut down or drastically limit their AI usage due to rising costs. A small or medium sized business dependent on ever growing AI expenses is in a real bad position, and could well go under.<p>I heard a few companies ended up going back to hiring actual employees for work that was previous done by LLMs, so there's a chance we could see some more of that too. Might also see a few try to make it work with outdated or local ones too.
by CM30
|
May 8, 2026, 12:56:57 PM
Token anxiety is real. What worked for me: prompt caching on fixed system prompts cut my Anthropic bill by ~60% overnight. Most devs don't realize cache writes are 25x cheaper than input tokens on Claude.<p>Local models for classification/routing + frontier only for generation is the other move — but the latency tradeoff is real if you're in a user-facing flow.
by MehdiBelkacem
|
May 8, 2026, 12:56:57 PM
Prices are going down. Just look at open source models, you can run the equivalent to a SOTA model 8 months ago on your laptop.
by atleastoptimal
|
May 8, 2026, 12:56:57 PM
most people will stop paying for the frontier models and will look out for the small models which are optimised on certain tasks
by B_Nemade
|
May 8, 2026, 12:56:57 PM
What do you think will happen? How does supply and demand work? Practically every business and government in existence is existentially dependent on AI, speculation on it is the only thing keeping the world from global financial collapse. It's "too big to fail" at a scale that dwarfs the financial crisis of 2008.<p>You'll pay the fucking danegeld is what you'll do, and keep paying it, because you reorganized your entire existence around and mortgaged your future on a closed proprietary third party service's <i>business model</i> that is now a single point of failure for our entire technological civilization, making its market value practically infinite.<p>That's a collective "you" there, by the way, not "you" personally.
by krapp
|
May 8, 2026, 12:56:57 PM
[flagged]
by catbot_dev
|
May 8, 2026, 12:56:57 PM
[dead]
by rebekkamikkoa
|
May 8, 2026, 12:56:57 PM
[dead]
by KeynitionAuto
|
May 8, 2026, 12:56:57 PM