What’s 🔥 in Enterprise IT/VC #504

From Building Intelligence to Controlling It

Jun 27, 2026

Over the past year I’ve argued that AI models would become abundant while the value would shift to the software that sits above them. This week may have been the clearest evidence yet that we’re entering that world.

This wasn’t really a story about China. It wasn’t really a story about OpenAI. And it wasn’t just another story about cheaper models.

It was the week the market began pricing intelligence like a commodity.

As you know I’ve been long open weight/open source models for quite a long time, and I feel like the great reckoning is accelerating. Everyone always wants frontier lab state-of-the-art (SOTA) models if they can, but once subsidization ends, and other alternatives become good enough and dramatically cheaper, economics begin to win.

Well…here we are.

Rohan Paul@rohanpaul_ai

UBS says 60% of companies now watching AI budgets are moving to cheaper models and open-source Chinese models The pressure is coming from extreme bills, including users spending up to $35K/month, teams exceeding quotas by 200%, and companies cutting internal AI tools from 5 to

4:07 AM · Jun 26, 2026 · 53.1K Views

38 Replies · 115 Reposts · 436 Likes

If you think that’s just a Wall Street survey, Coinbase is already doing exactly this internally.

Brian Armstrong@brian_armstrong

How to keep AI spend flat while token usage grows exponentially: Not with friction and spend alerts. With better defaults, routing, and caching. Better Defaults (not Usage Caps) – Engineers can choose any model they want, but defaults matter. We’re experimenting with defaulting

12:48 AM · Jun 27, 2026 · 149K Views

114 Replies · 97 Reposts · 1.06K Likes

Brian Armstrong just described how Coinbase has cut AI spend nearly in half while token usage continues to grow. They did it with better defaults, intelligent routing, caching, leaner context, and visibility. They’re no longer optimizing for the smartest model but for maximum intelligence per dollar.

This is what every technology market looks like when it begins to commoditize: buyers stop asking for the absolute best product and start optimizing for cost, flexibility, and control.

and this…

will brown@willccbb

something has definitely shifted in the past few weeks. seeing a huge uptick in large enterprises wanting to secure compute and post-train their own models in house, frequently on top of GLM-5.2. everyone is starting to understand how open source wins.

12:09 AM · Jun 26, 2026 · 104K Views

43 Replies · 96 Reposts · 1.41K Likes

Another example among many is Hugging Face which is the largest hoster of open source models and infra just crossed $100M of ARR with recent growth by…you guessed it - Chinese models.

clem 🤗@ClementDelangue

We just crossed $100M annual run-rate. I know many AI companies are capturing much more $$$ these days, but still proud of the milestone! Maximizing short-term revenue has never been our priority. In fact, we're proud to manage to store and serve hundreds of petabytes of models

11:18 AM · Jun 25, 2026 · 151K Views

214 Replies · 101 Reposts · 2.15K Likes

And OpenRouter stats - many startups, less enterprises but you get the point.

zerohedge@zerohedge

"the share of tokens used for US models on OpenRouter has collapsed": Bloomberg

11:38 PM · Jun 25, 2026 · 385K Views

131 Replies · 373 Reposts · 2.67K Likes

None of this would matter if open weight/open source models weren’t good enough and much cheaper. Per JPM:

Many tokens consumed in the future may not come from frontier models but from smaller open models that are up to the tasks. Amazon now offers a half-dozen open models at a fraction of frontier pricing, and NVIDIA is teaming up with Dell, Lenovo and HP to make PCs designed with AI agents.

Trevor Noren@trevornoren

JPM: "Many tokens consumed in the future may not come from frontier models but from smaller open models that are up to the tasks. Amazon now offers a half-dozen open models at a fraction of frontier pricing, and NVIDIA is teaming up with Dell, Lenovo and HP to make PCs designed

6:12 PM · Jun 25, 2026 · 10.4K Views

5 Replies · 36 Reposts · 138 Likes

This doesn’t mean frontier models don’t matter. They absolutely do. But most enterprise workloads don’t require frontier intelligence every time. They require the right intelligence at the right cost, delivered securely, with the flexibility to switch models as economics, performance, and policy change.

Whether by building on frontier models or through alleged distillation, the gap continues to compress.

Chubby♨️@kimmonismus

Anthropic claims: Alibaba continues to distill Claude on a large scale to train Qwen. Via Bloomberg Anthropic is accusing Alibaba-linked operators of running a massive campaign to illicitly access Claude through nearly 25,000 fraudulent accounts. According to Bloomberg,

8:25 PM · Jun 24, 2026 · 573K Views

234 Replies · 174 Reposts · 1.99K Likes

Another consequence may be increasing government involvement in access to frontier models. The U.S. government now will decide who gets access to the latest model releases and when 🤯. This will further separate the world of the “haves” versus “have nots” and also allow China to catch up very quickly and close that 9 month gap between best in class and free.

NIK@ns123abc

🚨 BREAKING: U.S. government will decide who gets access to GPT-5.6 OpenAI will release GPT-5.6 only in a limited preview to a small group of partners. Sam Altman told staff the government would be "approving access customer by customer." Commerce Sec Lutnick personally

9:14 PM · Jun 25, 2026 · 1.45M Views

783 Replies · 732 Reposts · 4.89K Likes

TravisGood@IridiumEagle

The US gov is playing a dangerous game. If Chinese models soundly surpass Opus 4.8 while Mythos and GPT 5.6 are banned, the narrative around 'distillation' will collapse, and US AI valuations and adoption will face an existential crisis.

1:28 AM · Jun 26, 2026 · 36K Views

64 Replies · 90 Reposts · 793 Likes

The bigger risk isn’t just geopolitics. It’s a future where access to the most capable models becomes increasingly restricted while open models continue improving. That only strengthens the case for enterprises to own more of their AI stack.

In the meantime, enjoy access to cheaper intelligence and make sure you get your multimodel AI workflow strategy rolling. This all builds on what I laid out in What’s 🔥 #502 two weeks ago. Re-upping it here.

What’s 🔥 in Enterprise IT/VC #502

Jun 13

Satya Nadella recently offered a framework that I think nails where the value is heading. He argued that a company’s private evals may ultimately become its most valuable IP. (h/t to Gokul for summarizing this)

Read full story

Here’s a new interview this week going more in depth on his vision for the future.

Yash Patil@ypatil125

"There should be as many models in the world as firms in the world." Satya and I dig into when to own vs. rent your intelligence, why every company should be building and climbing its own private evals, and what makes for a stable frontier.

4:34 PM · Jun 26, 2026 · 71.7K Views

27 Replies · 54 Reposts · 458 Likes

Satya described where value accrues, while Brian Armstrong is showing how enterprises will operate.

The good news is that only 8% of enterprises have broadly deployed AI agents today (UBS), so we’re still in the early innings. Even if intelligence keeps getting cheaper, enterprises will deploy exponentially more agents. Coding agents consume enormous numbers of tokens. Multimodal AI expands compute requirements. Demand for intelligence keeps growing even as the price per unit falls.

The frontier labs may face pricing pressure. Everyone building above them won’t.

The first era of AI was about building intelligence.

The second era of AI isn’t about building smarter models; it’s about controlling intelligence through routing, governance, security, cost optimization, private context, and private evals, which together become the operating system of enterprise AI.

In a world where intelligence becomes abundant, control compounds.

As always, 🙏🏼 for reading and please share with your friends and colleagues!

Thanks for reading What's Hot 🔥 in Enterprise IT/VC! This post is public so feel free to share it.

Scaling Startups

#in a world moving so fast, it’s all about the founders and ability to recruit the best - talent density matters

Ben Lang@benln

Early teams matter a lot

1:44 PM · Jun 25, 2026 · 107K Views

38 Replies · 55 Reposts · 1.79K Likes

#this trend is exactly what AI should enable, not everyone needs venture capital, in fact most shouldn’t raise (full report is worth a read)

Patrick Collison@patrickc

New from Stripe Economics: The Age of the Solopreneur stripeeconomics.com/p/the-age-of-t…

6:20 PM · Jun 25, 2026 · 211K Views

86 Replies · 265 Reposts · 2.2K Likes

#another hundred million + inception round, $200M 🤯

Behnam Neyshabur@bneyshabur

Today, I’m excited to formally announce @MirendilAI with my amazing co-founders Harsh Mehta, Shayan Salehian, and Tara Rezaei! We’re fortunate to work with @a16z and @kleinerperkins, who led our seed round of $200M, followed by a major investment from NVIDIA, among others.

7:11 PM · Jun 24, 2026 · 735K Views

249 Replies · 129 Reposts · 1.42K Likes

Enterprise Tech

#OpenAI built its own chip but here is what you should pay attention to:

An AI company with frontier coding models can now become a hardware vendor with only a small team of experienced SWEs and an infinite amount of tokens
This is the first chip program fully accelerated by frontier AI.

Patrick C Toulme@PatrickToulme

A few thoughts on OpenAI's Jalapeño chip announcement today: 1. This chip is most likely the first one virtually entirely developed by Codex/GPT. Codex with whatever internal coding model (GPT 5.6/6.0 whatever) coded the entire software stack and most likely the hardware design

6:57 PM · Jun 24, 2026 · 391K Views

133 Replies · 198 Reposts · 2.25K Likes

#he has a point

Matthew Berman@MatthewBerman

> mythos is so good at cyber it can't be released also > mythos can't detect 20k fraudulent chinese accounts attacking it

7:00 PM · Jun 26, 2026 · 93.3K Views

139 Replies · 242 Reposts · 4.57K Likes

#the model routing wars continue as Sakana releases a single model API which claims it matches the performance of Fable and Mythos 🤔 - great marketing but some real questions raised by Elie 👇🏼

the biggest and most obvious issue is that they are introducing a “test time scaling” method with “best of N” over models, and they literally NEVER REPORT the number of output tokens or cost to achieve a benchmark/task

elie@eliebakouch

to be clear, this is a closed source orchestrator on top of closed source models. if before you didn't control the models, now you don't even control which ones are used or how much. this is not "AI sovereignty" i've also read the tech report to get an opinion on the technical

Sakana AI @SakanaAILabs

Introducing Sakana Fugu: A full multi-agent orchestration system accessible via a single model API. Our ‘Fugu Ultra’ model matches the performance of Fable and Mythos, delivering frontier capability without the risk of export controls. Try it: https://t.co/aDEFyySWlS 🐡

6:10 AM · Jun 22, 2026 · 116K Views

65 Replies · 99 Reposts · 1K Likes

#fun time continuing our conversation from the McKinsey Tech Leadership Forum workshop with leading enterprise CTOs/Heads of AI on where are we with diffusion of agentic workflows and what’s ahead

#what used to be mostly a “buy data from vendors” game has shifted toward “build sophisticated internal research infrastructure” because the data/reward design itself is now a core research problem and everyone needs to own their evals

Abhijay Rana@abhijaymrana

Seems like Anthropic is in-housing its RL env efforts. It was only a matter of time anyways, considering what’s been happening in the data space recently.

Xiaoyi Zhang @xiaoyiz_uw

Join Anthropic and help Claude learn real-world knowledge work! You'll own the strategy for domains (e.g., finance, healthcare, legal): source high-value tasks, design RL env / reward signal, and evaluate the capability improvement. Look for Staff+ research engineers who love

4:54 AM · Jun 21, 2026 · 155K Views

7 Replies · 8 Reposts · 558 Likes

#one way to solve the energy and compute problem

Bull Theory@BullTheoryio

TESLA QUIETLY REVEALED A MASSIVE AI INFRASTRUCTURE PLAY. Tesla filed a trademark application for "MEGAPOD", signaling plans to turn its Supercharger network into a massive distributed AI computing platform. The USPTO filing describes MEGAPOD as - "Modular data center hardware

5:39 AM · Jun 21, 2026 · 281K Views

99 Replies · 354 Reposts · 2.18K Likes

#robots 📈

a16z@a16z

VC interest in robotics is surging Charts of the Week: a16z.news/p/charts-of-th…

10:37 PM · Jun 26, 2026 · 91.7K Views

31 Replies · 81 Reposts · 889 Likes

#speaking of, here’s why we’re so excited about Generalist AI, a boldstart port co - this demo was not possible a few years ago - variables changing and the arm adapting

Lukas Ziegler@lukas_m_ziegler

That's one of the coolest robotics demos! 😮‍💨 @GeneralistAI showed GEN-1 handling box folding and screw packing during @AutomateShow. The boxes are cardboard with real variability: creasing, deformation, different configurations. GEN-1 retries when things go wrong. It adapts

1:44 AM · Jun 24, 2026 · 26.1K Views

18 Replies · 29 Reposts · 179 Likes

#speaking of other tokens, the tokenization of real-world assets is happening

The tokenized real-world asset market has grown to nearly $32 billion, highlighting increasing institutional adoption of blockchain-based asset infrastructure.US Treasuries (47%) and private credit (19%) account for approximately 66% of the market, emerging as the dominant use cases for asset tokenization. (Apollo)