AI Is Becoming Infrastructure And Everyone Is Scrambling to Own It
Anthropic filed for IPO and called for an AI pause all in the same week.
The Bottom Line (No Jargon Edition)
Microsoft spent this week at its Build conference launching seven in-house AI models, including a reasoning model called MAI-Thinking-1. The headline from their AI chief: the company was "set free" from OpenAI to go build superintelligence on its own. For anyone running enterprise software, this is a big deal. The model you're routing traffic through today is probably not the one Microsoft wants you using in 18 months.
Anthropic filed confidentially for an IPO on June 1 at a valuation of $965 billion. OpenAI is expected to follow. Two companies that didn't exist five years ago are now each approaching a $1 trillion price tag on the public markets. The capital race is no longer about software. It's about who controls the physical infrastructure underneath it.
Anthropic also published a detailed post arguing that global AI development should slow down or pause. Their reason: Claude is getting close to being able to improve itself without human help. When the company building one of the most capable models in the world says "we should slow down," that's worth paying attention to.
Claude Code has a security problem involving its MCP (Model Context Protocol) configuration. Multiple researchers, including Check Point Research and Mitiga, have documented ways attackers can use it to steal API keys or run code remotely. Your developers are almost certainly already using Claude Code. This needs a conversation with your security team this week.
Chinese AI labs are dominating video AI. ByteDance, Alibaba, Kuaishou, MiniMax, and Tencent have shipped video generation tools that are more capable and cheaper than anything OpenAI has released. They're powering 470 AI-made micro-dramas per day in China. OpenAI ceded this category without most people noticing.
The big infrastructure pattern this week: Microsoft, Nvidia, and firms like Apollo and Brookfield are spending hundreds of billions to buy land, build datacenters, and lock in compute capacity. This is no longer a software business. AI infrastructure is starting to look like power and cooling. a fixed physical constraint, not a flexible digital service.
Model routing strategies are becoming real. When the hyperscalers are building their own models and locking you into their infrastructure, picking one vendor and committing fully is increasingly a risky bet. The teams thinking about multi-model routing now are six months ahead of the ones who aren't.
The Take That Started the Week
Microsoft's Mustafa Suleiman stood on stage at Build 2026 in San Francisco and said something that would have sounded strange two years ago: Microsoft was "set free" from OpenAI. The renegotiated partnership now gives Microsoft the right to build toward superintelligence independently. They shipped seven in-house models at Build, including MAI-Thinking-1, a 35-billion-parameter reasoning model that Suleiman claims users prefer over Claude in side-by-side testing. The company was careful to note it wasn't trained through distillation from any other model. They want full credit for this one.
That independence framing matters for a reason that goes beyond the PR story. Microsoft is not just building models to save on OpenAI licensing costs. They're building models so that every layer of their stack. Azure compute, GitHub Copilot, Copilot in M365, the foundational reasoning. runs on infrastructure they own. The more you rely on Microsoft products, the more Microsoft controls your AI stack. That's the actual business logic here.
At the same time, Anthropic did two things in the same week that seem contradictory on the surface. First, they filed confidentially for a $965 billion IPO. a move designed to raise enormous capital to compete. Then, they published a lengthy post calling for the world's AI labs to consider a temporary pause, citing evidence that Claude is approaching recursive self-improvement: the ability to make itself smarter without human intervention. If that framing is accurate, we're not in the fine-tuning era anymore. We're in something different.
The IPO filings from both Anthropic and OpenAI will be among the most consequential public market events in years. Anthropic went first on June 1. OpenAI is expected close behind. The first mover sets the valuation template. Capital markets analysts are already warning that the combined demand from SpaceX (targeting a $1.75T valuation), Anthropic, and OpenAI could create real disruption in capital markets. The money required to build what these companies are building doesn't come from software margins. It comes from public markets and debt. That's a different game.
Cloud Roundup
AWS
AWS launched a new generation of OpenSearch Serverless this week, rebuilt from the ground up for agentic workloads. The system scales instantly when agents spin up tasks and drops to zero when idle. If you're building multi-agent pipelines on AWS, this is the infrastructure bet they're making: search and vector retrieval baked into the agentic execution layer, not bolted on after the fact. Snowflake also announced a $6 billion multi-year collaboration agreement with AWS, focused on enterprise agentic AI adoption. That's not a partnership announcement. That's a capital commitment. AWS has locked in a major data platform partner at a scale that makes competing on price extremely difficult.
Azure
Build 2026 was Microsoft's biggest developer conference in years. Beyond the MAI model family, Microsoft shipped MAI-Code-1-Flash (their first coding model, aimed at GitHub Copilot), along with models covering image generation, transcription, and voice. The strategy is clear: every API call you make today to an external model provider is a revenue line Microsoft wants to own internally. Suleiman's comment that there are "three labs that matter". and Microsoft wants to be the fourth. was the most honest thing said at any tech conference this year. Meanwhile, Microsoft's $25 billion investment to expand Azure capacity in Australia by more than 140% by 2029 signals that this is a global infrastructure buildout, not a US-centric one.
GCP
Google stayed quieter this week relative to Microsoft's Build noise, but the underlying pattern is the same. Combined AI infrastructure spending by Google, Microsoft, Amazon, and Meta is projected to reach nearly $600 billion in 2026 according to PitchBook. The EU is moving in a different direction. a draft proposal surfaced this week that would establish strict criteria for cloud services used in critical government tenders, potentially excluding Amazon, Microsoft, and Google from certain European public sector contracts. If that passes, it creates a material opening for European-based cloud providers and a sovereign cloud conversation that will reach enterprise buyers well before 2028.
AI Model Roundup
OpenAI
OpenAI's biggest news this week wasn't a model. it was the IPO framing. Anthropic filing first on June 1 puts pressure on OpenAI's timing. The first company to price sets the comp. OpenAI was last valued at $852 billion in March, trailing Anthropic's current $965 billion. The race to the public markets is now a competitive event, not just a capital-raising exercise. Whoever prices first owns the narrative for how AI companies are valued at scale.
Anthropic
Two major stories converged here. The IPO filing is the financial headline. The pause call is the strategic one. In a detailed post published June 4-5, Anthropic argued that Claude is approaching recursive self-improvement. the capability where a model can meaningfully contribute to making better versions of itself. Their position: global AI labs should slow down long enough for safety research and societal infrastructure to catch up. Whether or not you take the safety framing seriously, the fact that the company building Claude is saying this publicly changes the political conversation around AI regulation in ways that will show up in enterprise procurement decisions within 12 months.
The security story is the operational one. Check Point Research found two CVEs in Claude Code earlier this year (CVE-2025-59536 allowed remote code execution; CVE-2026-21852 enabled API key exfiltration). Mitiga added a newer disclosure this week. MCP's configuration model is the attack surface. Developers are using Claude Code in their local environments, often with broader permissions than they realize. If your security team doesn't have a policy for AI coding tools and their MCP configurations, that gap is now documented by multiple researchers.
Google AI / Other
Chinese video AI dominated the non-Anthropic AI model coverage this week. Five integrated video AI stacks. ByteDance's Seedance, Alibaba's Wan and Happy Horse, Kuaishou's Kling, MiniMax's Hailuo AI, and Tencent's Hunyuan. are producing commercial-grade video at a scale and price point that US labs haven't matched. Forbes reported this week that these labs are powering 470 AI-made micro-dramas daily in China, with production costs and timelines that make traditional studio workflows look like a legacy process. OpenAI ceded the video category by not shipping fast enough. Chinese labs filled the gap. Nvidia's Nemotron 3 Ultra also dropped this week. their best open model yet. though analysts noted it still trails Chinese models on several benchmarks. Nvidia's Nemotron Coalition, formed in March with eight AI labs including Mistral and Perplexity, is the open-source counterweight to the closed model race.
The Pattern I'm Watching
I've watched three infrastructure transitions from close range over 30 years: the move from mainframes to client-server, the move from on-prem to cloud, and the containerization wave that followed. Each one looked like a software story in the early innings and turned into a real estate and capital story by the middle innings. AI is following the same arc, but faster and at a larger capital scale than anything I've seen.
The tell is always the same. When hyperscalers start buying land instead of just leasing compute, the infrastructure layer is hardening. When private equity firms like Apollo and Brookfield launch $50 billion AI infrastructure funds, the asset class is real and the lock-in is coming. Microsoft building its own model family isn't about saving money on OpenAI tokens. It's about owning the vertical stack from datacenter to application. the same play IBM ran in the mainframe era, the same play Oracle ran with databases, the same play AWS ran with cloud. The company that controls the infrastructure layer sets the terms for everyone building on top of it.
Model routing and multi-vendor AI strategy are the enterprise response to this. When every hyperscaler is building their own model and the physical infrastructure underneath is geographically locked and capital-intensive, choosing one vendor and going deep is a strategic constraint you probably can't undo in two years. I've seen teams make that mistake with cloud regions, with single-vendor database commitments, and with proprietary container platforms. The teams that built abstraction layers early. even imperfect ones. had options when the market shifted.
The question worth sitting with this week: if your current AI infrastructure choices were locked in place for five years, would you be comfortable with that? If the answer is no, what's the first abstraction layer worth building?
Weekly AI and cloud breakdowns from someone who's been in the game since the early days of the internet. No ads. No filler. The signal.

