The Board Room
Nvidia just paid $20B to license Groq's inference-specialized LPU and ship dedicated
AWS simultaneously partnered with Cerebras on cloud inference. The AI compute market is bifurcating into training and inference economies with different architectures, different silicon, and different winners.
Nvidia-Groq $20B Deal Splits AI Compute Into Two Eras
Nvidia licensing Groq's LPU for $20B, shipping 256-chip inference racks, and manufacturing at Samsung (first non-TSMC server chip) confirms inference needs purpose-built silicon. AWS + Cerebras and OpenAI as launch buyer validate industry-wide shift. GPU-only infrastructure contracts are now demonstrably suboptimal for inference workloads.
AI Agents Cross Autonomous Worker Threshold — China Leads 2:1
Three independent vectors crossed the same line simultaneously: China's OpenClaw deployed 200K+ OS-controlling agents (40% of visible global total), Karpathy's autoresearch ran 700 experiments in 48 hours, and multi-agent orchestration went production-grade. Meta acquired Moltbook (agent social network), signaling agent-to-agent infra as the next platform war.
Amazon AI-Code Outages Make Governance a P0
Amazon suffered a 6-hour retail outage and 13-hour AWS disruption from AI-generated code, then mandated senior sign-off on all AI-assisted changes — the first Big Tech governance pullback. NYT's guardrailed approach (28% → 83% test coverage, 70% less effort) proves safe adoption is possible. McKinsey's AI platform fell to a basic SQL injection, exposing the full stack.
Anthropic's PE Joint Venture Redefines Enterprise AI Distribution
Anthropic (now $19B annualized revenue) is forming a Palantir-style JV with Blackstone and Hellman & Friedman to deploy AI across 250+ portfolio companies. OpenAI evaluated the same deal but chose internal services build instead. These are two diverging, potentially irreversible bets on enterprise AI go-to-market — and PE mandated adoption velocity beats any sales team.
Model Layer Commoditizes — Value Migrates to Orchestration and Context
Nvidia open-sourced Nemotron 3 Super's full training methodology (not just weights) — a 120B-param model outperforming OpenAI at 2.2x speed. Alibaba's Qwen 3.5 Small claims Claude Opus 4.5-level at 0.8B params for mobile. HubSpot pivoted to 'Agentic Customer Platform' built on proprietary context. The moat is no longer the model — it's orchestration, data, and domain integration.
Nvidia's $20B Groq Deal Splits AI Compute — and Your Infrastructure Strategy — in Two
AI Agents Hit the Autonomous Worker Line — China Has a 2:1 Head Start
Amazon's AI-Code Outages Just Made Governance Your P0 — Here's the Playbook
Anthropic's PE Joint Venture Creates a New Enterprise AI Distribution Architecture
- Kubernetes AI Gateway Working Group launched — token-based rate limiting and AI-specific routing standards being set now that will persist 5-10 years. Assign a senior architect to participate before your AI workload patterns are standardized without you.
- Short-sellers are now specifically citing agentic AI as a thesis — NINGI Research argues Kinnevik's software/travel/payments holdings are 'essentially intermediaries' displaceable by agents; Climb Global Solutions flagged because software can be acquired via Claude's marketplace.
- HubSpot CPTO's automation framework — 'judgment required × cost of getting it wrong' — is the most practical AI prioritization filter surfaced this quarter. Worth adopting as a standard intake gate for all AI feature proposals.
- Hardware cost inflation cascading beyond data centers: 40% notebook price increases projected for 2026 as AI demand breaks the memory boom-bust cycle. Stress-test IT budgets against 30-40% hardware inflation scenarios.
- Shift4 Payments ($3.52B market cap) in full leadership vacuum — founder gone to NASA, CFO retired, CAO departed — with prior Blue Orca Capital allegations of 'accounting games.' Talent and customer acquisition window for payments competitors.
- GitHub Actions supply chain vulnerability affecting 48 repos including security tools like Trivy — audit all workflows for pull_request_target misconfigurations and third-party action dependencies immediately.
- AI deepfakes now commercially exploiting creator economy at scale — fabricated influencer personas with false credentials driving real product sales, YouTube building detection tools. Authentication infrastructure for the $32B creator economy is a greenfield opportunity.
- Update: SUSE's potential $6B sale represents continued PE churn in enterprise Linux — assess dependency exposure and develop contingency plans for support quality or licensing term changes post-acquisition.
Nvidia paying $20B for Groq's inference chip, Amazon pulling emergency governance on AI-generated code after dual production outages, Anthropic forming a PE joint venture to push AI into 250+ companies by mandate, and China deploying 200K+ autonomous OS-controlling agents while the US debates adoption at 39% favorability — this is the week the AI industry split into before and after. The training era rewarded whoever had the most GPUs. The inference era rewards whoever can deploy AI agents cheapest, safest, and fastest. Your infrastructure needs separate training and inference tracks, your engineering org needs tiered AI code governance before your own Amazon moment, and your competitive planning needs to account for Chinese competitors running agent-augmented workforces at 2x the adoption rate.