🗓️ July 1 — Claude Fable 5 Restored + Sonnet 5 Launched
Claude Fable 5 is back. The US government lifted the June 12 export control order on June 30 after Anthropic trained an updated safety classifier that blocks the jailbreak technique in over 99% of cases. Both Fable 5 (public) and Mythos 5 (restricted partners) came back online July 1 with the improved safeguards deployed. The 19-day suspension — triggered when Amazon researchers demonstrated a method to extract cybersecurity exploit code — is over.
What changed: the new classifier specifically blocks prompts designed to extract offensive vulnerability code. Anthropic's position throughout: the same jailbreak worked on GPT-5.5 Instant, which never faced equivalent controls. The bipartisan House letter (June 18) demanding answers from the Trump administration led to the government reviewing and ultimately lifting the directive.
Pricing: $10/M input, $50/M output. Free on Pro/Max/Team/Enterprise through June 22 ended — now requires usage credits. API model string: claude-fable-5.
Claude Sonnet 5 is Anthropic's new default on Claude.ai — and the standout value launch of mid-2026. At intro pricing of $2/M input and $10/M output (through August 31), it delivers performance close to Opus 4.8 at roughly one-fifth the cost of Fable 5. This is the model to use if you want near-frontier capability without paying frontier rates.
Benchmark highlights: 63.2% SWE-bench Pro (frontier coding), 80.4% Terminal-Bench 2.1 (beats Opus 4.8's 74.6%), 84.7% BrowseComp agentic search. The most agentic Sonnet ever — built to plan, use tools (browser, terminal, code interpreter), and run autonomously on tasks that previously needed larger models. 1M token context window, 128K max output, adaptive thinking enabled by default.
Also immediately available on GitHub Copilot as of June 30. The new tokenizer produces ~30% more tokens for the same text vs Sonnet 4.6 — factor this into your cost estimates.
📱 Cursor iOS — AI Coding Agents on Your Phone
Cursor launched a native iOS app in public beta for all paid plan subscribers. You can now launch and manage always-on AI coding agents directly from your iPhone — pick any repo, start an agent, choose any frontier model (Claude Sonnet 5, GPT-5.5, Gemini 3.5 Flash), and describe your task via text or voice input. Full slash command support, live diff review, and the ability to merge PRs directly from your phone.
The use case: kick off a long-running agent task on the way to a meeting, check its progress during lunch, review and approve the diff before you sit back down. Agents run in the cloud — your phone is a control panel, not a compute node. 75% off Composer 2.5 runs in the mobile app through July 5 as a launch promo.
⚡ Gemini 3.5 Pro — 2M Context, Slipped to July
Gemini 3.5 Pro is Google's flagship model for July — confirmed for general availability this month after slipping from a June 30 target. In limited Vertex AI enterprise preview since mid-June; broad API and consumer app access coming to Google AI Pro/Ultra subscribers first.
Key specs (confirmed): 2,000,000 token context window (doubles Claude Fable 5 and all competitors), Deep Think reasoning mode (extended thinking for complex tasks), native multimodal — text, image, audio, video in one request. Replaces Gemini Ultra use cases.
Why it matters: The 2M context window is a genuine capability gap — not a benchmark number. Entire codebases, long legal documents, multi-session agent memory, hours of video — all fit in one context. If Google ships this at competitive pricing, it forces the whole market to respond.
Pricing (unconfirmed): Estimates range from $2/$12 to $15/$60 per M tokens. Consumer access via paid Gemini subscriptions (~$20/month Pro, ~$250/month Ultra). Watch this space — announcement expected any day.
🔬 GPT-5.6 Sol / Terra / Luna — Gated to 20 Orgs
OpenAI previewed the GPT-5.6 family — three models codenamed Sol, Terra, and Luna — on June 26, available exclusively through API and Codex to roughly 20 trusted partner organisations. Not on the waitlist, not publicly accessible. Capabilities: stronger coding, scientific reasoning, long-horizon planning, and agentic workflows vs GPT-5.5. A "max reasoning effort" and "ultra mode" for complex tasks.
OpenAI confirmed broader availability "soon" for ChatGPT, Codex, and the API — but gave no date. Given OpenAI's IPO timeline (September 2026 target), the pressure to show frontier progress is high. Expect a wider rollout in July or August 2026.
⚡ Grok 4.5 — Private Beta at SpaceX & Tesla
Elon Musk confirmed Grok 4.5 is running in private beta with engineering teams at SpaceX and Tesla as of June 28. Built on a fresh V9 foundation model with approximately 1.5 trillion parameters, with Cursor data added in supplemental training for coding performance. Early internal evals: performance close to or exceeding Opus 4.8 on some tasks.
No public release date. Originally targeted for late May, so already about a month behind. Musk indicated a wider rollout "within weeks" — expect some form of X Premium or Grok app access in July. Grok 5 (10T parameters, trained on Colossus 2) remains in training with no confirmed timeline.
🐉 LongCat-2.0 — 1.6T Open-Source, No NVIDIA Chips
Chinese food-delivery giant Meituan open-sourced LongCat-2.0 — a 1.6 trillion parameter Mixture-of-Experts model trained entirely on a 50,000-card cluster of Chinese-made ASICs (zero NVIDIA hardware). The geopolitical significance is large: China has demonstrated it can train a near-frontier model without any chips under US export controls.
Architecture: MoE activating 48B parameters per token (dynamically 33–56B by query complexity). Native 1M token context. Focus: agentic coding. Leads OpenRouter's coding leaderboard. Benchmarks: 59.5% SWE-bench Pro (vs GPT-5.5's 58.6%), 70.8% Terminal-Bench. Weights fully available on Hugging Face — no restrictions.
🍎 Siri AI Public Beta — iOS 27 Available Now
Apple's iOS 27 public beta dropped in July — the first chance for regular users (not just developers) to try Siri AI, the complete rebuild of Siri powered by Google's Gemini models. Features live: full on-screen awareness, cross-app actions from a single command, deep personal context (emails, messages, photos, calendar), and a standalone Siri app.
Available on iPhone 11 and up. US English only at public beta stage. Not available in EU or China. If you're on a supported device and want to try the biggest Siri upgrade in its history, the public beta is the way in — full release ships this fall with new iPhone hardware.
Privacy architecture: Simple tasks run on-device → moderate tasks on Apple Private Cloud Compute → heavy reasoning routes to Google Cloud. 1.5B daily Siri requests now flowing through Gemini's reasoning layer.
📅 July 2026 Pipeline
🎯 July Verdict — What to Actually Try
1. Claude Sonnet 5 — the value pick of mid-2026 — At $2/M input, it's the best price-to-capability ratio at near-frontier quality right now. Better than Opus 4.8 on Terminal-Bench. Genuinely strong agentic performance. If you're building on Claude and not yet on Sonnet 5, switch today. API string: claude-sonnet-5. Intro pricing ends August 31.
2. Cursor iOS — if you're a developer on a paid plan — Download it, kick off an agent on your commute, review diffs at lunch. It's not a toy — it's a genuine remote control for your cloud agents. The 75% Composer run discount is gone July 5, so if you're on the fence, today is the cheapest time to try it.
3. Gemini 3.5 Pro — wait for it, then benchmark immediately — When it drops, test the 2M context window on your hardest document analysis or codebase task. If Google prices it competitively (below $5/M input), it will be the most consequential model launch since Fable 5. Don't sleep on it.
4. Claude Fable 5 — it's back, but now costs credits — The free window for Pro/Max/Team/Enterprise users ended June 22. The model is restored and working — but now it costs usage credits. Worth using for your most complex, long-horizon tasks where Sonnet 5 doesn't cut it. Don't use it for tasks that Sonnet 5 handles equally well.
5. Siri AI public beta — try it if you're on iOS 27 — The on-screen awareness and cross-app action features are meaningfully better than the old Siri. US English only. Don't expect it to replace your AI assistant workflow — but do try the "read this email and add the flight to my calendar" demo. It actually works.