Hey folks,
Like a lot of others, I’m trying Fable. But I don’t think I’ve given it really hard tasks for me to feel the big step change people are claiming. Although as I’m working on visual, interactive essays for my reference manual, I’ve been testing one-shot generations on all sorts of topics to play with my kids; how to play the piano, how tetris works, how a volcano works. They’re loving bashing away at my keyboard to see the components come alive.
It’s less chatty than Opus. I use agents as a back-and-forth, so I need some chattiness, GPT models are very to the point, Claude models are usually more verbose. So I think Fable’s a bit of a sweet spot for me, because as of now I still prefer the ‘vibe’ of how Claude models think with me.
It’s slow. Speed is a huge thing for me because I can’t actively run many agents at once, my ADHD context switching is already bad enough. But I need the speed. So then I look to change from High or XHigh reasoning to lower levels but it feels wrong to pick less intelligence. Even though I know it’s probably wasting tokens.
Composer 2.5 Fast from Cursor is the fastest model I’ve tried and I really enjoyed using it in Pi, and it’s shattered the glass for me, GPT 5.5 has too to be honest. They’re quick so I can chew through a bunch of work quickly.
Ever a game of toss-up between price, speed and vibe.
I’m sure GPT 5.6 or 6 is just around the corner, and I imagine OpenAI are really trying to make sure the vibe is on par with Claude models if they want to keep the Codex hype going.
Ben’s Bites is brought to you by Plaid
Over half of Americans used AI to manage their finances in the past 12 months. And their expectations for financial products have never been higher. Plaid's latest report, The State of Intelligent Finance, breaks down what consumers expect from AI and what that means for your team. Get the report.
Fable 5, the new model from Claude, is the “safer” version of Mythos (Anthropic’s unreleased model which they say is a big cybersecurity risk and is only available to certain companies). Fable makes a big jump over Opus 4.8 on benchmarks (not that big vs GPT 5.5 though). Ethan Mollick and Dan Shipper both believe it unlocks new capabilities - a big part of which is its ability to work longer and spawn dozens of subagents reliably without losing context of the main task.
Fable is only available till 22nd June in the Claude subscription plans. Anthropic plans to move Fable use to paid credits after that until they have the capacity to serve it to everyone. It’s 2x more expensive than Opus (vs Mythos, which was 5x Opus)
Anthropic introduced a new policy with the rollout of Fable where it will “secretly” sabotage your work if you use it for any ML/AI-related work. This caused a huge backlash, and in turn Anthropic has partially walked back the “secretly” part of this policy.
Other examples of Fable in use - for a refactor to get clean code, for editing a video, and even making a markdown editor.
Gemini 3.5 Live Translate - new model from Google that does real-time speech-to-speech translation in 70+ languages. Already live in Gemini API and Google Translate + coming soon to Google Meet.
Implications of large-scale test-time compute - Model performance is affected by how much time, compute, or $$ budget is available for the model to complete a task, and hence companies should report these variables when reporting benchmarks.
What does AI pricing actually look like in 2026? Orb analyzed 80 AI agent companies, including leaders like GitHub Copilot, Replit, Factory, and Intercom, to identify the pricing models, packaging strategies, and monetization trends emerging across the industry. Download the report.*
Claude Code can have nested subagents now - each subagent can spawn even more subagents. Currently up to a depth of 5 layers.
The model selector in ChatGPT has been updated to a) show all GPT-5 generation models and b) thinking levels are simplified to Instant, Medium, High, Extra High and Pro - just like how it is in Codex.
Missions are now available on Factory Desktop.
Skribe - Local first markdown writing app with an AI review partner.
Little python harness to run evals on your skills - is the skill improving or hurting the model’s performance?
pr.video by Mainframe - turn any GitHub PR into a narrated video walkthrough to review changes (without needing the code diff).
New essay from Dario Amodei on policymaking to keep up with the pace of AI development. It’s a good read but has “trust me bro” vibes at many places.
Supermemory is now available locally to host yourself.
DiffusionGemma - new open-weights model from Google that uses a different architecture (diffusion instead of transformers) to gain a 3-5x speedup with roughly similar performance.
- by Keshav
I’m building a speech-to-text app that’s entirely local (kinda like Wispr flow, Monologue, Superwhisper, etc.) but no data gets sent to their servers. Part of it stemmed from wanting to play with local models, and part of it was guilt of paying for a similar tool that I don’t use much.
I’m calling the app “Option AFK”, and here’s how I built it in 3-4 days spread out over the past few weeks:
Asked Opus 4.7 to write a simple Python script to get Nvidia’s Parakeet 0.6B working on my M3 Air. I tested it in the browser, and the accuracy/speed were better than what I expected.
Used Codex’s Computer Use to do a screen-by-screen audit of the tool I was paying for and document all the features that it has with screenshots. I was able to do this on the $20 plan in a single session (hitting the 5-hour limit only once).
Then started building the MacOS app with Opus 4.8 (using the audit as a reference). While building this, Opus 4.8 selected an SDK I had no idea existed to wire up things like splitting long voice notes into chunks and speeding the model processing.
Here’s that SDK:
fluidaudio - run transcription and TTS models on MacOS locally.
I got a working version of the app yesterday with Fable 5, and I’m using it on my device already. Here’s how it looks:
It also supports uploading voice notes (even the longer ones) as a file and getting them transcribed at no additional cost.
I’m planning to release this app once I get the Apple Developer Program sign-up done. Would you want to use it?

sunil pai@threepointone
wondering why I feel exhausted. maybe: the agents do all the easy stuff, and I have to work through the leftover hard bits, which means I'm perpetually locked in. and as the models get better, "my" work just gets harder and harder, until I'm basically underqualified to do the
7:51 AM · Jun 10, 2026 · 293K Views
104 Replies · 98 Reposts · 1.58K Likes

🎭@deepfates
the future of all work is this. You must define: - a goal - the criteria that define it - the verifier that makes sure it is achieved - the sensors that inform the verifier - the actuators that affect the sensors - The envelope that contains the sensors and actuators

🎭 @deepfates
The codex "goal" feature is a really good way to spend dozens of hours optimizing some total bullshit btw. If your final criteria is it all vague it will specification game and make masturbatory "evidence" and "verifiers" and "gates" and "smoke tests". must be hell internally
5:05 AM · Jun 10, 2026 · 65.2K Views
40 Replies · 38 Reposts · 765 Likes

Dan Shipper 📧@danshipper
Getting the most out of Claude Fable 5, Anthropic’s powerful new model, you need to maximize your ambition: It’s built for full task delegation—you leave it looping for hours or overnight and come back to a finished product. If you want to get the most out of it, you need to
5:28 PM · Jun 10, 2026 · 30.8K Views
18 Replies · 23 Reposts · 427 Likes

SemiAnalysis@SemiAnalysis_
Recently, we purchased one of each Anthropic/OpenAI subscription plan and randomly ran long horizon coding tasks until we exhausted the weekly limit. It's widely believed that a $200/month plan maxes out at ~$2000/month worth of tokens (assuming API pricing). However, we found

9:00 PM · Jun 10, 2026 · 794K Views
67 Replies · 234 Reposts · 2.53K Likes
Read about me and Ben’s Bites
📷 thumbnail via @_nicolealonso
* sponsors who make this newsletter possible :)
Wanna partner with us for the next quarter?
Email us at shanice@bensbites.com or k@bensbites.com