AI.news
主页教程研究工具模型AI创业讨论新闻每日简报WIKI🚀 创业库★ 投稿
AI+医疗机器人教育金融能源健康娱乐思考

Agents feedback tip

Hey folks, I’m testing out something new in my building workflow…

When an agent asks for feedback it feels like the levels are

  1. type your response

  2. voice-to-text your response

  3. + images to your feedback

  4. get the agent to use the browser

But I just started screen-recording and talking then giving that file to my agent

This is me, in droid, like 30 mins ago. It pulls together a pretty great visual report you can easily review. I can navigate to other websites or apps and show what good looks like from other people, I can highlight specific points and it’ll recreate those points with GIFs.

It gives itself an ‘actions’ checklist underneath. And just feels great to have screenshot → my feedback → action for the agent.

It’s pretty great so far, and then I’ve got these html files saved in my projects to always refer back to - will be good for a build log too.

Probably not great for the token conscious out there - and thinking about it, I could probably use ffmpeg to create actual clips of the video if I wanted. Agents read frames well though so it’d be more for me if I did.

I turned it into a simple skill:

---

name: video-to-html

description: Use when the user wants you to convert their video into a structured HTML document.

---

Turn the user’s video into a structured HTML document. Transcribe the video and pull out the keyframes linked to timestamps for important information. When the user is talking about something that is not dynamic, create short GIFs from the keyframes.

Let me know any cool use-cases or remixes of this 😊

Ben’s Bites is brought to you by Hyperagent from Airtable

Hyperagent, the cloud agent system with full computing environments, is giving $10M in inference credits to help founders build and run agent-first companies. The first 500 qualifying applicants gain access to this limited founder offer. Applications close May 31st.

  • Your Claude plan is changing if you use third-party tools (like Conductor, Zed, Openclaw, T3 Code, etc.) with it.

    • Separate limit for all such usage. Provided as extra monthly credits equal to the value of your plan.

    • No subsidised tokens, credits won’t roll over and usage after you burn through these credits is billed at API rates.

    • Using Claude in Claude Code, Claude app, etc., stays the same and is separate from this.

    • Starts from June 15th, but they are increasing your weekly rate limits by 50% for the next two months.

  • Google announced some Gemini on Android updates before I/O - add features like auto-completing forms, rambling voice notes to clean text, and some app automations under the name “Gemini Intelligence”. They also announced a new class of laptops called Googlebooks, not to be confused with Google Books.

  • Notion has a developer platform now. The biggest addition is a markdown API. Also, devs can sync outside data into Notion, build tools for Notion Agents, run code on Notion’s infra, and eventually bring agents like Claude/Codex into Notion as teammates. But I think people who don’t call them developers will use this.

    • They also launched a CLI called ntn.

  • Vercel published an AI Gateway production index based on real usage across apps and agents. Anthropic leads spend (61% — due to opus), Google leads token volume (38% — due to flash), and agentic workloads are 59% of token usage. Most large teams route across many models instead of betting on one lab.

X avatar for @anvisha

Anvisha@anvisha

Launching today: make any PDF beautiful. It's 2026 - there's no excuse to have ugly resumes, invoices or client proposals. Just upload a PDF -> Get back a polished, professionally designed version in minutes. Works with docs of any complexity👇

7:07 PM · May 11, 2026 · 425K Views

116 Replies · 169 Reposts · 3.1K Likes

X avatar for @ashleevance

Ashlee Vance@ashleevance

Our exclusive interview with @Meta AI chief @alexandr_wang is up. First time he's talked about the new model, the models to come, revamping Meta's AI team, all the money, all the hires, all the beef. Here we go. The Core Memory podcast is on Apple, Spotify, YouTube and

1:34 PM · May 13, 2026 · 4.7K Views

2 Replies · 8 Reposts · 62 Likes

X avatar for @theo

Theo - t3.gg@theo

Is HTML the new Markdown? Had a lot of thoughts on Thariq's latest article so obviously I had to make it a vid

6:58 AM · May 13, 2026 · 90.3K Views

68 Replies · 19 Reposts · 550 Likes

X avatar for @mvanhorn

Matt Van Horn@mvanhorn

Introducing: @meetgranola CLI/Claude Code Skill/OpenClaw and Hermes skill from the @ppressdev printed by @damienstevens . - Cross-meeting SQLite search - MEMO pipeline runner - Attendee timelines - Stop the MCP logged-out pain Really excited about this one. I can't live

3:47 AM · May 14, 2026 · 13.1K Views

13 Replies · 2 Reposts · 116 Likes

Share Ben's Bites

* sponsors who make this newsletter possible :)
Wanna partner with us for the next quarter?
Email us at shanice@bensbites.com or k@bensbites.com