Weekly Update #30
TL;DR
Pochi can now operate inside your real Chrome browser: logged-in sessions, your cookies, your stored state. Task Memory is now inspectable mid-task so you know exactly what shapes compaction, and background jobs respect the same command policies as executeCommand, closing the last open gap in scoped tool access.
More below!
🚀 Features
-
Browser Agent - Local Chrome Support: Pochi's browser agent can now connect to and control your locally running Chrome browser. When you ask it to use local Chrome, the agent checks whether Chrome is already open, starts it with your default profile if needed, then connects via the Chrome DevTools Protocol to operate directly inside your session.
Previously, the browser agent only ran headless sessions with no cookies, no stored logins, and no saved state, so anything behind a login was effectively out of reach.
Most useful when your task requires accessing GitHub, internal dashboards, or any service you're already authenticated with. Requires Chrome 144+ with remote debugging enabled at chrome://inspect/#remote-debugging. #1530
-
View Task Memory: A Task Memory button now appears in the Token Usage popover once Pochi has extracted at least one memory entry. Clicking it opens memory.md, the structured session notes Pochi has been accumulating throughout your task.
Task Memory was introduced in v0.46, but the extraction process was invisible. You couldn't verify what had been stored or how it would shape compaction.
Now you can inspect the memory file directly at any point during a long task. #1528
-
Scoped Tool Access - startBackgroundJob Policy: startBackgroundJob now respects the same command-pattern policies as executeCommand. Declare startBackgroundJob(npm run dev) in your custom agent config to restrict which background commands the agent can launch.
v0.46 introduced scoped tool access for executeCommand, readFile, writeToFile, applyDiff, editNotebook, and webFetch, but startBackgroundJob was left out. An agent restricted to executeCommand(git *) could still run arbitrary shell commands via a background job.
The two tools are kept independent: an executeCommand(git *) rule does not authorize startBackgroundJob calls, and vice versa. The same syntax you already use for executeCommand works here. For the full policy configuration reference, check the custom agent docs. #1538
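To make the independence between the two tools concrete, here is a minimal sketch of how per-tool command patterns could be evaluated. It is not Pochi's implementation: `parseRule`, `globToRegExp`, and `isAllowed` are hypothetical names, and treating `*` as "any run of characters" is an assumption about the pattern syntax.

```typescript
// Hypothetical sketch; the rule parsing and glob semantics are assumptions, not Pochi's code.
type ToolRule = { tool: string; pattern: string | null };

// Parse "executeCommand(git *)" into { tool, pattern }; a bare tool name carries no pattern.
function parseRule(rule: string): ToolRule {
  const match = /^(\w+)(?:\((.*)\))?$/.exec(rule.trim());
  if (!match) throw new Error(`Malformed rule: ${rule}`);
  return { tool: match[1] as string, pattern: match[2] ?? null };
}

// Convert a glob such as "git *" into an anchored RegExp where "*" matches any run of characters.
function globToRegExp(glob: string): RegExp {
  const escaped = glob.replace(/[.+?^${}()|[\]\\]/g, "\\$&").replace(/\*/g, ".*");
  return new RegExp(`^${escaped}$`);
}

// A call is allowed only when a rule for the SAME tool matches the command,
// so an executeCommand rule never authorizes startBackgroundJob (and vice versa).
function isAllowed(rules: string[], tool: string, command: string): boolean {
  return rules
    .map(parseRule)
    .some((r) => r.tool === tool && (r.pattern === null || globToRegExp(r.pattern).test(command)));
}

const rules = ["executeCommand(git *)", "startBackgroundJob(npm run dev)"];
console.log(isAllowed(rules, "executeCommand", "git status"));     // allowed by the git rule
console.log(isAllowed(rules, "startBackgroundJob", "git status")); // git rule does not carry over
console.log(isAllowed(rules, "startBackgroundJob", "npm run dev"));
```

A plain glob like this would also accept chained commands such as `git status && rm -rf /`, so a real policy check likely needs more than pattern matching; the custom agent docs remain the reference for actual behavior.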
✨ Enhancements
-
Auto-Expand Task Panel Input: The task panel input now expands as you type, up to 30% of viewport height or 420px, before switching to internal scrolling.
Previously it was capped at 128px — roughly 4 visible lines. Pasting a multi-paragraph prompt or a long file path meant scrolling through a cramped box to review what you'd typed before sending.
Most noticeable when dropping in detailed instructions or long context at the start of a task. #1531
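As a rough illustration of the sizing rule, the sketch below grows the input with its content and switches to scrolling at the cap. It assumes the cap is the smaller of 30% of the viewport height and 420px, which is one reading of the description above; `inputHeight` and the constant names are hypothetical.

```typescript
// Hypothetical sketch; taking the smaller of the two limits is an assumption.
const MAX_VIEWPORT_FRACTION = 0.3;
const MAX_HEIGHT_PX = 420;

function inputHeight(
  contentHeightPx: number,
  viewportHeightPx: number,
): { heightPx: number; scrolls: boolean } {
  // The cap is whichever limit is reached first.
  const capPx = Math.min(viewportHeightPx * MAX_VIEWPORT_FRACTION, MAX_HEIGHT_PX);
  return {
    heightPx: Math.min(contentHeightPx, capPx), // grow with content until the cap
    scrolls: contentHeightPx > capPx,           // past the cap, scroll internally
  };
}

console.log(inputHeight(200, 1000)); // short prompt: fits without scrolling
console.log(inputHeight(900, 2000)); // long prompt on a tall viewport: capped, scrolls
```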
🐛 Bug Fixes
-
Thinking Summary Display: The thinking summary block now displays correctly for Anthropic models. #1526
-
Browser Recording Codec and Canvas: Browser recordings no longer fail on platforms where the hard-coded H.264 codec isn't available — the recorder now detects a supported codec at runtime. Frames are also rendered to a consistent 854×480 canvas with letterboxing, fixing distorted or broken recordings from variable source dimensions. #1535, #1537
-
Collapsed Tool Container Chevron: The chevron icon for collapsed tool containers now points left again, as intended. A previous change had accidentally flipped it to point right. #1529
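For the recording fix, the letterboxing step can be sketched as a small geometry calculation: scale the source frame to fit inside the 854×480 canvas while preserving its aspect ratio, then center it. This is a sketch under stated assumptions, not the recorder's actual code; `letterbox` and `Rect` are hypothetical names, and the rounding choices are illustrative.

```typescript
// Hypothetical sketch of fitting a frame into a fixed 854x480 canvas with letterboxing.
const CANVAS_WIDTH = 854;
const CANVAS_HEIGHT = 480;

type Rect = { x: number; y: number; width: number; height: number };

function letterbox(srcWidth: number, srcHeight: number): Rect {
  // Use the smaller scale factor so the whole frame fits without distortion.
  const scale = Math.min(CANVAS_WIDTH / srcWidth, CANVAS_HEIGHT / srcHeight);
  const width = Math.round(srcWidth * scale);
  const height = Math.round(srcHeight * scale);
  // Center the scaled frame; the leftover space becomes the bars.
  return {
    x: Math.floor((CANVAS_WIDTH - width) / 2),
    y: Math.floor((CANVAS_HEIGHT - height) / 2),
    width,
    height,
  };
}

console.log(letterbox(1000, 1000)); // square source: bars on the left and right
console.log(letterbox(854, 480));   // matching source: fills the canvas
```

The resulting rectangle is what you would pass as the destination arguments when drawing each frame, so every frame lands on the same canvas size regardless of the source dimensions.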