Wednesday, April 29, 2026

Show HN: Drive any macOS app in the background without stealing the cursor https://ift.tt/Yp1Iwyh

Show HN: Drive any macOS app in the background without stealing the cursor Hi HN, Francesco from Cua here. I hacked this project together last weekend, inspired by the Codex Computer-Use release and lessons learned from deploying GUI-operating agents for our customers. The main problem: when a UI automation process controls a desktop app today, it usually takes over the human’s session. Your cursor moves, keyboard focus gets stolen, windows jump to the front, and you have to stop working until the agent is done. That is why we have historically avoided encouraging users to run these processes directly on their host machine, instead relying on VMs or GUI containers for concurrency and background execution. But computer-use - the tools we give agents to operate computers like humans - does not scale cleanly that way. As models get smarter, agents need to share hosts safely, run in the background, and avoid collisions with the human or other agents using the same machine. We realized macOS has no first-class API for "drive this app without touching the cursor". CGEventPost routes through the hardware input stream, so it moves your cursor. CGEvent.postToPid avoids the cursor warp, but Chromium treats those events as untrusted and silently drops clicks at the renderer boundary. Activating the target app first raises the window and pulls focus, defeating the point of background execution. Cua Driver is our attempt at a real fix: a background computer-use driver for macOS that lets an agent click, type, scroll, and read native apps while your cursor, frontmost app, and Space stay where they are. The default interface is a CLI, so it is easy to script or call from any coding agent shell. Try it on macOS 14+: /bin/bash -c "$(curl -fsSL https://ift.tt/ThAeNzy... )" The first internal use case was delegated demo recording. We ask Claude Code to drive an app while 'cua-driver recording start' captures the trajectory, screenshots, actions, and click markers. The result is an agent-generated product demo, Screen Studio inspired. Other things we have used it for: - Replacing Vercel’s agent-browser and other browser-use CLIs. With Claude Code and Cua Driver, you do not need Chrome DevTools Protocol at all. - A dev-loop QA agent that reproduces a visual bug, edits code, rebuilds, and verifies the UI while my editor stays frontmost. - Personal-assistant flows that use iMessage from Claude Code, Hermes, or other general-purpose agent CLIs. - Pulling visual context from Chrome, Figma, Preview, or YouTube windows I am not looking at, without relying on their APIs. What made this harder than expected: - CGEventPost warps the cursor because it goes through the HID stream. - CGEvent.postToPid does not warp the cursor, but Chromium drops it at the renderer IPC boundary. - Activating the target first raises the window and can drag you across Spaces. - Electron apps stop keeping useful AX trees alive when windows are occluded without a private remote-aware SPI. The unlock was SkyLight. SLEventPostToPid is a sibling of the public per-PID call, but it travels through a WindowServer channel Chromium accepts as trusted. Pair it with yabai’s focus-without-raise pattern, plus an off-screen primer click at (-1, -1), and the click lands without the window ever raising. One thing we learned: the right addressing mode depends on the app. Native macOS apps usually have rich AX trees, Chromium-family apps often need a hybrid of AX and screenshots, and apps like Blender or CAD tools may expose almost no useful AX surface. The mistake is defaulting to pixels everywhere - or defaulting to AX everywhere. Long technical writeup: https://ift.tt/SPXnfhr... I would like feedback from people building Mac automation, agent harnesses, or accessibility tooling. If it breaks on an macOS app you care about, that is useful data for us. https://ift.tt/uXf2Fsz April 28, 2026 at 09:33PM

Show HN: I mapped the latest UK fuel prices by county https://ift.tt/T7pLjPH

Show HN: I mapped the latest UK fuel prices by county I built this using the official UK government forecourt fuel price feed. The map aggregates the latest petrol and diesel prices by county, with filters for fuel type and metric. Clicking a county shows the cheapest forecourt, average price, spread, and station count. The feed covers roughly 8,000 UK forecourts and refreshes every 30 minutes. Retailers publish the prices, so there can still be gaps in the data/stations but it's getting better over time. https://ift.tt/VrZzi1y April 29, 2026 at 12:12AM

Join Us May 3: Muni Appreciation Day Kicks Off SF City Football Club's New Season

Join Us May 3: Muni Appreciation Day Kicks Off SF City Football Club's New Season
By Danbee Song

When you head out for a match, you'll spot the iconic Muni ”worm” logo on SFCFC jerseys.   The 2026 San Francisco City Football Club (SFCFC) season is almost here — and there’s more to celebrate than just soccer. SFCFC is the country’s first — and San Francisco’s only — supporter-owned soccer club. This weekend, it returns with fresh energy, a full home schedule and a continued partnership with Muni. The season kicks off Sunday, May 3 with Muni Appreciation Day. This special match celebrates the riders and employees who help keep San Francisco moving. Whether you're a longtime supporter or a...



Published 2026-04-28T00:00:00Z
https://ift.tt/y1eXwQ4

Show HN: Open Bias – proxy that enforces agent behavior at runtime https://ift.tt/StXUCs4

Show HN: Open Bias – proxy that enforces agent behavior at runtime https://ift.tt/I7an4Qd April 29, 2026 at 12:02AM

Tuesday, April 28, 2026

Show HN: Waiting for LLMs Suck – Give your user a game https://ift.tt/Zkr2RWn

Show HN: Waiting for LLMs Suck – Give your user a game Give your user a game while they wait for the LLM to return a result. https://ift.tt/z7D6qS5 April 28, 2026 at 08:15AM

Show HN: AgentSwift – open-source iOS builder agent https://ift.tt/V5E9Aa2

Show HN: AgentSwift – open-source iOS builder agent I'm working on a coding agent for building iOS apps. It's built on openspec and xcodebuildmcp. It's free and open source. https://ift.tt/03WPcNl April 28, 2026 at 06:44AM

Show HN: 49Agents – Infinite canvas IDE for AI agents https://ift.tt/8WObv2Y

Show HN: 49Agents – Infinite canvas IDE for AI agents https://ift.tt/u5KgYXz April 28, 2026 at 06:06AM

Show HN: Drive any macOS app in the background without stealing the cursor https://ift.tt/Yp1Iwyh

Show HN: Drive any macOS app in the background without stealing the cursor Hi HN, Francesco from Cua here. I hacked this project together la...