Growing India News, world news, nation news, our news, people's news, grow news, entertainment, fashion, movies, tech, automobile and many more..
Thursday, July 31, 2025
Show HN: State of the Art Open-source alternative to ChatGPT Agents for browsing https://ift.tt/HgMKN5C
Hey HN,

We are Winston, Edward, and James, and we built Meka Agent, an open-source framework that lets vision-based LLMs execute tasks directly on a computer, just like a person would.

Backstory: In the last few months, we've been building computer-use agents that have been used by various teams for QA testing, but we realized that the underlying browsing frameworks aren't quite good enough yet. As such, we've been working on a browsing agent. We achieved 72.7% on WebArena, compared to the previous state of the art set by OpenAI's new ChatGPT agent at 65.4%. You can read more about it here: https://ift.tt/cgDrHpW .

Today, we are open sourcing Meka, our state-of-the-art agent, to allow anyone to build their own powerful, vision-based agents from scratch. We provide the groundwork for the hard parts, so you don't have to:

* True vision-based control: Meka doesn't just read HTML. It looks at the screen, identifies interactive elements, and decides where to click, type, and scroll.
* Full computer access: It's not sandboxed in a browser. Meka operates with OS-level controls, allowing it to handle system dialogues, file uploads, and other interactions that browser-only automation tools can't.
* Extensible by design: We've made it easy to plug in your own LLMs and computer providers.
* State-of-the-art performance: 72.7% on WebArena.

Our goal is to enable developers to create repeatable, robust tasks on any computer just by prompting an agent, without worrying about the implementation details.

We'd love to get your feedback on how this tool could fit into your automation workflows. Try it out and let us know what you think. You can find the repo on GitHub and get started quickly with our hosted platform, https://ift.tt/f69pnVH .

Thanks,
Winston, Edward, and James

https://ift.tt/XF2dwTS
July 30, 2025 at 07:41PM