Sunday, August 10, 2025

Show HN: Runtime – skills-based browser automation that uses fewer tokens https://ift.tt/dyIWiq6

Hi HN, I’m Bayang. I’m launching Runtime, a desktop tool that automates your existing browser using small, reusable skills instead of big, fragile prompts.

Links
- README: https://ift.tt/RtxBga6
- Skills guide: https://ift.tt/BA2dSr9

Why did I build it?

I was using browser automation for my own work, but it got slow and expensive because it pushed huge chunks of each page to the model. I also saw agent systems like browser-use that stream the live or processed DOM and “guess” the next click. It looked cool, but it felt heavy and flaky.

I asked a few friends what they would really want from a browser that does some of their work, such as repetitive tasks. All three said: “I want to teach my browser or just explain to it how to do my tasks.” Also: “Please don’t make me switch browsers. I already have my extensions, theme, and setup.”

That’s where Runtime came from: keep your browser, keep control, make automation predictable.

Runtime takes a task in chat (I’m open to challenges on the user experience of conversing with Runtime), then runs a short plan made of skills. A skill is a set of functions with inputs and an expected output. Examples: “search a site,” “open a result,” “extract product fields,” “click a button,” “submit a form.” Because plans use skills (not whole pages), prompts stay tiny and the process stays deterministic and fast.

What’s different
- Uses your browser (Chrome/Edge, soon Brave). No new browser to install.
- Deterministic by design. Skills are explicit and typed; runs are auditable.
- Low token use. We pass compact actions, not the full DOM. Most importantly, we don’t take screenshots at all: we believe screenshots are useless when selectors are used to navigate.
- Human-in-the-loop. You can watch the steps and stop/retry anytime.

Who is it for?

People who do research/ops on the web: pull structured info, file forms, move data between tools, or run repeatable flows without writing a full RPA script or using any API. It’s just “runtime run at runtime.”

Try this first (5–10 minutes)
1. Clone the repo and follow the quickstart in the README.
2. Run a sample flow: search → open → extract fields.
3. Read `SKILLS.md`, then make one tiny skill for a site you use daily.

What’s not perfect yet

Sites change, and skills have to change with them; we will post about how we address this. I’d love to hear where it breaks.

Feedback I’m asking for
- Is the skills format clear? Does being declarative help?
- Where does the planner over-/under-specify steps?
- Which sites should we ship skills for first?

Happy to answer everything in the comments, and would love a teardown.

Thanks!
Bayang

https://ift.tt/ikHPZwt August 9, 2025 at 11:15PM
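
To make the skills idea in the Runtime post concrete, here is a minimal sketch of a typed skill and a short plan (search → open → extract fields). It is not Runtime's actual format: the `Skill` shape, `run_plan`, and the `FAKE_SITE` catalogue are hypothetical stand-ins, and plain functions replace real browser actions.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List, Tuple


@dataclass(frozen=True)
class Skill:
    """A named step with declared inputs and one expected output (hypothetical shape)."""
    name: str
    inputs: Tuple[str, ...]   # names of values the skill reads from the context
    output: str               # name of the value the skill writes back
    run: Callable[..., Any]


def run_plan(plan: List[Skill], context: Dict[str, Any]) -> List[str]:
    """Run skills in order, passing only their declared inputs; return an audit log."""
    log = []
    for skill in plan:
        missing = [key for key in skill.inputs if key not in context]
        if missing:
            raise KeyError(f"{skill.name}: missing inputs {missing}")
        args = {key: context[key] for key in skill.inputs}
        context[skill.output] = skill.run(**args)
        log.append(f"{skill.name}({', '.join(skill.inputs)}) -> {skill.output}")
    return log


# Stand-in for a website: a tiny in-memory catalogue instead of real pages.
FAKE_SITE = {"kettle": {"title": "Steel Kettle", "price": "24.90"}}

search = Skill("search_site", ("query",), "result_id",
               lambda query: query.lower())
open_result = Skill("open_result", ("result_id",), "page",
                    lambda result_id: FAKE_SITE[result_id])
extract = Skill("extract_fields", ("page",), "fields",
                lambda page: {"title": page["title"], "price": float(page["price"])})

context = {"query": "Kettle"}
audit = run_plan([search, open_result, extract], context)
```

Each step only sees the values it declares, so nothing page-sized needs to travel between steps, and the `audit` list records exactly what ran and in what order.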

Saturday, August 9, 2025

Show HN: I made a safe anonymous message app https://ift.tt/39Mx6Zu

Subrosa is an anonymous message-sharing platform where anyone can visit your unique link and write whatever’s on their mind: secret confessions, honest thoughts, or wild opinions, completely anonymously. You read what people say about you on your personal dashboard.

What sets this apart is the AI-powered moderation that filters out hate speech, abuse, and spam before it ever reaches you, creating a safe space for honesty without toxicity.

This is an alpha release with a basic UI while we focus on testing core functionality. Try it out, share your link, and experience raw, honest, and clean anonymous messaging. To test the moderation, you can send messages to me at https://subrosa.vercel.app/martianmanhunter

Relevant links:
- https://subrosa.vercel.app/ : Homepage
- https://subrosa.vercel.app/signup
- https://subrosa.vercel.app/login
- https://subrosa.vercel.app/dashboard : Where you can see the messages you received
- https://subrosa.vercel.app/[username] : Your personal link, which you can post on your socials etc. to attract comments

P.S. Please don’t share personal or sensitive information.

https://subrosa.vercel.app/ August 9, 2025 at 06:50AM

Show HN: Tiered storage and fast SQL for InfluxDB 1.x/2.x https://ift.tt/O2IU6if

If you’ve run InfluxDB at scale, you know the pain: retention policies mean throwing away history, and keeping everything means huge hardware and license costs. We built ExyData Historian to fix that.

What does it do?
- Automatically exports old InfluxDB 1.x/2.x data to compressed Parquet in S3 or MinIO
- Keeps recent data hot in InfluxDB and moves the rest to cheap storage
- Runs fast SQL on archived data via Apache Arrow + DuckDB
- Lets you query it all through one interface and API, with no hot/cold boundary for the user

Why it matters
- 70–80% lower storage costs
- Historical queries that are as fast as (or faster than) InfluxDB itself
- No manual exports, no query rewrites, no downtime

Who’s using it right now?

InfluxDB Enterprise customers and operators of huge OSS instances, including telcos and logistics companies, are trying it right now. We help you reduce your Enterprise licensing cost, because you can shrink your InfluxDB cluster.

You keep your existing InfluxDB running. Historian works alongside it, moving history to cheap storage while giving you more analytics power.

We’d love feedback from anyone managing large InfluxDB deployments.

https://ift.tt/P2cTFey August 9, 2025 at 03:48AM
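
One way to picture the "no hot/cold boundary" claim in the Historian post is a router that splits a requested time range at the retention cutoff, so recent data is read from InfluxDB and older data from the Parquet archive. The sketch below is an illustration only, not Historian's implementation: `split_range`, the tier labels, and the 30-day hot window are assumptions.

```python
from datetime import datetime, timedelta
from typing import List, Tuple

# (tier, start, end); "hot" stands for InfluxDB, "cold" for the Parquet archive.
Range = Tuple[str, datetime, datetime]


def split_range(start: datetime, end: datetime,
                now: datetime, hot_window: timedelta) -> List[Range]:
    """Split [start, end) at the hot/cold cutoff so each part goes to one tier."""
    if end <= start:
        raise ValueError("end must be after start")
    cutoff = now - hot_window          # data older than this is assumed archived
    parts: List[Range] = []
    if start < cutoff:
        parts.append(("cold", start, min(end, cutoff)))
    if end > cutoff:
        parts.append(("hot", max(start, cutoff), end))
    return parts


now = datetime(2025, 8, 9)
parts = split_range(datetime(2025, 6, 1), datetime(2025, 8, 1),
                    now, hot_window=timedelta(days=30))
```

A caller would run one sub-query per part and concatenate the results, which is what lets the user write a single query over the whole range.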

Show HN: I made FiscalBud to send invoices fast and worldwide in 77 languages https://ift.tt/psUXPTm

hi! i built an app that takes the pain out of invoicing so you can send invoices faster and worldwide without a headache.

i've always found invoicing to be a waste of time: switching between templates, calculating taxes, tracking different currencies, and keeping files organized. so i made FiscalBud :) tools like stripe inspired the idea, but for invoices.

it lets you create, customize, and send professional invoices to clients anywhere in the world in just minutes. it supports 8 currencies, 77 languages (you can choose the output data language and ui language separately), and works in 248 countries, so you can bill confidently on a global scale.

it comes with smart templates, automatic tax/subtotal/total calculations, localized csv exports, and cloud storage to keep everything organized. coming soon: automated recurring invoices, payment reminders, and follow-ups.

it's built to be secure and privacy-focused, with encryption and compliance baked in. you can even send invoices directly via email using your own smtp settings, with automatically signed pdfs.

i've got plenty of ideas for making it even better, like deeper automation and more integrations with other tools you already use (including Stripe, which is on the roadmap).

any feedback is much appreciated! :)

https://ift.tt/vhfn0mS August 9, 2025 at 02:56AM
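
As a rough illustration of the automatic tax/subtotal/total calculations the FiscalBud post mentions, the sketch below uses Python's `decimal` module so amounts are rounded to cents rather than accumulating binary floating-point error. The function name, the line items, and the 20% tax rate are made up for the example; the post does not describe how FiscalBud computes totals.

```python
from decimal import Decimal, ROUND_HALF_UP

CENT = Decimal("0.01")


def invoice_totals(lines, tax_rate):
    """Compute subtotal, tax, and total from (quantity, unit_price) pairs.

    Prices and the tax rate are passed as strings so Decimal parses them exactly.
    """
    subtotal = sum((Decimal(qty) * Decimal(price) for qty, price in lines),
                   Decimal("0"))
    subtotal = subtotal.quantize(CENT, rounding=ROUND_HALF_UP)
    tax = (subtotal * Decimal(tax_rate)).quantize(CENT, rounding=ROUND_HALF_UP)
    return {"subtotal": subtotal, "tax": tax, "total": subtotal + tax}


# Hypothetical invoice: two items at 19.99 and one at 5.00, taxed at 20%.
totals = invoice_totals([(2, "19.99"), (1, "5.00")], "0.20")
```

Real invoicing rules differ by jurisdiction (per-line versus per-invoice rounding, for instance), so the rounding mode here is one assumption among several possible ones.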

Show HN: Selfhostllm.org – Plan GPU capacity for self-hosting LLMs https://ift.tt/xlZ8FNL

A simple calculator that estimates how many concurrent requests your GPU can handle for a given LLM, with shareable results. https://ift.tt/kovfDHh August 8, 2025 at 11:19PM
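
The post does not say how the calculator arrives at its estimate. A common back-of-envelope approach, sketched below as an assumption rather than as the site's method, is to subtract the model weights and a fixed overhead from GPU memory and divide what is left by the KV-cache size of one full-context request. All parameter names and the example numbers are hypothetical.

```python
def max_concurrent_requests(vram_gib: float, weights_gib: float, overhead_gib: float,
                            layers: int, kv_heads: int, head_dim: int,
                            bytes_per_value: int, context_tokens: int) -> int:
    """Estimate how many full-context requests fit in the memory left after weights."""
    gib = 1024 ** 3
    free_bytes = (vram_gib - weights_gib - overhead_gib) * gib
    if free_bytes <= 0:
        return 0
    # Each token stores one key and one value vector per layer and KV head.
    kv_bytes_per_token = 2 * layers * kv_heads * head_dim * bytes_per_value
    kv_bytes_per_request = kv_bytes_per_token * context_tokens
    return int(free_bytes // kv_bytes_per_request)


# Hypothetical setup: 24 GiB GPU, 15 GiB of weights, 1 GiB overhead,
# 32 layers, 8 KV heads of dimension 128, 16-bit cache, 8192-token context.
estimate = max_concurrent_requests(24, 15, 1, 32, 8, 128, 2, 8192)
```

With these numbers each request needs exactly 1 GiB of KV cache, so the 8 GiB left over gives an estimate of 8 concurrent requests. Real servers often do better than this because requests rarely use their full context.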

Friday, August 8, 2025

Show HN: A light GPT-5 vs. Claude Code comparison https://ift.tt/uTA8xim

Hi HN! Can’t believe I’ve been here over 12 years and this is my first Show HN. I guess this is twofold.

One: I’m doing another startup! Charlie is an agent for TypeScript teams focusing heavily on augmentation. :)

Two: Over the last week or so we put GPT-5 (through our Charlie agent) head-to-head with Claude Code/Opus on 10 real TypeScript issues pulled from active OSS projects.

Our results
- GPT-5 beat Claude Code on all 10 case-by-case comparisons.
- Pull requests generated by GPT-5 resolved 29% more issues than o3.
- PR review quality rose 5% versus o3.

Head-to-head case study

We measured testability, description, and overall quality across 10 head-to-head PRs.

Testability measures how thoroughly a code change is exercised by meaningful, behavior-focused tests. It considers whether tests are present and aligned with the diff, whether they explore edge cases and real-world scenarios, and whether they avoid vacuous, misleading, or implementation-dependent patterns common in code generated by LLMs.

Description evaluates how clearly and accurately a pull request’s title and summary convey the purpose, scope, and structure of the code change. It emphasizes technical correctness, relevance to the diff, and clarity for future readers, penalizing vague, verbose, or hallucinated explanations often produced by code-generating agents.

Quality assesses the substance and craftsmanship of the code change itself, judging whether it is correct, minimal, idiomatic, and free from hallucinated constructs. It emphasizes clarity, alignment with project norms, and logical integrity, while identifying agent-specific pitfalls like over-engineering, incoherent abstractions, or invented utilities.

- Testability: Charlie (0.69) vs Claude (0.55)
- Description: Charlie (0.84) vs Claude (0.90)
- Overall Quality: Charlie (0.84) vs Claude (0.65)

Caveats
- Single-shot runs; no human feedback loop.
- Quality score uses a secondary LLM reviewer: subjective but transparent.

Definitely looking for feedback on more evaluations we can do. Please also nit-pick the prompts, ideas, harness design, and so on. Tell us if this bar (CI + types) is the right one, or what you’d track instead.

On a personal note: I’ve spent my career working on tools to help creators create, and I’m extremely passionate about enabling people to do more, more easily. I am still somewhat uneasy about Gen AI; however, I do believe the future is bright. Things are certainly going to change, and I would encourage you all to stay optimistic builders.

Thanks for taking a look!

https://ift.tt/cNDSQ0i August 8, 2025 at 12:26AM
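
For readers who want the gaps between the reported scores at a glance, the short sketch below subtracts Claude's score from Charlie's for each metric. It uses only the three score pairs given in the post; the variable names are illustrative, and no new measurements are implied.

```python
# (Charlie, Claude) scores as reported in the post.
scores = {
    "testability": (0.69, 0.55),
    "description": (0.84, 0.90),
    "overall quality": (0.84, 0.65),
}

# Positive delta: Charlie scored higher; negative: Claude scored higher.
deltas = {metric: round(charlie - claude, 2)
          for metric, (charlie, claude) in scores.items()}

for metric, delta in deltas.items():
    print(f"{metric}: {delta:+.2f}")
```

The deltas come out to +0.14 for testability, -0.06 for description, and +0.19 for overall quality, so description is the one metric where Claude's reported score is higher.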

Show HN: My Resume Is a Gameboy https://ift.tt/Pm4Vvzy

https://ift.tt/brk2gHe August 7, 2025 at 11:26PM

Show HN: I built a Raspberry Pi webcam to train my dog (using Claude) https://ift.tt/14pnsOW

Hey HN! I’m a Product Manager and made a DIY doggy cam (using Claude a...