Thursday, March 16, 2023

Show HN: Schematic – A simple database management UI for Spring Boot https://ift.tt/x9wUs2C

Show HN: Schematic – A simple database management UI for Spring Boot https://ift.tt/UbfFaGQ March 16, 2023 at 05:41PM

Show HN: Learn ML and AI infrastructure in the browser https://ift.tt/R6L5B1f

Show HN: Learn ML and AI infrastructure in the browser We are the developers of Metaflow, an open-source package that we started at Netflix. Metaflow provides a human-friendly interface to the full stack of ML infrastructure, including data access, compute, workflow orchestration, and versioning. It is used by hundreds of companies across industries.
Over the past years, we have seen two major stumbling blocks for folks who want to learn to build real-world ML applications:
1) Setting up the full infrastructure stack in the cloud costs time and money. The investment is worth it once you know what you want to do, but that's not always the case in the beginning when you just want to explore.
2) While many excellent ML/AI tutorials exist publicly, there aren't many hands-on, interactive environments showing how to map models to real-world workflows running on real infrastructure.
This Show HN release is our attempt to address (1) and (2). We host free, fully-featured sandbox environments that include a private Kubernetes cluster, metadata service and UI, and a workflow orchestrator backed by Argo Workflows (1). If you like what you see, you can set up a similar environment in your cloud account using our open-source Terraform templates. In addition, leaning on industry experts, we have created a number of tutorials covering NLP, computer vision, OpenAI Whisper, and recommendation systems (2).
Hopefully you'll find these resources useful. Let us know what you think! https://ift.tt/tsaNEko March 16, 2023 at 12:10AM

Show HN: Ingest data from your customers (Prequel YC W21) https://ift.tt/xX3A2cO

Show HN: Ingest data from your customers (Prequel YC W21) Hey HN! Charles here from Prequel (https://prequel.co). We just launched the ability for companies to import data from their customers’ data warehouses or databases, and we wanted to share a little bit more about it with the community. If you just want to see how it works, here’s a demo of the product that Conor recorded: https://ift.tt/buIwdkW.
Quick background on us: we help companies integrate with their customers’ data warehouses or databases. We’ve been busy helping companies export data to their customers, and we’re currently syncing over 40bn rows per month on their behalf. But folks kept asking us if we could help them import data from their customers too. They wanted the ability to offer a 1st-party reverse ETL to their customers, similar to the 1st-party ETL capability we already helped them offer. So we built that product, and here we are.
Why would people want to import data? There are actually plenty of use-cases here. Imagine a usage-based billing company that needs a daily pull from its customers of all the billing events that happened, so that it can generate the relevant invoices. Or a fraud detection company that needs the latest transaction data from its customers so it can appropriately mark fraudulent ones.
There’s no great way to import customer data currently. Typically, people solve this in one of two ways. One is to import data via CSV. This works well enough, but it requires ongoing work on the part of the customer: they need to put a CSV together and upload it to the right place on a daily/weekly/monthly basis. This is painful and time-consuming, especially for data that needs to be continuously imported. The other is to make the customer write custom code to feed data to the company's API. This requires the customer to do a bunch of solutions engineering work just to get started using the product, which is a suboptimal onboarding experience.
So instead, we let the customer connect their database or data warehouse and we pull data directly from there, on an ongoing basis. They select which tables to import (and potentially map some columns to required fields), and that’s it. The setup only takes 5 minutes, and requires no ongoing work. We feel like that’s the kind of experience every company should provide when onboarding a new customer.
Importing all this data continuously is non-trivial, but thankfully we can reuse 95% of the infrastructure we built for data exports. It turns out our core transfer logic remains pretty much exactly the same, and all we had to do was ship new CRUD endpoints in our API layer to let users configure their source/destination. As a brief reminder about our stack, we run a Go backend and a TypeScript/React frontend on k8s.
In terms of technical design, the most challenging decisions we have to make are around making databases’ type systems play nicely with each other (kind of an evergreen problem really). For imports, we allow the data recipient to specify whether they want to receive this data as a JSON blob or as a nicely typed table. If they choose the latter, they specify exactly which columns they’re expecting, as well as what type guarantees those should uphold. We’re also working on the ability to feed that data directly into an API endpoint, and on adding post-ingestion validation logic.
We’ve mentioned this before but it bears repeating: we know that security and privacy are paramount here. We're SOC 2 Type II certified, and we go through annual white-box pentests to make sure that all our code is up to snuff. We never store any of the data anywhere on our servers. Finally, we offer on-prem deployments, so data never even has to touch our servers if our customers don't want it to. We’re really stoked to be sharing this with the community.
We’ll be hanging out here for most of the day, but you can also reach us at hn (at) prequel.co if you have any questions! March 15, 2023 at 09:34PM
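To make the "nicely typed table" option from the Prequel post concrete, here is a minimal sketch of how a data recipient might declare the columns it expects and check incoming rows against those type guarantees. This is not Prequel's implementation or API: `ColumnSpec`, `validate_row`, and the billing columns are all hypothetical names invented for illustration.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Any, Callable


@dataclass(frozen=True)
class ColumnSpec:
    """One column the recipient expects (hypothetical structure)."""
    name: str
    cast: Callable[[Any], Any]  # converts a source value to the target type
    nullable: bool = False


def validate_row(row: dict[str, Any], schema: list[ColumnSpec]) -> dict[str, Any]:
    """Return a typed copy of `row`, or raise ValueError if a guarantee is violated."""
    typed: dict[str, Any] = {}
    for col in schema:
        value = row.get(col.name)
        if value is None:
            if not col.nullable:
                raise ValueError(f"column {col.name!r} must not be null")
            typed[col.name] = None
            continue
        try:
            typed[col.name] = col.cast(value)
        except (TypeError, ValueError) as exc:
            raise ValueError(f"column {col.name!r}: cannot convert {value!r}") from exc
    return typed


# Made-up schema for a usage-based billing import.
BILLING_SCHEMA = [
    ColumnSpec("event_id", str),
    ColumnSpec("amount_cents", int),
    ColumnSpec("occurred_at", datetime.fromisoformat),
    ColumnSpec("note", str, nullable=True),
]

# A source row where the amount arrives as text and the optional note is absent.
row = {"event_id": "evt_1", "amount_cents": "1250", "occurred_at": "2023-03-15T21:34:00"}
typed_row = validate_row(row, BILLING_SCHEMA)
```

Rows that pass come back with each value converted to the declared type. A row that cannot be converted raises an error instead of being silently loaded, which is one simple way to read "type guarantees".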

Wednesday, March 15, 2023

Show HN: Quality News – Towards a fairer ranking algorithm for Hacker News https://ift.tt/9OjvVkD

Show HN: Quality News – Towards a fairer ranking algorithm for Hacker News Hello HN!
TL;DR:
- Quality News is a Hacker News client that provides additional data and insights on submissions, notably the upvoteRate metric.
- We propose that this metric could be used to improve the Hacker News ranking score.
- In-depth explanation: https://ift.tt/JZrsxzL
The Hacker News ranking score is directly proportional to upvotes, which is a problem because it creates a feedback loop: higher rank leads to more upvotes, which leads to higher rank, and so on:
Higher Rank → More Upvotes → Higher Rank → ...
As a consequence, success on HN depends almost entirely on getting enough upvotes in the first hour or so to make the front page and get caught in this feedback loop. Getting these early upvotes is largely a matter of timing, luck, and moderator decisions. So the best stories don't always make the front page, and the stories on the front page are not always the best.
Our proposed solution is to use upvoteRate instead of upvotes in the ranking formula. upvoteRate is an estimate of how much more or less likely users are to upvote a story compared to the average story, taking into account how much attention the story has received, based on a history of the ranks and times at which it has been shown. You can read about how we calculate this metric in more detail here: https://ift.tt/JZrsxzL
About 1.5 years ago, we published an article with this basic idea of counteracting the rank-upvotes feedback loop by using attention as negative feedback. We received very valuable input from the HN community ( https://ift.tt/KsCNr9v ). Quality News has been created based largely on this feedback. Currently, Quality News shows the upvoteRate metric for live Hacker News data, as well as charts of the rank and upvote history of each story.
We have not yet implemented an alternative ranking algorithm, because we don't have access to data on flags and moderator actions, which are a major component of the HN ranking score. We'd love to see the Hacker News team experiment with the new formula, perhaps on an alternative front page. This would allow the community to evaluate whether the new ranking formula is an improvement over the current one. We look forward to discussing our approach with you!
Links:
Site: https://ift.tt/cgd5kbS
Readme: https://ift.tt/JZrsxzL
Previous Blog Post: https://ift.tt/RSLfDkv...
Previous Discussion: https://ift.tt/KsCNr9v
https://ift.tt/xEDwfae March 15, 2023 at 08:36PM
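The idea behind upvoteRate, actual upvotes compared with the upvotes an average story would have received for the same exposure, can be illustrated with a short sketch. This is a simplification under stated assumptions, not the project's actual calculation (that is described in the linked readme). The per-rank attention shares, the site-wide upvote rate, and the `prior_weight` smoothing term are all invented for illustration.

```python
# Assumed share of total attention a story receives per minute at each rank.
# These numbers are invented for illustration only.
ATTENTION_SHARE_BY_RANK = {1: 0.10, 2: 0.06, 3: 0.04}
DEFAULT_SHARE = 0.01  # assumed share for any rank not listed above

# Assumed number of upvotes cast site-wide per minute.
SITEWIDE_UPVOTES_PER_MINUTE = 5.0


def expected_upvotes(rank_history):
    """Upvotes an average story would collect given a list of (rank, minutes) pairs."""
    return sum(
        ATTENTION_SHARE_BY_RANK.get(rank, DEFAULT_SHARE) * minutes * SITEWIDE_UPVOTES_PER_MINUTE
        for rank, minutes in rank_history
    )


def upvote_rate(upvotes, rank_history, prior_weight=2.0):
    """Ratio of actual to expected upvotes, pulled toward 1.0 when there is little data.

    The prior_weight smoothing is an assumption of this sketch.
    """
    expected = expected_upvotes(rank_history)
    return (upvotes + prior_weight) / (expected + prior_weight)


# A story that spent 30 minutes at rank 3, then 60 minutes at rank 1,
# and collected 74 upvotes: roughly twice what an average story would get.
history = [(3, 30), (1, 60)]
rate = upvote_rate(74, history)
```

Because the denominator grows with the attention a story has already received, a high rank no longer feeds back into a higher score on its own. A story has to keep earning upvotes faster than average to hold its value.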

Show HN: Mr. Graph. A graph definition and execution library for Python https://ift.tt/39BEvpG

Show HN: Mr. Graph. A graph definition and execution library for Python What: Mr. Graph is a Python library designed to make composing graphs of sync and async functions easy! Use Google-style docstrings to automagically create dataclasses and chain together function calls into graphs.
Why: I like the design of Dagster, but not the latency. For apps and systems engineering, sometimes I want to compose a graph out of regular Python functions. I don't need all the heavy machinery that comes with a full workflow engine.
Current features:
- Use with either async or sync functions
- Uses Google-style docstrings to name return values
- Creates dataclasses for each function's output
- Can infer pipelines from input and output signatures
- All directed acyclic graph layouts supported: linear, fan-in, fan-out
Future features:
- Better examples for use with async calls (like LLMs)
- Splitting dataclasses, better error handling, logging improvements
This is under active development. Any feedback, interest, or contributions are appreciated. Thanks! https://ift.tt/Bu2yJrw March 15, 2023 at 10:00PM
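The general technique of inferring a pipeline from function signatures and running a mix of sync and async functions can be sketched with the standard library alone. This is not Mr. Graph's API. As a simplifying assumption, each function's output is named after the function itself, rather than through docstrings as the library describes, and `run_graph` is a hypothetical helper.

```python
import asyncio
import inspect
from graphlib import TopologicalSorter


def number():
    return 20


async def doubled(number):
    await asyncio.sleep(0)  # stands in for real async work such as an API call
    return number * 2


def result(doubled, number):
    # Fan-in: depends on both upstream nodes.
    return doubled + number


async def run_graph(*funcs):
    """Wire functions by matching parameter names to function names, then run them."""
    by_name = {f.__name__: f for f in funcs}
    # Each node's dependencies are simply the names of its parameters.
    deps = {name: set(inspect.signature(f).parameters) for name, f in by_name.items()}
    values = {}
    # static_order() yields every node after all of its dependencies.
    for name in TopologicalSorter(deps).static_order():
        kwargs = {dep: values[dep] for dep in deps[name]}
        out = by_name[name](**kwargs)
        if inspect.isawaitable(out):  # async functions return a coroutine to await
            out = await out
        values[name] = out
    return values


# The order in which functions are passed does not matter.
values = asyncio.run(run_graph(result, doubled, number))
```

This sketch runs nodes one at a time in dependency order. A fuller implementation would likely run independent async nodes concurrently and report missing dependencies with a clearer error.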

Show HN: Discontent – Extension to combat garbage search engine results https://ift.tt/euomATB

Show HN: Discontent – Extension to combat garbage search engine results Creator here. In a nutshell, this extension is a "Like / Dislike" system for website results. I miss the days of things like the YouTube dislike bar, where you could quickly assess whether something was good or not. Currently sitting at ~90 users. To get it off the ground I've scraped the last year of HN to make a set of good links, and used a few content farm blacklists to make a set of bad links. Right now it's semi-useful and should only get better with time. There are some simple measures in place to prevent spam and abuse, but I'll cross that bridge if it ever comes to it. All feedback welcome! https://ift.tt/TQ5o1Mj March 15, 2023 at 07:01PM

Show HN: Gamaddy – Play Online Games https://ift.tt/f96QZbg

Show HN: Gamaddy – Play Online Games https://gamaddy.com/ March 15, 2023 at 01:01PM

Show HN: Do You Know RGB? https://ift.tt/t8kUpbO

Show HN: Do You Know RGB? https://ift.tt/OWhvmMT June 24, 2025 at 01:49PM