Saturday, September 9, 2023

Show HN: New AI Dataset Based on LibGen and Sci-Hub https://ift.tt/Mci1tnd

Show HN: New AI Dataset Based on LibGen and Sci-Hub We recently began extracting the text layers of scholarly publications and books to include in our database. This encompasses sources such as scimag, libgen, and the latest zlib leaks. Our project, named the Standard Template Construct, also features a distributed search engine and incorporates various AI routines to handle the text corpus. Today we have releases our first dataset, STC230908. This dataset contains approximately 75,000 book texts, 1.3 million scholarly paper texts, and 24 million abstracts, including the years from 2021 to 2023. We're currently in the process of preparing the next version of the dataset, which will include an additional 300,000 books. How to Access Short Instructions: Install IPFS and launch it. pip3 install stc-geck && geck - documents More details: the dataset is released in IPFS and replicated to multiple nodes. It is in format of database for the search engine that we use in STC. GECK is the library that embeds this search engine and allows to stream all contained data in easy way. Even more detailed Instructions: https://ift.tt/tqxnvlc https://ift.tt/DSIv4Ek September 9, 2023 at 02:11AM

Show HN: Find jobs at top AI startups https://ift.tt/j2DPkNx

Show HN: Find jobs at top AI startups Hello HN, I am one of the creators of WorkInAI, and I'm excited to share our project with the community and gather valuable feedback. WorkInAI is a job aggregation platform for positions at leading AI startups. We have compiled over 350 job listings from more than 20 top AI startups, including companies like OpenAI, Anthropic, Cohere, and more. We created this platform in response to a friend's frustration with trying to find suitable AI startup roles in London. He used to check various company career pages frequently to see if any new opportunities had arisen -- so we built this to aggregate jobs in a single place. We're launching this MVP early to gather feedback, whether it's feature requests or suggestions for adding new startups to our list. We value your thoughts and input on our product and idea. Thanks! https://workinai.xyz/ September 8, 2023 at 09:51PM

Friday, September 8, 2023

Show HN: Rivet – open-source AI Agent dev env with real-world applications https://ift.tt/n5BNrCc

Show HN: Rivet – open-source AI Agent dev env with real-world applications We just launched Rivet, the open-source visual AI programming environment! We built Rivet, because we were building complex AI Agent applications at Ironclad. It unlocked our abilities here, and we're excited to make available to the entire community. Backstory: A few months ago, inspired by things like LangChain and LlamaIndex, we started building an AI agent that could work with legal contracts. Unfortunately, we couldn't just use retrieval augmented generation (RAG), because a lot of contracts are basically identical (many chunks with near-identical embeddings), except for a few key details. So, we turned to things like ReAct and AutoGPT for inspiration. At first, things went great. We were adding agent capabilities, doing chain-of-thought prompting. But then we hit a wall. The agent became too complex. We had debugger breakpoints on almost every line of code, but we still had no idea where the agent was breaking. Every change we made destabilized something else. After two weeks of fumbling, I decided to end the project. But one of my teammates, Andy, didn't give up. The following week, he showed me v0 of Rivet. He'd used it to refactor and improve our existing agent. I was skeptical... it just seemed like a visual programming environment, and I was not a fan. But I gave it a shot, and suddenly found myself able to add new skills to the agent, debug brittle areas with ease, and update prompts with confidence. Rivet is a game-changer. And more than that, it makes building with LLMs super fun. What exactly makes it different? First, the debugger is incredible. You have to experience it to believe it. You can update a graph, and then immediately run it, and see where it succeeded or failed. Even better: you can attach Rivet as a remote debugger, and watch your agent graphs execute in your app. Second, visual programming is actually a game-changer for prompting LLMs. I don't know why exactly, but it's way easier to understand and organize your work when you have an extra dimension to work with. Finally, Rivet is built to be embedded into a larger application (TypeScript for now, but we've also found a way to run it in Python). Beyond importing Rivet as a dependency, you can also define "external functions" dynamically at run-time. It feels pretty sketchy to give a LLM a key and unfettered access to an API. With Rivet, you can give it access to a specific set of defined functions, potentially pre-scoped to the access level you want. ...Sorry that was long. If you read this whole thing, thank you! We're really excited to hear what you think! We just launched our first Rivet-based application at Ironclad, and we've been working with companies like Sourcegraph, Attentive, AssemblyAI, Bento, and Willow to make Rivet useful for others. https://ift.tt/ExMoCTb September 8, 2023 at 06:59PM

Show HN: Rocketify – your one-stop shop for growth hacking tools https://ift.tt/zt8fCAa

Show HN: Rocketify – your one-stop shop for growth hacking tools https://ift.tt/zdnplyT September 8, 2023 at 12:13PM

Show HN: Yggdrasil 1.0 – Native executables for Shen https://ift.tt/TN7lC3s

Show HN: Yggdrasil 1.0 – Native executables for Shen Yggdrasil allows for the generation of type secure stand-alone native language programs from Shen source programs. We are pleased to announce the release of Yggdrasil 1.0 which has a plug-in configured for Common Lisp. Yggdrasil requires Shen 34.6 which is now available from https://ift.tt/DQ2LYbv . Questions to https://ift.tt/2RTEjl6 . More details on Yggdrasil can be found at https://ift.tt/0mdi4L3 . September 7, 2023 at 11:59AM

Show HN: Study hack using AI powered article-to-podcast tool https://ift.tt/guvZsQE

Show HN: Study hack using AI powered article-to-podcast tool https://podstash.ai September 8, 2023 at 08:40AM

Show HN: Nero Burning ROM X Muse – The Burning Art of CD https://ift.tt/PpaJOTy

Show HN: Nero Burning ROM X Muse – The Burning Art of CD Hi HN, Have you heard of Nero Burning ROM? It was named after 'the Great Fire of Rome' as a pun. We are thrilled to announce that we're back with an AI-powered new product called 'Nero Muse' – a CD/Album cover generator. It's the great fire of virtuality now. https://muse.nero.com/ September 8, 2023 at 07:26AM

Show HN: Do You Know RGB? https://ift.tt/t8kUpbO

Show HN: Do You Know RGB? https://ift.tt/OWhvmMT June 24, 2025 at 01:49PM