Saturday, September 9, 2023

Show HN: Like Instagram stories but for your groups https://ift.tt/ZIJ3MkB

Show HN: Like Instagram stories but for your groups https://kwakwa.com/ September 9, 2023 at 02:52PM

Show HN: Ghidra Plays Mario https://ift.tt/WOsoMBn

Show HN: Ghidra Plays Mario https://ift.tt/Dg7HFPQ September 9, 2023 at 06:12PM

Show HN: Which is faster? Puppeteer, Playwright or Selenium https://ift.tt/frxLcB2

Show HN: Which is faster? Puppeteer, Playwright or Selenium Hey Everyone, I just ran a [rather silly] race between Puppeteer (JS), Playwright (Python) and Selenium (Python) to see which one would be fastest on a simple scrape (using Google Colab so you can also run it) Far from a comprehensive benchmark, this race is 100% free from advanced configurations, multi-threading or anything complicated. It just opens Wallapop (a second hand marketplace in Spain) and times how long it takes to extract the first 2000 results of a search. If you like this simple format, have any ideas on how to improve a race like this or have a strong urge to prove Ward Cunningham wright, let me know in the comments! https://ift.tt/Fs5qY6I September 9, 2023 at 04:54PM

Show HN: Convert Youtube Video to Pdf https://ift.tt/SPFNxwo

Show HN: Convert Youtube Video to Pdf https://www.u2docs.com September 9, 2023 at 08:12AM

Show HN: Mkwhl – Python wheel creation utility https://ift.tt/j3mEs6k

Show HN: Mkwhl – Python wheel creation utility https://ift.tt/OjieZno September 9, 2023 at 03:38AM

Show HN: What an 8-bit computer can do [video] https://ift.tt/gdMuS0F

Show HN: What an 8-bit computer can do [video] Most under-evaluated 8bit: The plus/4 https://www.youtube.com/watch?v=dgm2eZMFuXw September 9, 2023 at 02:27AM

Show HN: New AI Dataset Based on LibGen and Sci-Hub https://ift.tt/Mci1tnd

Show HN: New AI Dataset Based on LibGen and Sci-Hub We recently began extracting the text layers of scholarly publications and books to include in our database. This encompasses sources such as scimag, libgen, and the latest zlib leaks. Our project, named the Standard Template Construct, also features a distributed search engine and incorporates various AI routines to handle the text corpus. Today we have releases our first dataset, STC230908. This dataset contains approximately 75,000 book texts, 1.3 million scholarly paper texts, and 24 million abstracts, including the years from 2021 to 2023. We're currently in the process of preparing the next version of the dataset, which will include an additional 300,000 books. How to Access Short Instructions: Install IPFS and launch it. pip3 install stc-geck && geck - documents More details: the dataset is released in IPFS and replicated to multiple nodes. It is in format of database for the search engine that we use in STC. GECK is the library that embeds this search engine and allows to stream all contained data in easy way. Even more detailed Instructions: https://ift.tt/tqxnvlc https://ift.tt/DSIv4Ek September 9, 2023 at 02:11AM