Wednesday, April 3, 2024

Show HN: I just open sourced my document/website extractor for Vision-LLMs https://ift.tt/kZ9Q2rp

Show HN: I just open sourced my document/website extractor for Vision-LLMs Hi HackerNews, Lately, I have seen an explosion in posts offering paid APIs/services to get unstructured data into LLMs (i.e. langchain extract, ragflow, unstructured, unstract, just to name a few) and I have been largely disappointed by them, either because they fail to implement multimodal support, fail to give good context for "really tricky" PDFs / Word docs / Powerpoints, or are just plain difficult to use. In light of all these posts I figured I'd share my solution that has been working smoothly for me and my clients. I put it up on GitHub for free so you can check it out and hopefully offer some feedback / criticism or contribute to the code yourself. and BTW, I'm not trying to throw shade at any of the services mentioned, I'm just giving my honest experience in case there are others out there who feel the same way and want something that works Cheers! https://ift.tt/ePlsfS4 April 3, 2024 at 12:10AM

No comments:

Post a Comment

Show HN: IssuePay – Get paid for open-source contributions https://ift.tt/ujCNZEA

Show HN: IssuePay – Get paid for open-source contributions Hi HN! I’m Mario, and I’m about to launch IssuePay. Problem: Open-source contribu...