Sunday, September 22, 2024

Show HN: PDF to MD by LLMs – Extract Text/Tables/Image Descriptives by GPT4o https://ift.tt/E6jaViq

I've developed a Python API service that uses GPT-4o for OCR on PDFs. It features parallel processing and batch handling for improved performance. Not only does it convert PDF to Markdown, but it also describes the images within the PDF using captions like `[Image: This picture shows 4 people waving]`. In testing with NASA's Apollo 17 flight documents, it successfully converted complex, multi-oriented pages into well-structured Markdown. The project is open-source and available on GitHub. Feedback is welcome. https://ift.tt/xU7QBoX September 22, 2024 at 07:35AM
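
For readers curious what "parallel processing and batch handling" could look like, here is a minimal sketch of the general pattern. It is not the project's actual code. The names `batched`, `pages_to_markdown`, `convert_page` and `fake_convert` are hypothetical, and the real GPT-4o request is replaced by a local stand-in so the example runs without network access or an API key.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterator, List, Sequence


def batched(items: Sequence[bytes], size: int) -> Iterator[Sequence[bytes]]:
    """Yield consecutive slices of at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def pages_to_markdown(
    pages: Sequence[bytes],
    convert_page: Callable[[bytes], str],
    batch_size: int = 4,
    max_workers: int = 4,
) -> str:
    """Convert pages batch by batch, running each batch's pages in parallel.

    `convert_page` is an assumed callable that turns one rendered page
    into Markdown; in a real service it would wrap a vision-model request.
    """
    parts: List[str] = []
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        for batch in batched(pages, batch_size):
            # Executor.map returns results in input order, so the
            # original page order is preserved in the output.
            parts.extend(pool.map(convert_page, batch))
    return "\n\n".join(parts)


def fake_convert(page: bytes) -> str:
    """Stand-in for the model call: emits a caption-style placeholder."""
    return f"[Image: placeholder caption for {page.decode()}]"


if __name__ == "__main__":
    print(pages_to_markdown([b"p1", b"p2", b"p3"], fake_convert, batch_size=2))
```

Threads suit this kind of workload because each page conversion would mostly wait on a network response, and processing in batches caps how many requests are in flight at once.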


Show HN: Shadcn/UI theme editor – Design and share Shadcn themes https://ift.tt/q4YZ3uV

Hey, I built https://ift.tt/yZxliP5 - a web app for creating and sharing th...