Ask HN: What are you using to parse PDFs for RAG?

carlbren, Thursday, July 25, 2024

Hi, I'm looking for a simple way to convert PDFs into Markdown with integrated images and tables. I tried LlamaIndex, but it doesn't integrate images. I tried LangChain, but on some PDFs the footer gets parsed before the top of the page. I tried the Adobe PDF API, but it requires paying $25K upfront!

Comments

92