Preparing PDFs for RAGs. | Towards Data Science

Photo by Annual Report Design Agency – Report Yak on Unsplash

Converting PDFs to text has always been possible, but it has never been easier.

I recently created a graph data store to be used in a RAG. In other words, I built a GraphRAG.

Graph RAGs are a strong alternative to other RAG designs, such as the widely used vector store-backed RAGs, because they bring reasoning over relationships to the table. For example, with semantic similarity search (the technique vector stores use to retrieve information), you could ask who the CFO of XYZ, Inc. was last year, because last year’s annual report for XYZ, Inc. would explicitly mention its CFO. But consider a question like this: which two directors of XYZ, Inc. studied at the same school? The question names no school, so similarity search has nothing to match against the passages that hold the answer, and retrieval is unlikely to fetch them. A graph RAG can answer it by following the relationships between directors and schools.
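To make the directors question concrete, here is a minimal sketch in plain Python of the kind of traversal a graph store makes possible. Everything in it is an assumption for illustration: the names, schools, and the `DIRECTOR_OF` / `STUDIED_AT` relation labels are invented, and a real GraphRAG would run an equivalent query against a graph database rather than an in-memory list.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical (subject, relation, object) triples, as they might be
# extracted from annual reports. All names and schools are made up.
TRIPLES = [
    ("Alice Tan", "DIRECTOR_OF", "XYZ, Inc."),
    ("Bob Reyes", "DIRECTOR_OF", "XYZ, Inc."),
    ("Carla Ito", "DIRECTOR_OF", "XYZ, Inc."),
    ("Alice Tan", "STUDIED_AT", "Northfield University"),
    ("Bob Reyes", "STUDIED_AT", "Lakeside College"),
    ("Carla Ito", "STUDIED_AT", "Northfield University"),
    ("Dev Patel", "DIRECTOR_OF", "ABC Corp."),
    ("Dev Patel", "STUDIED_AT", "Northfield University"),
]


def directors_sharing_school(triples, company):
    """Return (director_a, director_b, school) for directors of `company`
    who studied at the same school."""
    # Hop 1: find everyone connected to the company as a director.
    directors = {s for s, r, o in triples if r == "DIRECTOR_OF" and o == company}

    # Hop 2: group those directors by the school they attended.
    alumni = defaultdict(set)
    for s, r, o in triples:
        if r == "STUDIED_AT" and s in directors:
            alumni[o].add(s)

    # Any school with two or more directors yields matching pairs.
    pairs = []
    for school, people in sorted(alumni.items()):
        for a, b in combinations(sorted(people), 2):
            pairs.append((a, b, school))
    return pairs


result = directors_sharing_school(TRIPLES, "XYZ, Inc.")
print(result)
```

Note that the query never mentions a school name. The answer comes from joining two relationships, which is the step a similarity search over text chunks has no direct way to perform.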

However, the key issue here is how we construct the graph for retrieval. I recently addressed this in a separate post. Taking one more step back, how do we even prepare the annual…
