Figure 1: The TabRAG Architecture, a parsing-based RAG pipeline designed specifically for tables. pip install torch pip install 'git+https://github.com ...
This project is a small pipeline for exploring a corpus of text/PDF documents (e.g., the House Oversight Committee’s Jeffrey Epstein email release). Unzip the contents locally, e.g.: project-root/ ...