YT – Building Production RAG Over Complex Documents

YT – Building Production RAG Over Complex Documents
– Decided to stop after 28min. I believe if a pdf is important enough, it should have its own bespoke parser, not a universal parser.
– QA system for company
– how to get high-response quality
– naive RAG … glorified search
– todo1) improve data quality
– todo2) improve query quality
– RAG is only as good as your data
– need to make sure parameters are fine tuned
– Parse, Chunk, Index
– Complex documents i.e. table, chart, image, page # -> page chunking has limits
– Llama Parse – AAPL 10-K example / figure check test
– Advanced Indexing: nodes will be indexed by the LLM

Leave a Reply

Your email address will not be published. Required fields are marked *