Most RAG demos assume clean PDFs and perfect text extraction.
Enterprise reality is different:
- scanned documents
- broken layouts
- missing metadata
- inconsistent formats
This post explains how MQNotebook was designed around failure-first ingestion, OCR fallback, and reranking so that retrieval stays precise even when the input documents are messy.
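
To make "failure-first ingestion with OCR fallback" concrete, here is a minimal sketch of the idea, not MQNotebook's actual code. Every name in it (`IngestResult`, `looks_usable`, `ingest`, and the `extract_text` / `run_ocr` callables) is hypothetical, and the quality thresholds are illustrative assumptions. The point is only the shape: treat native extraction as something that can fail or return junk, fall back to OCR, and record what went wrong instead of silently indexing empty text.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class IngestResult:
    text: str
    source: str  # "native", "ocr", or "failed"
    warnings: List[str] = field(default_factory=list)


def looks_usable(text: str, min_chars: int = 50, min_alpha_ratio: float = 0.6) -> bool:
    """Heuristic quality gate (assumed thresholds): enough characters,
    and mostly letters/whitespace rather than extraction debris."""
    stripped = text.strip()
    if len(stripped) < min_chars:
        return False
    good = sum(ch.isalpha() or ch.isspace() for ch in stripped)
    return good / len(stripped) >= min_alpha_ratio


def ingest(
    data: bytes,
    extract_text: Callable[[bytes], str],  # hypothetical native text extractor
    run_ocr: Callable[[bytes], str],       # hypothetical OCR backend
) -> IngestResult:
    """Try native extraction first; fall back to OCR; never raise."""
    warnings: List[str] = []

    try:
        text = extract_text(data)
        if looks_usable(text):
            return IngestResult(text, "native", warnings)
        warnings.append("native extraction produced too little usable text")
    except Exception as exc:  # failure is an expected path, not an exception to the design
        warnings.append(f"native extraction failed: {exc}")

    try:
        text = run_ocr(data)
        if looks_usable(text):
            return IngestResult(text, "ocr", warnings)
        warnings.append("OCR produced too little usable text")
    except Exception as exc:
        warnings.append(f"OCR failed: {exc}")

    # Nothing usable: report the failure so the document can be flagged, not indexed.
    return IngestResult("", "failed", warnings)
```

In this sketch a scanned document, for which the native extractor returns an empty string, comes back with `source == "ocr"` and one warning explaining why the fallback ran. A document that defeats both paths comes back as `"failed"` with both reasons attached, which gives a downstream step something to log or route for manual review.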