My RAG Was Broken. The Problem Was Chunking.

📅 5 days ago · ⏱ 1 min read

I thought RAG was hard because of embeddings. I thought vector databases were the issue. I thought the LLM was the problem. I was wrong.

The system returned results. The LLM gave answers. But the answers were wrong. They lacked context. They were irrelevant.

The problem was document splitting. A RAG system can only retrieve the chunks you give it. Bad chunking fails in two ways.

Large chunks add noise. One 20-page chapter as one chunk buries the relevant passage. Small chunks break meaning. Splitting a sentence in half kills the point.

Use overlap. Repeating a little text on both sides of a boundary keeps an idea that straddles the split intact in at least one chunk. Avoid fixed character counts. Text does not fall neatly into 500-character blocks.
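Here is a minimal sketch of overlap, assuming sentences are a good enough unit for your text. The function names, the naive regex sentence splitter, and the default sizes are illustrative assumptions, not taken from any particular library.

```python
import re


def split_sentences(text: str) -> list[str]:
    # Naive splitter (assumption): break after ., ! or ? followed by whitespace.
    # Real documents with abbreviations or decimals need a proper tokenizer.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def chunk_with_overlap(text: str, size: int = 3, overlap: int = 1) -> list[str]:
    """Group sentences into chunks of `size`, repeating `overlap` sentences
    at the start of each following chunk."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must be >= 0 and smaller than size")
    sentences = split_sentences(text)
    step = size - overlap
    chunks = []
    for start in range(0, len(sentences), step):
        chunks.append(" ".join(sentences[start:start + size]))
        # Stop once the window has reached the last sentence.
        if start + size >= len(sentences):
            break
    return chunks


text = "A one. B two. C three. D four. E five."
for chunk in chunk_with_overlap(text, size=3, overlap=1):
    print(chunk)
```

With these settings the sentence "C three." appears in both chunks, so a question that depends on it together with either neighbor can still be answered from a single chunk.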

Use document structure instead. Split where the document itself breaks, for example at headings and paragraph boundaries.
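One way to do this is sketched below, assuming the source is plain text with Markdown-style "#" headings and blank lines between paragraphs. The function name, the max_chars budget, and the output shape are hypothetical choices for illustration.

```python
def chunk_by_structure(text: str, max_chars: int = 800) -> list[dict]:
    """Pack whole paragraphs into chunks, never crossing a heading.

    Assumes '#' starts a heading line and blank lines separate blocks.
    """
    chunks: list[dict] = []
    heading = ""
    buffer: list[str] = []

    def flush() -> None:
        # Emit the buffered paragraphs as one chunk under the current heading.
        if buffer:
            chunks.append({"heading": heading, "text": "\n\n".join(buffer)})
            buffer.clear()

    for block in text.split("\n\n"):
        block = block.strip()
        if not block:
            continue
        if block.startswith("#"):
            flush()  # a new section always starts a new chunk
            first_line, _, rest = block.partition("\n")
            heading = first_line.lstrip("#").strip()
            block = rest.strip()
            if not block:
                continue
        # Start a new chunk rather than splitting a paragraph mid-way.
        if buffer and sum(len(p) for p in buffer) + len(block) > max_chars:
            flush()
        buffer.append(block)
    flush()
    return chunks


doc = "# Intro\n\nFirst paragraph.\n\nSecond paragraph.\n\n# Setup\n\nInstall it."
for chunk in chunk_by_structure(doc):
    print(chunk["heading"], "->", repr(chunk["text"]))
```

Keeping the heading alongside each chunk is a design choice: it gives the retriever and the LLM the section context that a bare paragraph would lose. A single paragraph longer than max_chars is kept whole in this sketch.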

Chunking is retrieval engineering. It decides what the retriever can find, and so it determines answer quality.

Fix your chunks before you change your model.
