𝗥𝗔𝗚 𝗜𝘀 𝗔 𝗦𝗲𝗮𝗿𝗰𝗵 𝗣𝗿𝗼𝗯𝗹𝗲𝗺, 𝗡𝗼𝘁 𝗔𝗻 𝗔𝗜 𝗣𝗿𝗼𝗯𝗹𝗲𝗺

📅5 days ago⏱1 min read

I thought RAG was about Large Language Models. I was wrong.

I spent time on search instead of prompts. RAG is a search problem.

AI does not know everything. It knows only its training data. Ask it about yesterday's news. It might lie. RAG fixes this.

Think of a library. A librarian does not read every book to find one answer. The librarian finds the right book. They open the right page. They read the answer.

RAG works the same way. Search first. Generate second.

Machines struggle with keywords. Embeddings turn text into numbers. Similar meanings get similar numbers. Now the machine searches by meaning.

Chunking also matters. Break long books into small pieces. Small pieces make search precise.

Vector databases store these numbers. They find content similar to your question. Options include:

Pinecone
Weaviate
Qdrant
Milvus
pgvector

Better retrieval leads to better answers. The model needs the right data. Focus on these:

Chunking
Embeddings
Search quality
Metadata filtering
Reranking

Intelligence is not only in the model. Intelligence is finding the right info at the right time. RAG is a search problem using AI.

What did you learn while building RAG? Tell me in the comments.

Source: https://dev.to/threshika_vs/the-day-i-realized-rag-isnt-an-ai-problem-23ac Optional learning community: https://t.me/GyaanSetuAi

𝗥𝗔𝗚 𝗜𝘀 𝗔 𝗦𝗲𝗮𝗿𝗰𝗵 𝗣𝗿𝗼𝗯𝗹𝗲𝗺, 𝗡𝗼𝘁 𝗔𝗻 𝗔𝗜 𝗣𝗿𝗼𝗯𝗹𝗲𝗺

Continue reading

𝗥𝗔𝗚 𝗘𝘅𝗽𝗹𝗮𝗶𝗻𝗲𝗱 𝗳𝗼𝗿 𝗕𝗲𝗴𝗶𝗻𝗻𝗲𝗿𝘀

𝗕𝘂𝗶𝗹𝗱𝗶𝗻𝗴 𝗔 𝗥𝗔𝗚 𝗣𝗶𝗽𝗲𝗹𝗶𝗻𝗲 𝗜𝗻 𝗔 𝗪𝗲𝗲𝗸𝗲𝗻𝗱

𝗕𝘂𝗶𝗹𝗱𝗶𝗻𝗴 𝗮 𝗥𝗔𝗚 𝗽𝗶𝗽𝗲𝗹𝗶𝗻𝗲 𝗶𝗻 𝗮 𝘄𝗲𝗲𝗸𝗲𝗻𝗱

𝗥𝗔𝗚 𝗶𝗻 𝟴 𝗟𝗮𝘆𝗲𝗿𝘀: 𝗙𝗿𝗼𝗺 𝗧𝗼𝗸𝗲𝗻𝘀 𝘁𝗼 𝗣𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻

𝗙𝗿𝗼𝗺 𝗥𝗔𝗚 𝘁𝗼 𝗞𝗻𝗼𝘄𝗹𝗲𝗱𝗴𝗲 𝗗𝗶𝘀𝗰𝗼𝘃𝗲𝗿𝘆