𝗙𝗶𝘅𝗶𝗻𝗴 𝗔𝗜 𝗢𝗯𝘀𝗲𝗿𝘃𝗮𝗯𝗶𝗹𝗶𝘁𝘆 𝗶𝗻 𝗠𝗮𝘀𝘁𝗿𝗮

📅3 hours ago⏱2 min read

OpenTelemetry is the standard for monitoring modern systems. Traditional traces work for most software. They fail for AI applications.

When you build AI, you need specific answers: • Which model generated the output? • Which provider did you use? • How many tokens did you consume? • Which embedding model processed your documents? • What was the cost of the operation?

These questions matter most in Retrieval-Augmented Generation (RAG) systems.

While contributing to Mastra, I found a gap in RAG embedding observability. Mastra exported metadata for many AI tasks, but RAG embedding spans lacked standard attributes.

Observability tools saw the embedding operation. They did not understand the context. They missed model details, provider info, and token usage.

A RAG pipeline follows these steps: • Documents • Chunking • Embedding Model • Vector Database • Similarity Search • LLM Generation

The embedding stage is vital. If you lack data here, debugging performance becomes hard.

OpenTelemetry uses semantic conventions to create a common language. Instead of every tool using custom names, everyone follows one standard. This allows tools to read attributes like: • gen_ai.system • gen_ai.request.model • gen_ai.usage.input_tokens

I submitted a pull request to map Mastra RAG embedding data to these OpenTelemetry standards.

The work included: • Exporting embedding model metadata • Exporting provider information • Mapping token usage metrics • Aligning attributes with global standards

This lets observability systems understand embeddings without custom code.

Production AI systems need visibility. You need to know which model causes latency or which provider costs the most. Standardized telemetry provides these answers automatically.

Open source teaches a great lesson. Not every good contribution adds a new feature. Sometimes the best work makes existing systems easier to monitor and run.

If you build AI infrastructure, do not ignore observability. The best AI systems are observable.

Source: https://dev.to/akash_santra_3c96613546c6/fixing-ai-observability-how-i-added-genai-semantic-support-for-rag-embedding-spans-in-mastra-4db9

Optional learning community: https://t.me/GyaanSetuAi

𝗙𝗶𝘅𝗶𝗻𝗴 𝗔𝗜 𝗢𝗯𝘀𝗲𝗿𝘃𝗮𝗯𝗶𝗹𝗶𝘁𝘆 𝗶𝗻 𝗠𝗮𝘀𝘁𝗿𝗮

Continue reading

𝗢𝗯𝘀𝗲𝗿𝘃𝗮𝗯𝗶𝗹𝗶𝘁𝘆 𝗶𝗻 𝗔𝗜: 𝗠𝗼𝗻𝗶𝘁𝗼𝗿𝗶𝗻𝗴 𝗶𝘀 𝗡𝗼𝘁 𝗘𝗻𝗼𝘂𝗴𝗵

𝗘𝗺𝗯𝗲𝗱𝗱𝗶𝗻𝗴 𝗟𝗶𝗳𝗲𝗰𝘆𝗰𝗹𝗲: 𝗖𝗼𝘀𝘁 𝘃𝘀 𝗙𝗿𝗲𝘀𝗵𝗻𝗲𝘀𝘀

𝗩𝗲𝗰𝘁𝗼𝗿 𝗦𝗲𝗮𝗿𝗰𝗵 𝗜𝘀 𝗡𝗼𝘁 𝗘𝗻𝗼𝘂𝗴𝗵 𝗳𝗼𝗿 𝗣𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻 𝗔𝗜

𝗛𝘆𝗯𝗿𝗶𝗱 𝗥𝗲𝘁𝗿𝗶𝗲𝘃𝗮𝗹 𝗮𝗻𝗱 𝗔𝗴𝗲𝗻𝘁 𝗢𝗯𝘀𝗲𝗿𝘃𝗮𝗯𝗶𝗹𝗶𝘁𝘆

𝗔𝗜 𝗖𝗵𝗮𝘁𝗯𝗼𝘁 𝗗𝗲𝘃𝗲𝗹𝗼𝗽𝗺𝗲𝗻𝘁 𝗚𝘂𝗶𝗱𝗲 𝗳𝗼𝗿 𝗥𝗔𝗚 𝗦𝘆𝘀𝘁𝗲𝗺𝘀