𝗩𝗲𝗰𝘁𝗼𝗿 𝗧𝗮𝗯𝗹𝗲𝘀 𝟭𝟬𝟭: 𝗨𝗻𝗱𝗲𝗿𝘀𝘁𝗮𝗻𝗱𝗶𝗻𝗴 𝗩𝗲𝗰𝘁𝗼𝗿 𝗮𝗻𝗱 𝗣𝗚𝗩𝗲𝗰𝘁𝗼𝗿

📅2 weeks ago⏱1 min read

You hear about vectors and pgvector. They sound complex. They are. You do not need to be an expert to use them.

A vector is a list of numbers. Example: [1, 2, 3, 4, 5].

Think of it as a point in space. Two numbers make a 2D point. Three numbers make a 3D point. Hundreds of numbers make a high-dimensional space.

Normal text search looks for exact words. Vector search looks for meaning.

Search for "Postgres database setup tutorial." The database finds "How to configure PostgreSQL." Words differ. Meaning is the same.

AI models turn text, images, or audio into vectors. These are embeddings. The AI puts similar ideas close together. Cat and Dog stay near each other. PostgreSQL and Kubernetes stay in another area.

You find the closest vector with these methods:

Euclidean Distance: Measures straight line distance.
Inner Product: Measures alignment.
Cosine Similarity: Measures the angle between vectors.

Dimensionality is the number of values in the list. More dimensions mean more detail.

Normalization makes all vectors the same length. This makes cosine similarity faster.

PostgreSQL has a tool called pgvector. It lets you store vectors in your tables. You use SQL to find the nearest vectors. The <-> operator finds the distance.

Comparing millions of vectors is slow. Use indexes like IVFFlat or HNSW to speed it up.

Use pgvector for:

Semantic search
RAG
Recommendations
Smart FAQs

Embeddings are points in space. AI is often geometry.

Source: https://dev.to/rmarsigli/vector-tables-101-understanding-vector-and-pgvector-once-and-for-all-3g68

𝗩𝗲𝗰𝘁𝗼𝗿 𝗧𝗮𝗯𝗹𝗲𝘀 𝟭𝟬𝟭: 𝗨𝗻𝗱𝗲𝗿𝘀𝘁𝗮𝗻𝗱𝗶𝗻𝗴 𝗩𝗲𝗰𝘁𝗼𝗿 𝗮𝗻𝗱 𝗣𝗚𝗩𝗲𝗰𝘁𝗼𝗿

Continue reading

𝗢𝗽𝘁𝗶𝗺𝗶𝘇𝗶𝗻𝗴 𝗣𝗼𝘀𝘁𝗴𝗿𝗲𝗦𝗤𝗟 𝗳𝗼𝗿 𝗔𝗜

𝗗𝗶𝘀𝘁𝗮𝗻𝗰𝗲𝘀 𝗮𝗻𝗱 𝗦𝗶𝗺𝗶𝗹𝗮𝗿𝗶𝘁𝘆 𝗶𝗻 𝗠𝗟

𝗩𝗲𝗰𝗍𝗼𝗿 𝗗𝗮𝗍𝗮𝗯𝗮𝗰𝗲𝘀 𝗶𝗻 𝗔𝗜 𝗣𝗿𝗼𝗷𝗲𝗰𝘁𝘀: 𝗔𝗿𝗲 𝗧𝗵𝗲𝘆 𝗥𝗲𝗮𝗹𝗹𝘆 𝗡𝗲𝗰𝗲𝘀 s𝗮𝗿𝘆?

𝗕𝘂𝗶𝗹𝗱 𝗦𝗲𝗺𝗮𝗻𝘁𝗶𝗰 𝗦𝗲𝗮𝗿𝗰𝗵 𝗪𝗶𝘁𝗵 𝗩𝗲𝗰𝘁𝗼𝗿 𝗗𝗮𝘁𝗮𝗯𝗮𝘀𝗲𝘀

𝗕𝘂𝗶𝗹𝗱𝗶𝗻𝗴 𝗦𝗲𝗺𝗮𝗻𝘁𝗶𝗰 𝗦𝗲𝗮𝗿𝗰𝗵 𝘄𝗶𝘁𝗵 𝗽𝗴𝘃𝗲𝗰𝘁𝗼𝗿 𝗮𝗻𝗱 𝗢𝗽𝗲𝗻𝗔𝗜