I Built the Wrong Scraper First
I built a scraper. It worked. That was the problem.
The scraper grabbed titles and prices. The data looked clean. But the output was useless.
I treated a marketplace like a catalog. Marketplaces have state.
A basic scrape misses key facts:
- Is the item sold?
- Did it disappear?
- Where is the seller?
- What condition is the item in?
- Did the price change?
I stopped designing for the page. I started designing for decisions.
State became explicit in the data. Each record tracked availability and price history. Ambiguous items were flagged for manual review.
Selectors are not the product. The state model is the product.
Clean data is easy to trust. Clean data without state is dangerous.
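Here is a minimal sketch of what an explicit state model could look like in Python. The names (Listing, Status, record_price, mark_missing) are hypothetical, chosen for illustration. It shows a record that carries availability, a price history, and a review flag instead of only a title and a price.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from enum import Enum
from typing import List, Optional, Tuple


class Status(Enum):
    LIVE = "live"
    SOLD = "sold"
    GONE = "gone"  # vanished without an explicit "sold" marker


@dataclass
class Listing:
    listing_id: str
    title: str
    status: Status = Status.LIVE
    seller_country: Optional[str] = None  # None means "unknown", not "missing field"
    condition: Optional[str] = None
    # (observed_at, price) pairs; a new entry is added only when the price changes
    price_history: List[Tuple[datetime, float]] = field(default_factory=list)
    needs_review: bool = False

    def record_price(self, price: float, observed_at: Optional[datetime] = None) -> None:
        """Append a price observation if it differs from the last known price."""
        observed_at = observed_at or datetime.now(timezone.utc)
        if not self.price_history or self.price_history[-1][1] != price:
            self.price_history.append((observed_at, price))

    def mark_missing(self) -> None:
        """The page is gone but nothing said "sold": keep the ambiguity visible."""
        self.status = Status.GONE
        self.needs_review = True
```

The design choice is that "unknown" and "ambiguous" are stored as values. A listing that vanished is not silently dropped and not assumed sold. It is kept and flagged.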
Check whether your marketplace scraper tracks these:
- Live items
- Sold items
- Disappearing items
- Seller country
- Item condition
- Price history
- Duplicate listings
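Several items on this list only show up when you compare one scrape run against the next. A rough sketch, assuming each run produces a {listing_id: price} mapping. The function and field names are hypothetical.

```python
from typing import Dict, List, NamedTuple


class SnapshotDiff(NamedTuple):
    new: List[str]
    disappeared: List[str]
    price_changed: List[str]


def diff_snapshots(previous: Dict[str, float], current: Dict[str, float]) -> SnapshotDiff:
    """Compare two {listing_id: price} snapshots from consecutive scrape runs."""
    # IDs seen now but not before
    new = sorted(current.keys() - previous.keys())
    # IDs seen before but not now; this does NOT by itself mean "sold"
    disappeared = sorted(previous.keys() - current.keys())
    # IDs present in both runs whose price differs
    price_changed = sorted(
        lid for lid in previous.keys() & current.keys()
        if previous[lid] != current[lid]
    )
    return SnapshotDiff(new, disappeared, price_changed)


diff = diff_snapshots(
    {"a": 10.0, "b": 20.0, "c": 5.0},
    {"b": 18.0, "c": 5.0, "d": 7.0},
)
print(diff.new, diff.disappeared, diff.price_changed)
```

A disappeared ID is a question, not an answer. Whether it was sold, removed, or just missed by the scraper needs another signal or a manual check.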
Have you ever shipped a tool with correct code but useless results?