LLMs For Better Web Scraping

📅 4 days ago ⏱ 1 min read

I spent years writing scrapers. I used CSS selectors and regex. They worked until the website changed.

One layout update broke my code. I spent days fixing it. I lost the battle against changing HTML.

I tried a new way. I used LLMs. I stopped guessing selectors. Now I send the page text to the model.

My process is simple:

The results are better. The approach works across different layouts. The model recognizes prices and stock status without site-specific rules.
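Here is a minimal sketch of the idea of sending page text to a model instead of guessing selectors. It is an illustration, not my production code. The names `page_text`, `extract_product`, `PROMPT`, and the `ask_model` callable are hypothetical. `ask_model` stands in for whatever LLM client you use, and the three JSON keys are an assumed output shape.

```python
import json
from html.parser import HTMLParser
from typing import Callable


class _TextExtractor(HTMLParser):
    """Collect visible text and skip <script>/<style> contents."""

    def __init__(self) -> None:
        super().__init__()
        self._skip_depth = 0
        self.chunks: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def page_text(html: str) -> str:
    """Reduce a page to its visible text, one chunk per line."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)


# Hypothetical prompt; the key names are an assumption for this sketch.
PROMPT = (
    "Extract the product from the page text below. "
    'Reply with JSON only, using the keys "name", "price", "in_stock".\n\n'
    "{text}"
)

REQUIRED_KEYS = {"name", "price", "in_stock"}


def extract_product(html: str, ask_model: Callable[[str], str]) -> dict:
    """Send page text to the model and validate the JSON it returns.

    ask_model is a placeholder: any function that takes a prompt string
    and returns the model's reply as a string.
    """
    reply = ask_model(PROMPT.format(text=page_text(html)))
    data = json.loads(reply)  # raises ValueError if the reply is not JSON
    missing = REQUIRED_KEYS - data.keys()
    if missing:
        raise ValueError(f"model reply is missing keys: {sorted(missing)}")
    return data
```

No selector appears anywhere. If the layout changes, the visible text usually still contains the price and stock status. The validation step matters because a model can return malformed or incomplete JSON.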

There are trade-offs:

Choose your tool based on your needs:

I have not touched my code in three weeks. The LLM handles the fragile DOM for me.

How do you handle website changes? Do you use selectors or models?
