7 Comments
User's avatar
Alex Willen's avatar

Hey all, author of this post here — I hope folks found this useful! If you have any questions about how to set something like this up for yourself, just @ mention me in the AIBMM chat, and I'd be happy to help.

Daria Cupareanu's avatar

Glad to have you featured & thanks for sharing a practical build from your business with the AIBMM community!

CP's avatar

Although I'm fond of web scraping, yet very fearful of the consequences of it, as web scraping is kind of mining gold, but no one wants to hand it over that easily.

I'm not sure about small, but almost every big player has set their guardrails around it to stop it and I'm pretty much sure that it has been raised now days, due to AI capability of doing almost anything.

What guardrails do you use before scraping these big players? or do you have any guide which tells what to take care of before doing or building any tools like this.

Alex Willen's avatar

I’d echo what Daria said and just add that in this case I’m not using a logged in Amazon account. I do fear that I could get in trouble and have issues with my Seller Central account if I was authenticated, but this way the worst thing they’ll do is block me from accessing pages.

Daria Cupareanu's avatar

I’d think of it more as collecting public information from pages you’re already allowed to access, not bypassing protections or mining platforms at scale.

Ofc there are big players that are not fond of scraping (LinkedIn comes to mind), so you have to be careful and look for safe approaches, official APIs, exports, or other allowed ways to get the data when possible.

Alex also covered a lot of this in the limitations section, including guardrails and things to consider before building something like this.

AI Meets Girlboss's avatar

You blow my mind every single week Daria 🩷🦩

Daria Cupareanu's avatar

Thanks so much, Pinkie 💜 this one is on Alex 💪🏼