Playground Sign in Start free
Blog

Notes from the team.

Practical writeups on web scraping, anti-bot tactics, structured extraction, SERPs and the messy realities of building data pipelines for AI.

A Simple Rule-based Scraper using Puppeteer's native methods

In our previous article [https://ujeebu.com/blog/simple-puppeteer-based-scraper-rule-based-extraction/] of the Puppeteer series we implemented a rule-based scraper based on headless Chrome using Puppeteer. We injected our scraping functions into the browser's context (window) then used those to execute scraping scenarios inside the browser. In this article we will try to achieve the same thing, but this time using Puppeteer's methods without injecting functions into the browser's context. Rew

Sam Mar 1, 2024 9 min read

Is Web Scraping Legal?

The issues of legality and ethics surrounding web scraping are a massive grey area. While some may be in favor of web scraping, others might not share the same enthusiasm. This is what makes the subject so controversial.

Sam Dec 23, 2022 8 min read