Overcoming Web Scraping Blocks: How IP Classification and CGNAT Affect Your Scraping Strategy

Web servers use various techniques to mitigate scraping attempts, including IP classification and identifying data center or suspicious traffic. Understanding how IP addresses are classified and how technologies like CGNAT (Carrier-Grade NAT) work is critical for overcoming these challenges.

Sam Sep 16, 2024 11 min read

Content Extraction Web Scraping

How to Scrape TikTok: A Comprehensive Guide

The TikTok API has several restrictions that limit what data you can access and how frequently you can query it. For this reason, web scraping becomes a viable solution, as long as it is done in compliance with TikTok’s Terms of Service.

Sam Sep 12, 2024 4 min read

Web Scraping Business Intelligence Product Price Monitoring

Web Scraping: An Essential Tool for Business Intelligence

One of the most powerful resources available to businesses nowadays is web scraping, an automated technique for extracting substantial amounts of publicly accessible data from online sources.

Sam Sep 9, 2024 6 min read

Mastering Web Scraping Proxies: The Complete Guide

Whether you're a beginner or looking to refine your web scraping skills, this guide will provide you with knowledge of web scraping proxies. Read now!

Vishesh Nagpal Sep 6, 2024 9 min read

How To Use Python and Beautiful Soup For Web Scraping

Data is the new gold. With the rise of AI and machine learning applications, this statement has never been more accurate. To extract the value from this data goldmine, businesses need robust tools to mine it, process it, and prepare it for actionable insights. Being data-conscious empowers organizations to move beyond intuition, leveraging concrete evidence and thorough analysis to inform their decisions. This approach enhances understanding of market dynamics, customer behaviors, and operatio

Manpreet Nagpal Aug 30, 2024 9 min read

How To Scrape Stock Market Data using Python

Stock market data is vital for traders, investors, and analysts looking to make informed decisions. Historical and real-time data on stock prices, trading volumes, financial ratios, and other metrics can provide valuable insights into a company's performance, help predict future stock movements, guide investment strategies, and even feed automated trading systems. This article will guide you through scraping stock market data using Python, with Yahoo Finance as our example website. Python has

Manpreet Nagpal Aug 28, 2024 6 min read

Content Extraction Web Scraping Amazon

Step-by-Step Guide to Scraping Amazon Product Data

Uncover the secrets of scraping Amazon product data with our comprehensive step-by-step guide. Scrape data without getting blocked with Ujeebu. Read more!

Manpreet Nagpal Jul 29, 2024 14 min read

Puppeteer

Puppeteer based Simple Data Scraper: Advanced Options

In this article, we show how Puppeteer's advanced capabilities can make our scraper better equipped to handle real-world use cases. Specifically, we explore options such as controlling page load behavior, HTTP authentication, adding extra headers, and changing the user agent.

Sam Mar 22, 2024 7 min read

Content Extraction Puppeteer

A Simple Rule-based Scraper using Puppeteer's native methods

In our previous article [https://ujeebu.com/blog/simple-puppeteer-based-scraper-rule-based-extraction/] in the Puppeteer series, we implemented a rule-based scraper on headless Chrome using Puppeteer. We injected our scraping functions into the browser's context (window), then used them to execute scraping scenarios inside the browser. In this article, we will try to achieve the same thing, but this time using Puppeteer's own methods without injecting functions into the browser's context. Rew

Sam Mar 1, 2024 9 min read

Content Extraction Puppeteer Web Scraping

Simple Puppeteer-based Scraper: Rule based extraction

In this article, we show how to scrape any website with a given set of rules using the Puppeteer library.

Sam Apr 23, 2023 10 min read

Content Extraction Puppeteer Web Scraping

A Simple Scraper using Puppeteer

Web scraping is the process of extracting data from websites. One popular library for web scraping is Puppeteer. Puppeteer is a Node.js library that provides a high-level API to control headless Chrome or Chromium over the DevTools Protocol.

Sam Jan 29, 2023 6 min read

Content Extraction Web Scraping

Is Web Scraping Legal?

The issues of legality and ethics surrounding web scraping are a massive grey area. While some may be in favor of web scraping, others might not share the same enthusiasm. This is what makes the subject so controversial.

Sam Dec 23, 2022 8 min read