Playground Sign in Start free
Use case · Lead extraction

Pull contact data from any site. No selectors needed.

Names, emails, phones, titles — extracted from directories, LinkedIn, conference sites, and company team pages with a single API call. AI handles the layout differences; you handle the outreach.

5,000 free credits · no card · failed requests not billed
The challenge

The lead extraction challenge.

Contact pages are deliberately heterogeneous. Anti-bot is real. Custom selectors break every time the layout updates. AI extraction sidesteps all of it.

Scattered contact data

Names, emails, and phones spread across page elements, nested in tables, hidden behind JS, embedded in PDFs.

Anti-scraping measures

Directories deploy CAPTCHAs, rate limits, bot detection, IP blocks — naive scrapers stop working hours in.

Inconsistent layouts

Every contact page is different. CSS selectors that work on one site break on the next. Rule-based extraction is fragile.

Scaling extraction

Thousands of prospect pages = concurrency management, retry logic, quality monitoring. The infra ends up bigger than the scraper.

Use cases

How teams use AI-powered lead extraction.

Directory contacts

Pull listings from every directory.

Business names, addresses, phones, emails, categories from Yellow Pages, Yelp, Google Maps, and niche industry directories. AI handles varied formats — no per-site selectors.

Business outcomes
  • AI Scraper prompt: extract business name, address, phone, email, category
  • JSON schema enforces consistent output across directories
  • Stealth mode + proxy rotation bypass anti-bot
  • Pagination handling for full listing results
LinkedIn profiles

Structured profile data without parsers.

Names, titles, companies, locations, work history. Pre-built LinkedIn templates emit clean JSON — no custom code per page type.

Business outcomes
  • Pre-built templates for profile / company / search pages
  • Browser fingerprinting + stealth avoid detection
  • Structured JSON: name, title, company, location
  • Rate limiting + session management for compliant scale
Team pages

Extract company contacts from team pages.

Submit team, about, and people pages (from a sitemap or your own URL list) and AI Scraper extracts names, titles, emails, and photos from each.

Business outcomes
  • Feed /about, /team, /people URLs — one per AI Scraper call
  • AI prompt: name, title, email, photo
  • Discover URLs via sitemap or SERP API queries
  • JS rendering covers SPAs and lazy-loaded team blocks
Event attendees

Speakers, sponsors, and attendees.

Pull speaker lists, sponsor contacts, attendee directories from conference sites and event platforms. AI-powered extraction handles every event-site layout variation.

Business outcomes
  • AI extraction of speaker names + titles + companies + bios
  • Extract from sponsor + exhibitor pages for company contacts
  • Schema validation for consistent output across events
  • CAPTCHA solving for gated attendee directories
Sources

Extract from any site.

AI-powered extraction adapts to any layout — no per-site config required.

LinkedIn

Profile data — names, titles, company, work history. Pre-built templates with stealth browsing.

Google Maps

Business listings: names, addresses, phones, websites, ratings, categories.

Yelp

Business contact info, categories, ratings, location data from listings and search results.

Industry directories

Niche directories — Clutch, G2, Capterra, trade-association member lists.

Company websites

Team, about, and contact pages across any company site.

Conference sites

Speakers, sponsors, attendee directories from event platforms.

Professional associations

Member directories from bar associations, medical boards, trade groups.

Review platforms

Clutch, G2, Trustpilot company profiles + contact data.

Start extracting No credit card required.
How it works

Three steps to structured lead data.

1

Configure extraction request

Send target URL to AI Scraper with a natural-language prompt for the contact fields. Define a JSON schema for consistent output. For multi-page directories, submit each page URL — discover them from the sitemap or via the SERP API first.

2

AI extracts contacts

Page renders in stealth browser, CAPTCHAs solved automatically, LLM identifies and extracts contact fields. No CSS selectors. Names, emails, phones, titles, companies into structured JSON.

3

Receive structured data

Clean, schema-validated JSON. Extraction metadata. Validation warnings flag missing fields. Direct CRM import via API or CSV export.

Try it

Try lead extraction in the playground.

Drop any directory or contact page URL and see structured output.

curl 'https://api.ujeebu.com/ai-scraper' \
  -H 'ApiKey: YOUR_API_KEY' \
  -d '{
    "url": "https://www.yelp.com/biz/blue-bottle-coffee-san-francisco",
    "prompt": "extract business name, address, phone, email, category, hours",
    "schema": {
      "type": "object",
      "properties": {
        "name": { "type": "string" },
        "phone": { "type": "string" },
        "email": { "type": "string" }
      },
      "required": ["name"]
    }
  }'
No API key required for testing in the playground. Powered by /ai-scraper
Features

Built for production lead extraction.

AI-powered detection

LLMs identify and extract contact info from any layout. No CSS selectors or XPath. Natural-language prompts work across thousands of sites.

Email pattern recognition

Mailto links, obfuscated text ("name [at] company [dot] com"), JS-rendered emails, contact forms. Visible + hidden email patterns covered.

Phone extraction

International formats: local notation, country codes, extensions, tel: links. AI recognises phones across headers, footers, contact sections.

Multi-page extraction

Submit each page URL — or pull URLs from a sitemap or SERP API discovery — then run AI Scraper across the list as an async batch job.

LinkedIn templates

Pre-built templates for profile, company, and search pages. Standardised fields. Stealth mode avoids detection.

Schema-validated JSON

Define JSON schema → enforced output structure across all sources. Validation warnings on missing fields. Direct CRM import.

FAQ

Frequently asked.

How does AI extraction differ from CSS-selector scraping?
Traditional scraping needs hand-written selectors that break on every redesign. AI extraction uses an LLM that understands page content semantically. You send a natural-language prompt ("extract names, emails, phones") and the AI identifies the data regardless of HTML structure. One prompt works across thousands of sites without per-site config or maintenance.
Can I extract from sites that block scraping?
Yes. The AI Scraper includes stealth mode with browser fingerprinting, automatic CAPTCHA solving, and rotating proxies. Pages render in a real browser environment indistinguishable from a regular user. For aggressive bot detection, enable premium residential proxies. CAPTCHA solving is on by default.
How do I extract a multi-page directory?
Build a list of listing URLs — from the site’s sitemap, by paginating known URL patterns, or by running SERP API queries to discover them — then submit the list as an async batch job to AI Scraper with your extraction prompt + schema. Monitor via status endpoint; download all results on completion as JSON/CSV/ZIP.
Output format?
Structured JSON. Define a JSON schema to enforce specific fields (name, email, phone, title, company). Nested objects and arrays supported. Missing fields are omitted and flagged in validation_warnings. Direct CRM/database import or programmatic processing. Async batch results can also download as ZIP.
How accurate is AI extraction?
Depends on source-page quality and prompt clarity. For well-structured pages (directories, team pages), accuracy is high. Improve via detailed prompts naming exact fields, JSON schema to enforce structure, temperature 0.0 (default) for deterministic output. Schema validation auto-retries once if output doesn’t match.
Credit cost?
AI Scraper: 15 credits with rotating proxies (default), 20 with advanced, 25 with premium. +5 credits when a CAPTCHA is solved. Async batch jobs charge per URL extracted. auto_proxy starts cheap and escalates only when needed — optimises credit spend automatically.
5,000 free credits to start.
No credit card. Failed requests cost zero.
Start free

Start extracting leads today.

Use AI-powered extraction to pull structured contact data from any site in minutes.

No credit card required.