Playground Sign in Start free
Auto Extract · Beta

Any URL in. Structured JSON out.

Point Auto Extract at any web page and get back clean, structured JSON: no selectors, no rules, no config. It detects the page type, generates the extraction schema with an LLM, and adapts on its own to articles, products, listings, profiles and more.

10
credits / call
0
selectors to write
4+
page types detected
POST /extract
POST /extract
{ "url": "https://blog.example.com/build-a-web-scraper" }

→ 200 OK · 10 credits
{
  "success":   true,
  "url":       "https://blog.example.com/build-a-web-scraper",
  "page_type": "article",
  "data": {
    "title":          "How to Build a Web Scraper",
    "author":         "John Doe",
    "published_date": "2024-01-15",
    "content":        "This comprehensive guide will teach you...",
    "tags":           ["web scraping", "tutorial", "python"],
    "reading_time":   "8 min read"
  }
}
Live playground

See it work, before you sign up.

Drop in a URL, run a real call against the live API, and watch the JSON come back in about a second. No API key required.

No selectors, ever

Skip the CSS-selector archaeology. Auto Extract reads the page, works out its structure with an LLM, and returns typed JSON. You supply a URL, and that is the whole integration.

It knows the page type

Article, product, listing, profile, recipe, research paper: Auto Extract recognises what it is looking at and returns the right shape. The response tells you which via a page_type field.

Rendering & anti-bot built in

JavaScript rendering, auto_proxy escalation (datacenter → residential → premium → managed-Selenium) and CAPTCHA solving are on by default. Tough sites just work, with no extra flags to go hunting for.

Rules cached, calls fast

The first page of a template generates the extraction rules; every page after reuses them, fast and consistent. Pass force_refresh to regenerate when a site changes.

Bring your own HTML

Already have the HTML? Send it alongside the URL and Auto Extract skips the fetch step entirely, handy when the page is already in hand.

Charged only on success

Failed fetches are free; credits are billed per successful extraction and returned in the Ujb-credits response header. No surprises.

Drop-in code

Copy. Paste. Ship.

curl -X POST https://api.ujeebu.com/extract \
  -H "Content-Type: application/json" \
  -H "ApiKey: $UJEEBU_KEY" \
  -d '{"url": "https://shop.example.com/p/wireless-headphones"}'
import os, requests

r = requests.post(
    "https://api.ujeebu.com/extract",
    headers={"ApiKey": os.environ["UJEEBU_KEY"]},
    json={"url": "https://shop.example.com/p/wireless-headphones"},
)
res = r.json()
print(res["page_type"], res["data"])
const res = await fetch(
  "https://api.ujeebu.com/extract",
  {
    method: "POST",
    headers: { "Content-Type": "application/json", ApiKey: process.env.UJEEBU_KEY },
    body: JSON.stringify({ url: "https://shop.example.com/p/wireless-headphones" }),
  },
);
const { page_type, data } = await res.json();
body := strings.NewReader(`{"url":"https://shop.example.com/p/wireless-headphones"}`)
req, _ := http.NewRequest("POST", "https://api.ujeebu.com/extract", body)
req.Header.Set("Content-Type", "application/json")
req.Header.Set("ApiKey", os.Getenv("UJEEBU_KEY"))
res, _ := http.DefaultClient.Do(req)
What people build with it

Real things real teams shipped this quarter.

Articles & news at scale

Feed any blog or news URL and get title, author, date, body and tags back, clean and model-ready for RAG, summarisation or archival. No per-site tuning, ever.

E-commerce product data

Point it at product pages across any store and get name, price, rating, availability and description in one consistent shape, built for price monitoring and catalogue building.

Directories & listings

Listing pages come back as an items array with a total_count: businesses, jobs, classifieds. One call turns a directory page into rows.

Feed agents the live web

Give an AI agent a single URL-to-JSON tool it can call on sites it has never seen. Auto Extract handles the structure so the agent can reason about clean data.

Ship Auto Extract tonight.

5,000 credits free. No card. Real residential proxies on the free tier.