Developers building data pipelines
Skip the BeautifulSoup boilerplate. Define fields, run via API, get clean JSON — same shape, every time.
pick a template or write CSS selectors, no code.
What this tool does
Hacker News, GitHub Trending, Product Hunt, Reddit, generic blog. Pick a template, swap the URL, click Run.
Define your own `{name, selector, attr?, multiple?}` fields. No XPath, no JavaScript — just CSS you already know.
HTTP for static pages (~1s), browser for SPAs (~5-10s), stealth for Cloudflare-protected sites (~10-15s).
Set an item_selector and we apply your fields per match — get an array of rows like a database query result.
Copy clean JSON to clipboard, download as CSV for Sheets, or view as an interactive table.
Registered as an MCP tool — Cursor, Claude Desktop, Continue all call it natively. No tool wrapping needed.
What you get
One row per matched item, with all your field names filled in. Same shape every time — easy to consume in any language.
Auto-detected columns, URL fields turn into clickable links. Toggle to JSON view for raw inspection.
Drop into Google Sheets, Notion databases, Airtable, or any analytics tool.
POST /api/extract for scripts. /mcp for AI agents. Both return the same JSON shape as the UI.
Who uses this
Skip the BeautifulSoup boilerplate. Define fields, run via API, get clean JSON — same shape, every time.
Our MCP server registers extract as a tool. Your agent calls it natively; you don't write a single line of scrape code.
Pull product cards from Amazon, posts from Reddit, repos from GitHub — into Sheets or Notion via CSV.
Track Hacker News, Product Hunt, dev.to daily — feed the JSON into your newsletter or dashboard.
How to use
Templates ship for Hacker News, GitHub Trending, Product Hunt, Reddit, and generic blogs. Or open the advanced editor and add your own `{name, selector}` pairs.
HTTP mode (~1 second) for static pages; Browser mode for JS-rendered SPAs; Stealth mode to defeat Cloudflare. Set a limit to cap how many items come back.
Toggle between table and JSON view, then copy to clipboard or download as JSON / CSV. Feeds straight into Sheets, Notion, Airtable, your database, or an LLM context window.
What people say
More tools
Free sitemap extractor — get every URL.
→ URL list
Scrape webpage to Markdown.
→ CONTENT — text + images
10-section SEO data extraction.
→ AUDIT — title / meta / schema / links
Free GEO/AEO audit — AI search readiness score.
→ GEO audit report
Design system extractor — colors, fonts, CSS.
→ design system (colors / fonts / CSS)
…or scroll down to write your own selectors