Get Sitemap

Retrieve a single sitemap by its ID, including its full configuration with selectors and start URLs.

Get Account

Retrieve account information including the user's email, name, and remaining page credits.

Get Data Quality

Retrieve the data quality report for a scraping job. The report indicates whether the scraped data meets the configured thresholds for record count, failed and empty page rates, and column fill rates.
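To make the threshold idea concrete, here is a minimal sketch of how such a check could be evaluated on the client side. The field names (`pages_total`, `pages_failed`, `pages_empty`, `column_fill_rates`), the `QualityThresholds` class, and the default values are all assumptions for illustration; the actual report format is not specified here.

```python
from dataclasses import dataclass


@dataclass
class QualityThresholds:
    # All names and defaults are illustrative assumptions, not documented values.
    min_record_count: int = 1
    max_failed_page_rate: float = 0.05   # fraction of pages that failed
    max_empty_page_rate: float = 0.05    # fraction of pages with no data
    min_column_fill_rate: float = 0.90   # fraction of non-empty cells per column


def check_quality(stats: dict, thresholds: QualityThresholds) -> dict:
    """Return a pass/fail flag per check plus an overall verdict."""
    total = stats["pages_total"]
    failed_rate = stats["pages_failed"] / total if total else 0.0
    empty_rate = stats["pages_empty"] / total if total else 0.0
    # Columns whose fill rate falls below the minimum.
    low_fill = sorted(
        column
        for column, rate in stats["column_fill_rates"].items()
        if rate < thresholds.min_column_fill_rate
    )
    checks = {
        "record_count": stats["record_count"] >= thresholds.min_record_count,
        "failed_pages": failed_rate <= thresholds.max_failed_page_rate,
        "empty_pages": empty_rate <= thresholds.max_empty_page_rate,
        "column_fill": not low_fill,
    }
    return {
        "passed": all(checks.values()),
        "checks": checks,
        "low_fill_columns": low_fill,
    }
```

A job passes only when every individual check passes, so a single sparsely filled column is enough to flag the run for review.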

Get Problematic URLs

Retrieve URLs that encountered problems during a scraping job, including empty pages, failed requests, and pages with no extracted values.

Delete Sitemap

Permanently delete a sitemap and all its associated configuration. This action cannot be undone.

Get Scraping Job

Retrieve the full status and statistics of a scraping job, including page counts, execution progress, and timing information.
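Because this tool reports execution progress, a common pattern is to poll it until the job reaches a terminal state. The sketch below assumes a caller-supplied `get_job` function that wraps the tool and returns a dict with a `status` key. Both that shape and the status names in `TERMINAL_STATUSES` are assumptions and may differ from the real values.

```python
import time
from typing import Callable

# Assumed terminal states; the real status vocabulary may differ.
TERMINAL_STATUSES = {"finished", "failed", "stopped"}


def wait_for_job(
    get_job: Callable[[int], dict],
    job_id: int,
    interval_s: float = 30.0,
    max_polls: int = 120,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Poll get_job until the job reaches a terminal status or polls run out."""
    for attempt in range(max_polls):
        job = get_job(job_id)
        if job.get("status") in TERMINAL_STATUSES:
            return job
        # Skip the final sleep so a timeout is reported without extra delay.
        if attempt < max_polls - 1:
            sleep(interval_s)
    raise TimeoutError(f"job {job_id} did not finish after {max_polls} polls")
```

Passing `sleep` as a parameter keeps the function easy to test without real delays.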

Delete Scraping Job

Permanently delete a scraping job and its associated data. This action cannot be undone.

List Scraping Jobs

List scraping jobs with pagination support. Optionally filter by sitemap ID or tag.
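As an illustration of consuming a paginated listing, the sketch below walks every page through a caller-supplied `list_jobs` function that wraps this tool. The parameter names (`page`, `sitemap_id`, `tag`) and the response keys (`data`, `last_page`) are assumptions about the response shape, not confirmed field names.

```python
from typing import Callable, Iterator, Optional


def iter_scraping_jobs(
    list_jobs: Callable[..., dict],
    sitemap_id: Optional[int] = None,
    tag: Optional[str] = None,
) -> Iterator[dict]:
    """Yield every job across all pages, applying the optional filters."""
    page = 1
    while True:
        response = list_jobs(page=page, sitemap_id=sitemap_id, tag=tag)
        yield from response["data"]
        # Stop once the last page has been consumed (assumed "last_page" key).
        if page >= response["last_page"]:
            break
        page += 1
```

Writing this as a generator lets a caller stop early, for example after finding the first job with a given tag, without fetching the remaining pages.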

Create Scraping Job

Execute a sitemap by creating a new scraping job. Configure the driver, proxy, and timing, and optionally override the sitemap's start URLs. The job begins processing once it is created; monitor it with the Get Scraping Job tool.
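The configuration options named above can be gathered into a single request object. The following is a sketch under stated assumptions: the helper `build_scraping_job_request`, the field names, the driver values `"fast"` and `"fulljs"`, and the millisecond defaults are hypothetical and should be checked against the tool's actual input schema.

```python
from typing import Optional, Sequence

# Assumed driver names; verify against the tool's input schema.
ALLOWED_DRIVERS = {"fast", "fulljs"}


def build_scraping_job_request(
    sitemap_id: int,
    driver: str = "fast",
    page_load_delay_ms: int = 2000,
    request_interval_ms: int = 2000,
    use_proxy: bool = False,
    start_urls: Optional[Sequence[str]] = None,
) -> dict:
    """Validate the inputs and assemble a job request (hypothetical field names)."""
    if driver not in ALLOWED_DRIVERS:
        raise ValueError(f"unknown driver: {driver!r}")
    if page_load_delay_ms < 0 or request_interval_ms < 0:
        raise ValueError("timing values must be non-negative")
    request = {
        "sitemap_id": sitemap_id,
        "driver": driver,
        "page_load_delay": page_load_delay_ms,
        "request_interval": request_interval_ms,
        "proxy": use_proxy,
    }
    # Only send start URLs when the caller wants to override the sitemap's own.
    if start_urls:
        request["start_urls"] = list(start_urls)
    return request
```

Leaving `start_urls` out of the request when no override is given keeps the sitemap's own start URLs in effect.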

Manage Scheduler

Enable, disable, or retrieve the cron-based scheduler for a sitemap. When enabled, scraping jobs run automatically at specified intervals. Use action "get" to view current settings, "enable" to configure and activate, or "disable" to turn off.
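The three actions described above take different inputs: only "enable" needs a schedule. A minimal sketch of that dispatch follows. It assumes the schedule is given as a standard five-field cron expression; the helper `build_scheduler_request` and the field names `schedule` and `timezone` are hypothetical.

```python
from typing import Optional

VALID_ACTIONS = ("get", "enable", "disable")
# Standard five-field cron order; the output key names are assumptions.
CRON_FIELDS = ("minute", "hour", "day_of_month", "month", "day_of_week")


def build_scheduler_request(
    sitemap_id: int,
    action: str,
    cron: Optional[str] = None,
    timezone: str = "UTC",
) -> dict:
    """Assemble a scheduler request; a schedule is attached only for 'enable'."""
    if action not in VALID_ACTIONS:
        raise ValueError(f"action must be one of {VALID_ACTIONS}, got {action!r}")
    request = {"sitemap_id": sitemap_id, "action": action}
    if action == "enable":
        parts = (cron or "").split()
        if len(parts) != len(CRON_FIELDS):
            raise ValueError("enable needs a five-field cron expression")
        request["schedule"] = dict(zip(CRON_FIELDS, parts))
        request["timezone"] = timezone
    return request
```

For example, `build_scheduler_request(7, "enable", cron="0 6 * * 1")` describes a run at 06:00 every Monday in standard cron notation, while the "get" and "disable" actions carry only the sitemap ID and the action name.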

Create Sitemap

Create a new sitemap that defines the structure and rules for scraping a website. A sitemap includes start URLs and a tree of CSS selectors that specify what data to extract from each page.
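To show what a tree of selectors can look like, here is a small sketch of a sitemap as a Python dict, plus a helper that reconstructs the tree from each selector's parent references. The key names (`_id`, `startUrl`, `selectors`, `parentSelectors`) and the selector type names are assumptions about the sitemap format; the URL and CSS selectors are placeholders.

```python
# Illustrative sitemap; key names and selector types are assumptions.
sitemap = {
    "_id": "example-products",
    "startUrl": ["https://example.com/products"],
    "selectors": [
        {"id": "product", "type": "SelectorElement",
         "parentSelectors": ["_root"], "selector": "div.product", "multiple": True},
        {"id": "name", "type": "SelectorText",
         "parentSelectors": ["product"], "selector": "h2", "multiple": False},
        {"id": "price", "type": "SelectorText",
         "parentSelectors": ["product"], "selector": "span.price", "multiple": False},
    ],
}


def selector_tree(sitemap: dict, parent: str = "_root", seen: frozenset = frozenset()) -> dict:
    """Build a nested dict of selector IDs from their parent references."""
    tree = {}
    for selector in sitemap["selectors"]:
        # Skip selectors already on this branch to avoid infinite recursion
        # when a selector lists itself (or an ancestor) as a parent.
        if parent in selector["parentSelectors"] and selector["id"] not in seen:
            tree[selector["id"]] = selector_tree(
                sitemap, selector["id"], seen | {selector["id"]}
            )
    return tree
```

Here `name` and `price` are extracted once per `product` element, which is what gives the scraped output one record per product.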

Update Sitemap

Update an existing sitemap's configuration including its name, start URLs, and selectors. The full sitemap definition must be provided.

List Sitemaps

List all sitemaps in your account with pagination support. Optionally filter by tag name.

Download Scraped Data

Download the scraped data from a completed scraping job. Returns data as JSON records or raw CSV text based on the chosen format.
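Since the result can arrive in two shapes, downstream code often normalizes both into a list of dicts. The sketch below assumes that the JSON format yields a list of record dicts and that the CSV format yields a single string with a header row. The helper name `to_records` is hypothetical.

```python
import csv
import io
from typing import Union


def to_records(data: Union[list, str], fmt: str) -> list:
    """Normalize downloaded data into a list of dicts, one per scraped record."""
    if fmt == "json":
        # Assumed: JSON results are already a list of record dicts.
        return [dict(record) for record in data]
    if fmt == "csv":
        # Drop a leading byte-order mark if present, then parse using the header row.
        text = data.lstrip("\ufeff")
        return list(csv.DictReader(io.StringIO(text, newline="")))
    raise ValueError(f"unsupported format: {fmt!r}")
```

Note that CSV parsing yields every value as a string, so numeric columns need explicit conversion, whereas JSON records may preserve their types.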

Webscraper Io
