Built by Metorial, the integration platform for agentic AI.
Retrieve a single sitemap by its ID, including its full configuration with selectors and start URLs.
Retrieve account information including the user's email, name, and remaining page credits.
Retrieve the data quality report for a scraping job. Reports whether the scraped data meets configurable thresholds for record count, failed/empty page rates, and column fill rates.
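The kind of threshold check this report describes can be sketched as follows. This is only an illustration: the `Thresholds` fields, their default values, and the `check_quality` function are hypothetical and do not reflect the service's actual schema or defaults.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Thresholds:
    # Hypothetical limits; the real thresholds are configured in the service.
    min_records: int = 1
    max_failed_pct: float = 5.0
    max_empty_pct: float = 5.0
    min_column_fill_pct: float = 80.0


def check_quality(
    record_count: int,
    pages_total: int,
    pages_failed: int,
    pages_empty: int,
    column_fill_pct: Dict[str, float],
    thresholds: Optional[Thresholds] = None,
) -> Dict[str, bool]:
    """Return a mapping of check name to True (passed) or False (failed)."""
    t = thresholds or Thresholds()
    # Guard against division by zero when no pages were visited.
    failed_pct = 100.0 * pages_failed / pages_total if pages_total else 0.0
    empty_pct = 100.0 * pages_empty / pages_total if pages_total else 0.0
    return {
        "min_record_count": record_count >= t.min_records,
        "max_failed_pages": failed_pct <= t.max_failed_pct,
        "max_empty_pages": empty_pct <= t.max_empty_pct,
        "min_column_fill": all(
            pct >= t.min_column_fill_pct for pct in column_fill_pct.values()
        ),
    }
```

A job passes overall only if every value in the returned mapping is true.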
Retrieve URLs that encountered problems during a scraping job, including empty pages, failed requests, and pages with no extracted values.
Permanently delete a sitemap and all its associated configuration. This action cannot be undone.
Retrieve the full status and statistics of a scraping job, including page counts, execution progress, and timing information.
Permanently delete a scraping job and its associated data. This action cannot be undone.
List scraping jobs with pagination support. Optionally filter by sitemap ID or tag.
Execute a sitemap by creating a new scraping job. Configure the driver, proxy, and timing, and optionally override the start URLs. The job then begins processing and can be monitored with the Get Scraping Job tool.
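Monitoring a job after creating it usually means polling its status until it reaches a terminal state. The sketch below assumes a caller-supplied `get_status` function that wraps the Get Scraping Job tool and returns a status string; the status names in `TERMINAL` are hypothetical placeholders.

```python
import time
from typing import Callable

# Hypothetical terminal status names; check the real job status values.
TERMINAL = {"finished", "failed", "stopped"}


def wait_for_job(
    get_status: Callable[[int], str],
    job_id: int,
    poll_seconds: float = 5.0,
    max_polls: int = 120,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll get_status(job_id) until a terminal status or max_polls is reached."""
    for _ in range(max_polls):
        status = get_status(job_id)
        if status in TERMINAL:
            return status
        sleep(poll_seconds)
    raise TimeoutError(f"job {job_id} not finished after {max_polls} polls")
```

Passing `sleep` as a parameter keeps the loop easy to test without real delays.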
Enable, disable, or retrieve the cron-based scheduler for a sitemap. When enabled, scraping jobs run automatically at specified intervals. Use action "get" to view current settings, "enable" to configure and activate, or "disable" to turn off.
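The three actions can be pictured as a small request builder. This is a sketch under assumptions: the `cron` and `timezone` parameter names and the five-field cron string are illustrative only, and the tool's real input schema may differ.

```python
from typing import Dict, Optional

VALID_ACTIONS = ("get", "enable", "disable")


def build_scheduler_request(
    action: str, cron: Optional[str] = None, timezone: str = "UTC"
) -> Dict[str, str]:
    """Build a hypothetical scheduler request for one of the three actions."""
    if action not in VALID_ACTIONS:
        raise ValueError(f"action must be one of {VALID_ACTIONS}")
    if action == "enable":
        # Assumes a classic five-field cron expression:
        # minute hour day-of-month month day-of-week
        if cron is None or len(cron.split()) != 5:
            raise ValueError("enable requires a five-field cron expression")
        return {"action": "enable", "cron": cron, "timezone": timezone}
    # "get" and "disable" need no schedule settings.
    return {"action": action}
```

For example, `build_scheduler_request("enable", cron="0 6 * * 1")` would describe a run every Monday at 06:00 in the given time zone.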
Create a new sitemap that defines the structure and rules for scraping a website. A sitemap includes start URLs and a tree of CSS selectors that specify what data to extract from each page.
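To make "start URLs and a tree of CSS selectors" concrete, here is a minimal sketch of a sitemap as a Python dictionary, with a helper that reconstructs the selector tree from parent references. The field names (`_id`, `startUrl`, `selectors`, `parentSelectors`, and the `type` values) are assumptions for illustration, not the tool's confirmed schema.

```python
from typing import Any, Dict

# Hypothetical sitemap: one element selector with two child text selectors.
sitemap: Dict[str, Any] = {
    "_id": "example-shop",
    "startUrl": ["https://example.com/products"],
    "selectors": [
        {"id": "product", "parentSelectors": ["_root"],
         "type": "SelectorElement", "selector": "div.product", "multiple": True},
        {"id": "title", "parentSelectors": ["product"],
         "type": "SelectorText", "selector": "h2", "multiple": False},
        {"id": "price", "parentSelectors": ["product"],
         "type": "SelectorText", "selector": "span.price", "multiple": False},
    ],
}


def build_tree(sm: Dict[str, Any], parent: str = "_root") -> Dict[str, Any]:
    """Return nested dicts of selector ids under the given parent.

    Assumes the parent references contain no cycles.
    """
    return {
        s["id"]: build_tree(sm, s["id"])
        for s in sm["selectors"]
        if parent in s["parentSelectors"]
    }
```

Here `build_tree(sitemap)` nests `title` and `price` under `product`, which mirrors how each product element on the page yields one record with two columns.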
Update an existing sitemap's configuration, including its name, start URLs, and selectors. The full sitemap definition must be provided.
List all sitemaps in your account with pagination support. Optionally filter by tag name.
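Paginated listings like this one are typically consumed by requesting pages until the last one is reached. The generator below is a sketch, assuming a caller-supplied `fetch_page` function that returns the items on a page together with the number of the last page; both the function and that return shape are hypothetical.

```python
from typing import Callable, Iterator, List, Tuple


def iter_all_pages(
    fetch_page: Callable[[int], Tuple[List[dict], int]]
) -> Iterator[dict]:
    """Yield every item from a paginated listing, starting at page 1."""
    page = 1
    while True:
        items, last_page = fetch_page(page)
        yield from items
        # Stop once the current page is the last one (or the listing is empty).
        if page >= last_page:
            break
        page += 1
```

A tag filter, where supported, would be applied inside `fetch_page` so the loop itself stays unchanged.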
Download the scraped data from a completed scraping job. Returns data as JSON records or raw CSV text based on the chosen format.
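Either output format can be normalized into a list of dictionaries for further processing. The sketch below uses only the Python standard library and assumes that the JSON variant arrives as a single JSON array and that the CSV text has a header row; the `parse_export` name is hypothetical.

```python
import csv
import io
import json
from typing import Dict, List


def parse_export(payload: str, fmt: str) -> List[Dict[str, str]]:
    """Parse downloaded text into a list of records keyed by column name."""
    if fmt == "json":
        # Assumes the payload is one JSON array of objects.
        return json.loads(payload)
    if fmt == "csv":
        # The first row is treated as the header; all values stay strings.
        return list(csv.DictReader(io.StringIO(payload)))
    raise ValueError("fmt must be 'json' or 'csv'")
```

Note that CSV values come back as strings, so numeric columns need explicit conversion after parsing.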