Built by Metorial, the integration platform for agentic AI.
Retrieve the fully rendered HTML of a web page after JavaScript execution. Provide a URL to navigate to, or supply raw HTML to render in the browser. Returns the complete DOM including dynamically generated content from single-page applications.
Run a Google Lighthouse audit on a web page to analyze performance, accessibility, SEO, best practices, and PWA metrics. Filter by categories or specific audits to reduce response size. Returns audit scores and detailed metrics.
Extract structured data from a fully rendered web page using CSS selectors. The page is loaded in a real browser with JavaScript execution, then elements matching your selectors are extracted with their text content, inner HTML, attributes, and bounding box positions.
Generate a PDF from a web page or raw HTML. Navigate to a URL or render provided HTML in a headless browser and export it as a PDF. Supports page formatting options including paper size, margins, headers/footers, background printing, and landscape orientation. Returns the PDF bytes as a Slate attachment with metadata in the tool output.
Capture a screenshot of a web page or rendered HTML. Supports full-page captures, custom viewports, clipping regions, and multiple image formats (PNG, JPEG, WebP). Returns the image bytes as a Slate attachment with metadata in the tool output.
Perform a web search and optionally scrape each result page. Returns structured search results with titles, URLs, and descriptions. When scraping is enabled, each result URL is fetched and processed into clean markdown, raw HTML, or other formats. Supports multiple sources (web, news, images), language filtering, time-based filtering, and category filters.
Access a protected web page by bypassing basic bot detection mechanisms. Returns the page content, cookies, and/or a screenshot after unblocking. Useful for sites that block standard headless browser requests. For sites with advanced protections or interactive CAPTCHAs, BrowserQL may be needed instead.
Execute custom JavaScript/Puppeteer code in a headless browser context. Browserless sets up a browser and page, then runs your code with access to the Puppeteer `page` object. Use this for multi-step browser interactions like navigating, filling forms, clicking buttons, and extracting data within a single request.
Start, inspect, list, or cancel Browserless Crawl jobs. Crawls asynchronously discover site URLs and scrape pages into structured, LLM-ready results.
Discover URLs on a website with Browserless Map. Returns a deduplicated list of pages with optional title and description metadata, with search relevance, sitemap behavior, geo-targeting, and URL filtering controls.
Scrape a URL with Browserless Smart Scrape. Browserless automatically escalates from fast HTTP fetching to proxying, headless browser rendering, and page-gating CAPTCHA solving as needed. Returns requested HTML, markdown, links, and optional file attachments for screenshots or PDFs.
Fetch a URL through Browserless and return its native content type as a Slate attachment. Use this to download unknown file types, PDFs, images, rendered HTML, or a ZIP containing a page and its linked resources.
Run custom Puppeteer code and return the file that Chrome downloads during execution. Use this for sites where the target file is produced after clicks, form interactions, or client-side Blob creation.