Built by Metorial, the integration platform for agentic AI.


    Provider Summary

    • extract web page data

    • search knowledge graph entities

    • enrich organization records

    • enrich person records

    • analyze text for entities

    • extract sentiment from text

    • crawl websites at scale

    • bulk process URL lists

    • extract product pricing details

    • classify web page types

Diffbot

Extract structured data from web pages using AI-powered computer vision and NLP. Automatically classify and parse articles, products, events, jobs, discussions, images, and videos into clean JSON. Search a knowledge graph of 10+ billion entities (organizations, people, articles, products) using Diffbot Query Language (DQL). Enrich person and organization records from minimal identifiers. Analyze natural language text to extract entities, facts, sentiment, and relationships. Crawl entire websites or process bulk URL lists through extraction APIs. Define custom extraction rules with CSS selectors for specific domains. Receive webhook notifications when crawl or bulk jobs complete.
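As a rough illustration of the page extraction and knowledge graph search described above, the sketch below builds request URLs without sending them. It assumes Diffbot's public v3 endpoints (`https://api.diffbot.com/v3/analyze` and `https://kg.diffbot.com/kg/v3/dql`) and their `token`, `url`, `type`, `query`, and `size` query parameters. Those details, along with the helper names `build_analyze_url` and `build_dql_url`, are not taken from this integration and should be checked against Diffbot's current documentation.

```python
from urllib.parse import urlencode

# Assumed endpoints from Diffbot's public v3 API; not confirmed by this integration.
ANALYZE_ENDPOINT = "https://api.diffbot.com/v3/analyze"
DQL_ENDPOINT = "https://kg.diffbot.com/kg/v3/dql"


def build_analyze_url(token: str, page_url: str) -> str:
    """Build a URL asking Diffbot to classify and extract a single web page."""
    # urlencode percent-encodes the page URL so it survives as a query value.
    return f"{ANALYZE_ENDPOINT}?{urlencode({'token': token, 'url': page_url})}"


def build_dql_url(token: str, query: str, size: int = 10) -> str:
    """Build a URL for a knowledge graph search using a DQL query string."""
    params = {"token": token, "type": "query", "query": query, "size": size}
    return f"{DQL_ENDPOINT}?{urlencode(params)}"


if __name__ == "__main__":
    # "YOUR_TOKEN" is a placeholder; a real API token is required to send requests.
    print(build_analyze_url("YOUR_TOKEN", "https://example.com/post"))
    print(build_dql_url("YOUR_TOKEN", 'type:Organization name:"Diffbot"', size=1))
```

The helpers only assemble URLs, so they can be inspected or logged before any request is made with an HTTP client of your choice.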

License

This integration is licensed under the AGPL-3.0 License.
