diffbot.com
Web Crawler API: Crawlbot
https://www.diffbot.com/products/crawlbot
Crawlbot is smart spidering. Crawlbot uses any Diffbot API to extract data from entire sites. Whether for product prices, historical weather, content migration or even three years of Hacker News archives. Crawlbot creates a structured index of practically any site's data. Sign up for a Diffbot plan. Use our Analyze API. To automatically find and extract all article, product or other supported pages. All crawls are instantly searchable using our Search API. Subscribers to our Plus. Yup, it's got an API.
diffbot.com
Article API: clean text extraction from news articles and blog posts
https://www.diffbot.com/products/automatic/article
The Article API automatically extracts clean text from news articles and blog posts—returning normalized HTML and plaintext, author and date information, related images/videos and more from any article on any site. Sign up for a Diffbot plan. Test Drive the Article API. Please enter a URL to test. Diffbot's Article API has been the overwhelming winner in quality shootouts since anyone thought to start testing such things (in 2011). Compare text-extraction methods. Pair the Article API with Crawlbot.
diffbot.com
Discussion API: Automatic comment extraction, review extraction and forum extraction
https://www.diffbot.com/products/automatic/discussion
Diffbot's Discussion API structures the full content of forum threads, article comments, product reviews and more. Sign up for a Diffbot plan. Test Drive the Discussion API. Please enter a URL to test. Like all of Diffbot's Automatic APIs, the Discussion API needs no rules or training. Send it any page containing a discussion and let Diffbot do the rest. Get All the Pages. Long forum thread spanning multiple pages? No problem. Use the. Argument to automatically concatenate as many pages as you need.
diffbot.com
Video API: automatic video extraction from web pages
https://www.diffbot.com/products/automatic/video
Diffbot's Video API extracts detailed information from video-specific pages. Sign up for a Diffbot plan. Test Drive the Video API. Please enter a URL to test. Like all Diffbot's Automatic APIs, the Video API works right out of the box, with no need for rules or training. Get the Raw Bits. Where possible Diffbot extracts the raw source content in addition to embeddable HTML. Extract web pages as structured data. No rules required.
diffbot.com
About Diffbot
https://www.diffbot.com/company
We Structure the World's Knowledge. Diffbot is a team of AI engineers building a universal database of structured information, to provide knowledge as a service to all intelligent applications. Whether you are building an app that uses web content, an enterprise business application, or a smart robotic assistant, we've got you covered! See all coverage and official releases in News and Press. The New York Times: The Race Is On to Control Artificial Intelligence, and Tech’s Future. BBC: An AI That Can Read.
diffbot.com
Web Data Extraction and Web Crawling APIs - Diffbot
https://www.diffbot.com/products
Using AI, computer vision, machine learning and natural language processing, Diffbot provides software developers with tools to extract and understand objects from any web page. Diffbot's Automatic APIs automatically extract content from supported page types: articles, products, discussions, images and more. Diffbot uses advanced AI technology to retrieve clean, structured data without need for manual rules or site-specific training. Test our Automatic APIs. Crawlbot and Bulk Processing.
diffbot.com
Custom Web Extraction APIs - Diffbot
https://www.diffbot.com/products/custom
Use the Custom API Toolkit to override or add fields to our Automatic APIs. Or create completely customized APIs of your own. To get started: grab a token, then log-in to the Developer Dashboard. With a free token you can correct APIs as much as you need and write up to five Custom API rules. Sign up for a Diffbot plan. Want more than what's extracted in our Article API. Add or edit the fields our computer-vision engine automatically returns. The World Is Yours. Fast As All Get-Out. Works on any site.
diffbot.com
Image API: image extraction and analysis
https://www.diffbot.com/products/automatic/image
Diffbot's Image API extracts and analyzes individual images and image-heavy pages. Sign up for a Diffbot plan. Test Drive the Image API. Please enter a URL to test. Like all of Diffbot's Automatic APIs, the Image API needs no rules or training. Send it any image-heavy page and let Diffbot do the rest. The Image API automatically evaluates image content and generates tags based on its identified elements. Field to see where else on the web an image (or its variants) has been seen.
diffbot.com
Product API: automatic product data extraction from web pages
https://www.diffbot.com/products/automatic/product
The Product API extracts complete data from any shopping or e-commerce product page. Retrieve full pricing information, product IDs (SKU, UPC, MPN), images, product specifications, brand and more. Sign up for a Diffbot plan. Test Drive the Product API. Please enter a URL to test. Like all of our Automatic APIs, the Product API needs no rules or training. Send it any product page and let Diffbot do the rest. Technology is built-in to the Product API to automatically extract reviews from most product pages.
SOCIAL ENGAGEMENT