Endpoint
Authentication
This endpoint requires authentication using a Bearer token. Include your API key in the Authorization header:Request Body
The URLs to extract data from. URLs should be in glob format.
Prompt to guide the extraction process
Schema to define the structure of the extracted data. Must conform to JSON Schema.
When true, the extraction will use web search to find additional data
When true, sitemap.xml files will be ignored during website scanning
When true, subdomains of the provided URLs will also be scanned
When true, the sources used to extract the data will be included in the response as
sources keyAdditional scraping options to apply. See the scrape endpoint for available options.
If invalid URLs are specified in the urls array, they will be ignored. Instead of them failing the entire request, an extract using the remaining valid URLs will be performed, and the invalid URLs will be returned in the invalidURLs field of the response.
Response
Indicates whether the extract job was successfully started
The unique identifier of the extract job. Use this to check the status.
If ignoreInvalidURLs is true, this is an array containing the invalid URLs that were specified in the request. If there were no invalid URLs, this will be an empty array. If ignoreInvalidURLs is false, this field will be undefined.
Examples
Error Responses
Invalid Request - Invalid input data.
Server Error - An unexpected error occurred on the server.