Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
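As a minimal sketch, the header described above could be built in Python as follows. The helper name `build_auth_headers` is hypothetical and not part of the API; only the `Authorization: Bearer <token>` form comes from this page.

```python
def build_auth_headers(token: str) -> dict[str, str]:
    """Return HTTP headers carrying the bearer token.

    `build_auth_headers` is an illustrative helper, not an API function.
    """
    if not token:
        raise ValueError("token must be a non-empty string")
    # Header form documented above: "Bearer <token>"
    return {"Authorization": f"Bearer {token}"}
```

The returned dictionary can be passed as the headers argument of whichever HTTP client you use.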
Query Parameters
The ID of the page content to retrieve. Available in the response of the /v1/crawls/{crawl_id}/pages, /v1/scrapes/{scrape_id}, or /v1/batches/{batch_id}/items endpoints.
Optional array to retrieve only specific formats in production. If not provided, all formats will be returned.
Response
Successful response with page content.
HTML content of the page, if requested and available.
Markdown content of the page, if requested and available.
JSON content of the page returned from parsers, if requested and available.
S3 bucket URL of the HTML content. Expires in 7 days.
S3 bucket URL of the Markdown content. Expires in 7 days.
S3 bucket URL of the JSON content. Expires in 7 days.
Whether the size of the content objects exceeds the 6 MB limit. If true, use the hosted S3 URLs to get the content.
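The fallback logic described above (use inline content unless the size limit was exceeded, otherwise download from the hosted S3 URL) might be sketched like this. The field names are not given on this page, so `size_exceeded`, `html`, `markdown`, `json`, and the `<format>_hosted_url` keys are assumptions chosen for illustration; substitute the actual names from the response schema.

```python
from typing import Any, Callable, Optional
from urllib.request import urlopen


def fetch_url(url: str) -> str:
    """Download a hosted content URL and return its body as text."""
    with urlopen(url) as resp:
        return resp.read().decode("utf-8")


def resolve_content(
    response: dict[str, Any],
    fmt: str,
    fetch: Callable[[str], str] = fetch_url,
) -> Optional[Any]:
    """Return content for `fmt` ("html", "markdown", or "json").

    All response keys used here are hypothetical placeholders.
    Inline JSON content may already be parsed, while content fetched
    from a hosted URL is returned as raw text.
    """
    if response.get("size_exceeded"):
        # Content was too large to inline: fall back to the hosted URL.
        url = response.get(f"{fmt}_hosted_url")
        return fetch(url) if url else None
    # Content fits within the limit: read it directly from the response.
    return response.get(fmt)
```

Passing `fetch` as a parameter keeps the function easy to test without network access. Because the hosted URLs expire after 7 days, it is safer to download the content promptly than to store the URL for later use.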