GET
/
v1
/
crawls
/
{crawlId}
/
pages

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Path Parameters

crawl_id
string
required

The ID of the crawl to retrieve the list of URLs for.

Query Parameters

cursor
integer

Optional integer representing the index to start fetching content from. Useful to paginate until all URLs are fetched. Start with 0, then provide response['cursor'] value of the last request.

limit
integer

Optional integer to limit the number of results returned. Recommended 10-50 results at a time. Paginated using cursor. Maximum 10MB of content can be fetched in a single request.

search_query
string

An optional search query to sort the results by relevance. Uses the original search_query by default if provided.

formats
enum<string>[]

Deprecated: Use /retrieve endpoint with retrieve_id.

Array of formats to fetch (e.g., ["html", "markdown"]).

Available options:
html,
markdown

Response

200 - application/json
crawl_id
string

Crawl ID

cursor
integer

To be passed in the query in next request to get the next items.

metadata
object
object
string

The kind of object. "crawl" for this endpoint.

pages
object[]
pages_count
number
search_query
string
status
string

in_progress or completed