Skip to main content
GET
/
workspaces
/
{workspaceId}
/
crawler
/
jobs
/
{jobId}
/
pages
List crawler job pages
curl --request GET \
  --url https://eu-gcp-api.vg-stuff.com/v3/workspaces/{workspaceId}/crawler/jobs/{jobId}/pages \
  --header 'Authorization: Bearer <token>'
{
  "success": true,
  "message": "<string>",
  "data": {
    "pages": [
      {
        "id": "<string>",
        "url": "<string>",
        "urlHash": "<string>",
        "title": "<string>",
        "description": "<string>",
        "scrapedAt": 123,
        "imageUrl": "<string>",
        "userId": "<string>",
        "mdCharCount": 123,
        "htmlCharCount": 123,
        "failed": true
      }
    ],
    "total": 123,
    "page": 123,
    "pageSize": 123
  }
}

Overview

Returns the scraped pages currently stored for a crawler job.

Includes

  • URL
  • Title
  • Description
  • Character counts
  • Failed page marker
Use this endpoint to build a page review UI before exporting or importing scraped content elsewhere.

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Path Parameters

workspaceId
string
required
jobId
string
required

Query Parameters

page
integer
Required range: x >= 1
limit
integer
Required range: 1 <= x <= 100

Response

Successful response

success
boolean
required
message
string
required
data
object
required