Amazon Scraper

Collects structured product data from Amazon listings — including titles, prices, ratings, and images.

Endpoint

POST https://api.crawlbyte.ai/api/tasks

Basic Configuration (Required)

{
  "type": "amazon",
  "input": [
    "B0BLS3Y632",
    "B09B2SBHQK"
  ],
  "multithread": false
}

Parameters

| Field | Type | Description |
| --- | --- | --- |
| type | string | Always "amazon" |
| input | array | List of ASINs |
| multithread | boolean | Use true for faster processing with multiple threads |
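
As a minimal sketch, the basic request body can be assembled and posted with the Python standard library. The helper names `build_amazon_task` and `create_task` are hypothetical, and the `auth_headers` argument is an assumption: this page does not describe authentication, so the caller supplies whatever headers their account requires.

```python
import json
import urllib.request

API_URL = "https://api.crawlbyte.ai/api/tasks"


def build_amazon_task(asins, multithread=False):
    """Return the request body for a basic Amazon task."""
    if not asins:
        raise ValueError("input must contain at least one ASIN")
    return {
        "type": "amazon",
        "input": list(asins),
        "multithread": bool(multithread),
    }


def create_task(payload, auth_headers):
    """POST the payload and return the decoded JSON response.

    auth_headers is an assumption: authentication is not documented
    on this page, so pass whatever headers your account requires.
    """
    request = urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", **auth_headers},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


# Build (but do not send) the body shown in Basic Configuration.
payload = build_amazon_task(["B0BLS3Y632", "B09B2SBHQK"])
print(json.dumps(payload, indent=2))
```

Keeping payload construction separate from the network call makes the body easy to inspect or log before any task is submitted.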

Advanced Configuration (Optional)

{
  "type": "amazon",
  "input": [
    "B0BLS3Y632",
    "B09B2SBHQK"
  ],
  "user_agent_preset": "chrome",
  "user_agent_custom": "",
  "headers": "{\"X-Test\":\"abc\"}",
  "cookie": "session=xyz",
  "proxy": "http://username:password@ip:port"
}

Optional Parameters

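| Field | Type | Description |
| --- | --- | --- |
| user_agent_preset | string | Preset user-agent. Options: chrome, firefox, edge, opera, safari, ios-safari, android-chrome, custom |
| user_agent_custom | string | Custom user-agent string. Used only if user_agent_preset is custom. |
| headers | string | JSON-formatted string of headers. |
| cookie | string | Cookie string in key=value; form, for example session=xyz |
| proxy | string | Proxy URL in the form http://username:password@ip:port |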
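
Because headers is a JSON-formatted string rather than a nested object, it is easy to send the wrong shape. The sketch below shows one way to build the advanced body. `build_advanced_task` is a hypothetical helper, and two of its behaviors are assumptions rather than documented API rules: it omits options that were not supplied, and it rejects a custom user-agent when the preset is not custom.

```python
import json

# Preset names listed in the Optional Parameters table.
USER_AGENT_PRESETS = {
    "chrome", "firefox", "edge", "opera",
    "safari", "ios-safari", "android-chrome", "custom",
}


def build_advanced_task(asins, *, user_agent_preset=None, user_agent_custom=None,
                        headers=None, cookies=None, proxy=None):
    """Return an Amazon task body containing only the options supplied."""
    body = {"type": "amazon", "input": list(asins)}
    if user_agent_preset is not None:
        if user_agent_preset not in USER_AGENT_PRESETS:
            raise ValueError(f"unknown user_agent_preset: {user_agent_preset!r}")
        body["user_agent_preset"] = user_agent_preset
    if user_agent_custom:
        # Assumed strictness: the docs only say this field is used with "custom".
        if user_agent_preset != "custom":
            raise ValueError("user_agent_custom requires user_agent_preset='custom'")
        body["user_agent_custom"] = user_agent_custom
    if headers:
        # Serialize the dict: the field holds a JSON string, not an object.
        body["headers"] = json.dumps(headers, separators=(",", ":"))
    if cookies:
        # Join cookie pairs in key=value; form.
        body["cookie"] = "; ".join(f"{k}={v}" for k, v in cookies.items())
    if proxy:
        body["proxy"] = proxy
    return body


task = build_advanced_task(
    ["B0BLS3Y632", "B09B2SBHQK"],
    user_agent_preset="chrome",
    headers={"X-Test": "abc"},
    cookies={"session": "xyz"},
)
print(json.dumps(task, indent=2))
```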

Pricing

  • $0.075 per successful task. This is a pay-as-you-go pricing model: you are only charged when an Amazon task successfully returns product data such as title, price, and availability.

You can view your current credit balance and usage history in the Crawlbyte Dashboard.
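
For budgeting, the per-task price can be turned into a quick estimate. This is only a sketch: `estimate_cost` is a hypothetical helper, and it assumes you already know how many tasks completed successfully (how a multi-ASIN request is counted is not stated here).

```python
from decimal import Decimal

# USD price per successful task, taken from the pricing note above.
PRICE_PER_SUCCESSFUL_TASK = Decimal("0.075")


def estimate_cost(successful_tasks):
    """Return the estimated charge in USD for a number of successful tasks."""
    if successful_tasks < 0:
        raise ValueError("successful_tasks must be non-negative")
    # Decimal avoids binary floating-point rounding in currency arithmetic.
    return PRICE_PER_SUCCESSFUL_TASK * successful_tasks


# 1,000 successful tasks at $0.075 each come to $75.
print(estimate_cost(1000))
```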

Response

The response contains metadata about the task. For the amazon type, the most important fields are status and result.

{
  "id": "bd3e89ed-815e-4395-98a3-521ede71cc4d",
  "status": "completed",
  "result": {
    // Parsed availability and pricing data
  }
}
  • result is a JSON object containing the final scraped product data. Once status is completed, no further polling is required.

Status Types

| Status | Meaning |
| --- | --- |
| queued | Task was accepted and added to the processing queue. |
| processing | Task is currently running. |
| completed | Task finished and product data was successfully collected. |
| failed | Task encountered an error (e.g., invalid input, no results, or system issue). |
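
One way to act on these four statuses in client code is sketched below. `TaskFailed` and `result_if_ready` are hypothetical names, and raising on failed is a design choice of this example, not something the API prescribes.

```python
PENDING_STATUSES = {"queued", "processing"}


class TaskFailed(Exception):
    """Raised when a task reports the failed status."""


def result_if_ready(task):
    """Return the result for a completed task, or None while it is pending."""
    status = task.get("status")
    if status == "completed":
        return task.get("result")
    if status == "failed":
        raise TaskFailed(f"task {task.get('id')} failed")
    if status in PENDING_STATUSES:
        return None
    # Anything else is outside the documented status list.
    raise ValueError(f"unexpected status: {status!r}")


pending = {"id": "bd3e89ed-815e-4395-98a3-521ede71cc4d", "status": "processing"}
done = {"id": "bd3e89ed-815e-4395-98a3-521ede71cc4d", "status": "completed", "result": {}}
print(result_if_ready(pending))
print(result_if_ready(done))
```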

Polling

If status is queued or processing, continue polling the task until it's completed or failed.

GET https://api.crawlbyte.ai/api/tasks/:id
  • You’ll receive the same structure with an updated status.

  • Stop polling as soon as you receive completed or failed.

  • Recommended interval: every 2–3 seconds.
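
The polling rules above can be sketched as a small loop. `poll_task` is a hypothetical helper; `fetch_task` stands in for whatever function performs the GET request and returns the decoded JSON, and the overall timeout is an assumption added so the loop cannot run forever.

```python
import time


def poll_task(task_id, fetch_task, interval=2.5, timeout=60.0,
              sleep=time.sleep, clock=time.monotonic):
    """Call fetch_task(task_id) until the task is completed or failed."""
    deadline = clock() + timeout
    while True:
        task = fetch_task(task_id)
        if task.get("status") in ("completed", "failed"):
            return task
        if clock() >= deadline:
            raise TimeoutError(f"task {task_id} still pending after {timeout}s")
        # Recommended interval is 2-3 seconds between polls.
        sleep(interval)


# Demo with canned responses instead of real HTTP calls.
_responses = iter([
    {"id": "demo", "status": "queued"},
    {"id": "demo", "status": "processing"},
    {"id": "demo", "status": "completed", "result": {}},
])
final = poll_task("demo", lambda _id: next(_responses), sleep=lambda _s: None)
print(final["status"])
```

Passing `sleep` and `clock` as parameters keeps the loop testable without waiting in real time; in production the defaults can be left as they are.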

SDK Usage

You can run this task using any official Crawlbyte SDK.

Each SDK provides a simple way to:

  • Create the task

  • Poll for status

  • Handle the final result

Refer to the SDKs section for installation, examples, and setup instructions.

Notes

  • Only valid ASINs in the input array will return results.

  • Crawlbyte handles retries, rendering, fingerprinting, and anti-bot logic internally — no need to manage it yourself.

  • Set multithread to true if running large volumes.

  • Ensure all required fields, such as type and the list of ASINs in input, are properly structured.

  • The Amazon response includes all relevant product data such as title, price, availability, and seller details.

  • Average task duration is ~2 seconds.
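
Since only valid ASINs return results, it can help to screen input before creating a task. The sketch below assumes the commonly described ASIN shape of ten uppercase letters or digits; Crawlbyte's own validation rules are not stated here, and `split_asins` is a hypothetical helper.

```python
import re

# Assumed shape: exactly ten uppercase letters or digits.
ASIN_PATTERN = re.compile(r"[A-Z0-9]{10}")


def split_asins(candidates):
    """Normalize candidates and split them into (valid, invalid) lists."""
    valid, invalid = [], []
    for raw in candidates:
        asin = raw.strip().upper()
        if ASIN_PATTERN.fullmatch(asin):
            valid.append(asin)
        else:
            invalid.append(asin)
    return valid, invalid


valid, invalid = split_asins(["B0BLS3Y632", " b09b2sbhqk ", "12345"])
print(valid, invalid)
```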
