HAR Scraper

Scrape real estate listings directly from HAR using structured input data. This task supports flexible filtering by city and price, and allows sorting the results by listing date or price.

Endpoint

POST https://api.crawlbyte.ai/api/tasks

Basic Configuration (Required)

{
  "type": "har",
  "input": [
    "{\"city\":\"Dallas\",\"priceMin\":10000,\"priceMax\":50000}"
  ],
  "sortBy": "CHEAPEST_FIRST"
}

Parameters

Field

Type

Description

type

string

Always "har"

input

array

Array of JSON-formatted strings with city and optional price filters

sortBy

string

Controls result order: • NEWEST_FIRST • OLDEST_FIRST • CHEAPEST_FIRST • EXPENSIVE_FIRST

multithread

boolean

Use true for faster processing with multiple threads

Input Object Format

"{\"city\":\"Houston\",\"priceMin\":100000,\"priceMax\":200000}"

city (required): Target city name
priceMin (optional): Minimum price
priceMax (optional): Maximum price

You may use only priceMin or priceMax if desired.

Advanced Configuration (Optional)

{
  "type": "har",
  "input": [
    "{\"city\":\"Dallas\",\"priceMin\":10000,\"priceMax\":50000}"
  ],
  "sortBy": "CHEAPEST_FIRST",
  "user_agent_preset": "chrome",
  "user_agent_custom": "",
  "headers": "{\"X-Test\":\"abc\"}",
  "cookie": "session=xyz",
  "proxy": "http://username:password@ip:port"
}

Optional Parameters

Field

Type

Description

user_agent_preset

string

Preset user-agent. Options: chrome, firefox, edge, opera, safari, ios-safari, android-chrome, custom

user_agent_custom

string

Used if user_agent_preset is custom.

headers

string

JSON-formatted string of headers.

string

key=value;

proxy

string

http://username:password@ip:port

Pricing

$0.005 per successful task This is a pay-as-you-go pricing model — you're only charged when a HAR task successfully returns listings.

You can view your current credit balance and usage history in the Crawlbyte Dashboard.

Response

The response contains metadata about the task. For har type, the most important fields are status and result.

{
  "id": "bd3e89ed-815e-4395-98a3-521ede71cc4d",
  "status": "completed",
  "result": {
    // all listings here
  }
}

result is a JSON object, that's the final scraped results — no further polling is needed.

Status Types

Status

Meaning

queued

Task was accepted and added to the processing queue.

processing

Task is currently running.

completed

Task finished and listings were collected.

failed

Task encountered an error (invalid URL, etc.).

Polling

If status is queued or processing, continue polling the task until it's completed or failed.

GET https://api.crawlbyte.ai/api/tasks/:id

You’ll receive the same structure with an updated status.
Only poll until you receive completed or failed.
Recommended interval: every 3–5 seconds.

SDK Usage

You can run this task using any official Crawlbyte SDK:

Each SDK provides a simple way to:

Create the task
Poll for status
Handle the final result

Refer to the SDKs section for installation, examples, and setup instructions.

Notes

Input values must be valid — misspelled cities or malformed price fields will result in empty results or failure.
The input must always be a JSON string, even if it contains only the city field.
sortBy is required and must exactly match one of the supported options.
Crawlbyte automatically handles pagination, anti-bot protections, and request sequencing.
multithread: true can be used in advanced settings for faster bulk processing.

PreviousFlyFrontier App Scraper NextLinkedIn Scraper

Last updated 4 months ago

hashtagEndpoint

hashtagBasic Configuration (Required)

hashtagParameters

hashtagInput Object Format

hashtagAdvanced Configuration (Optional)

hashtagOptional Parameters

hashtagPricing

hashtagResponse

hashtagStatus Types

hashtagPolling

hashtagSDK Usage

hashtagNotes

Endpoint

Basic Configuration (Required)

Parameters

Input Object Format

Advanced Configuration (Optional)

Optional Parameters

Pricing

Response

Status Types

Polling

SDK Usage

Notes