Yelp Scraper
Scrapes public data from Yelp business profiles or review listings. Crawlbyte handles fingerprinting, anti-bot protections, and pagination — so you can collect structured data from Yelp effortlessly.
Endpoint
POST https://api.crawlbyte.ai/api/tasksBasic Configuration (Required)
Business Profile
{
"type": "yelp",
"input": [
"https://www.yelp.com/biz/some-restaurant"
],
"dataType": "profiles",
"multithread": false
}Reviews
{
"type": "yelp",
"input": [
"https://www.yelp.com/biz/some-restaurant"
],
"dataType": "reviews",
"sortBy": "NEWEST_FIRST",
"multithread": false
}
Parameters
type
string
Always "yelp"
input
array
Array of Yelp business URLs
dataType
string
"profiles" or "reviews"
sortBy
string
Only for reviews — controls review order:
• NEWEST_FIRST
• OLDEST_FIRST
• HIGHEST_RATED
• LOWEST_RATED
multithread
boolean
Use true for faster processing with multiple threads
Advanced Configuration (Optional)
{
"type": "yelp",
"input": [
"https://www.yelp.com/biz/some-restaurant"
],
"dataType": "reviews",
"sortBy": "NEWEST_FIRST",
"user_agent_preset": "chrome",
"headers": "{\"X-Test\":\"abc\"}",
"cookie": "",
"proxy": "http://username:password@ip:port"
}Optional Parameters
user_agent_preset
string
Preset user-agent. Options: chrome, firefox, edge, opera, safari, ios-safari, android-chrome, custom
user_agent_custom
string
Used if user_agent_preset is custom.
headers
string
JSON-formatted string of headers.
cookie
string
key=value;
proxy
string
http://username:password@ip:port
Pricing
$0.005 per successful task This is a pay-as-you-go pricing model — you're only charged when a Yelp task successfully returns listings or reviews.
You can view your current credit balance and usage history in the Crawlbyte Dashboard.
Response
The response contains metadata about the task. For the yelp type, the key fields in the response are status and result.
{
"id": "af3e12f2-8f45-43b0-8a7b-cabbbb94c1e9",
"status": "completed",
"result": "JSON_RESULT_HERE"
}If
resultis a hash, you must poll/api/tasks/:idto retrieve the full data.If
resultis a JSON object, the data is already available — no polling needed.
Status Types
queued
Task was accepted and added to the processing queue.
processing
Task is currently running.
completed
Task finished successfully and data was collected from Yelp.
failed
Task encountered an error (e.g. bad proxy, invalid URL, etc.).
Polling
If the initial status is queued or processing, you should poll for task completion.
GET https://api.crawlbyte.ai/api/tasks/:idYou’ll receive the same structure with an updated
status.Only poll until you receive
completedorfailed.Average time: 2–4 seconds, but longer for reviews (full scraping).
SDK Usage
You can run this task using any official Crawlbyte SDK:
Each SDK provides a simple way to:
Create the task
Poll for status
Handle the final result
Refer to the SDKs section for installation, examples, and setup instructions.
Notes
This task supports public Yelp business pages and review listings.
If
dataTypeis set to"reviews", thesortByfield is required. Accepted values:NEWEST_FIRST,OLDEST_FIRST,HIGHEST_RATED,LOWEST_RATED.When scraping reviews, Crawlbyte fetches all available reviews across all pages, which may take longer for listings with high volume.
Crawlbyte handles fingerprinting, bot detection, and pagination internally — no need to configure anything manually.
You can batch multiple business URLs using
multithread: true.
Last updated