Skip to content

Core Concepts

Understanding these core concepts will help you make the most of CoreClaw.

A Worker is a program that extracts data from websites. In CoreClaw:

  • Workers are pre-built and ready to use
  • You can also create your own Workers
  • Workers run in the cloud
TypeDescriptionUse Case
StaticScrapes static HTML pagesSimple websites, blogs
DynamicHandles JavaScript-rendered contentSPAs, modern websites
APIDirectly calls website APIsSites with public APIs

A Task is a saved configuration for running a Worker:

  • Pre-configured input parameters
  • Can be scheduled for automatic runs
  • Reusable for repeated data collection
  • Automation: Schedule runs at specific intervals
  • Consistency: Use the same parameters every time
  • Monitoring: Track task performance over time

A Run is a single execution of a Worker:

  • Each run has a unique ID
  • Contains logs and results
  • Can be monitored in real-time
StateDescription
READYRun is queued and waiting
RUNNINGRun is currently executing
SUCCEEDEDRun completed successfully
FAILEDRun encountered an error
ABORTEDRun was manually stopped

Web Unlocker is CoreClaw’s anti-detection technology:

  • Bypasses CAPTCHAs automatically
  • Rotates proxies to avoid IP blocks
  • Handles browser fingerprinting
  • Manages cookies and sessions
  • Target site has anti-bot protection
  • You encounter CAPTCHAs
  • IP blocking is an issue
  • Geographic restrictions apply

CoreClaw provides several storage options:

Storage TypeDescriptionBest For
DatasetStructured data in tabular formatTabular data, CSV export
Key-Value StoreFiles and arbitrary dataImages, PDFs, JSON
Request QueueURLs to be scrapedCrawling multiple pages

CoreClaw uses a usage-based pricing model:

  • Charged per Worker run
  • Based on execution time and resources
  • Different Workers have different costs
  • Charged per GB of data transferred
  • Includes both input and output data
  • Charged per GB/month
  • For stored datasets and results
┌─────────────┐ ┌─────────────┐ ┌─────────────┐
│ Input │────▶│ Worker │────▶│ Output │
│ Parameters │ │ Runtime │ │ Data │
└─────────────┘ └─────────────┘ └─────────────┘
┌─────────────┐
│ Task │
│ Scheduler │
└─────────────┘