Firecrawl
Clawdia can use Firecrawl as a fallback extractor for web_fetch. It is a hosted
content extraction service that supports bot circumvention and caching, which helps
with JS-heavy sites or pages that block plain HTTP fetches.
Get an API key
- Create a Firecrawl account and generate an API key.
- Store it in config or set FIRECRAWL_API_KEY in the gateway environment.
Configure Firecrawl
- firecrawl.enabled defaults to true when an API key is present.
- maxAgeMs controls how old cached results can be, in milliseconds. The default is 2 days (172,800,000 ms).
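The defaults described here can be sketched as a small resolver. This is an illustration only, not Clawdia's actual implementation: the function name, the dict-shaped config, the `apiKey` key name, and the rule that a key stored in config takes precedence over the environment variable are all assumptions.

```python
import os
from dataclasses import dataclass
from typing import Any, Mapping, Optional

# 2 days in milliseconds, matching the documented maxAgeMs default.
DEFAULT_MAX_AGE_MS = 2 * 24 * 60 * 60 * 1000


@dataclass(frozen=True)
class FirecrawlSettings:
    api_key: Optional[str]
    enabled: bool
    max_age_ms: int


def resolve_firecrawl_settings(
    config: Mapping[str, Any],
    env: Mapping[str, str] = os.environ,
) -> FirecrawlSettings:
    """Hypothetical resolver for the Firecrawl settings described above."""
    # Assumption: a key in config ("apiKey" is a made-up name) wins over
    # FIRECRAWL_API_KEY from the environment. Empty strings count as missing.
    api_key = config.get("apiKey") or env.get("FIRECRAWL_API_KEY") or None
    # enabled defaults to true when an API key is present. Assumption: without
    # a key Firecrawl stays disabled even if enabled is set explicitly.
    enabled = api_key is not None and bool(config.get("enabled", True))
    max_age_ms = int(config.get("maxAgeMs", DEFAULT_MAX_AGE_MS))
    return FirecrawlSettings(api_key, enabled, max_age_ms)
```

Passing `env` explicitly keeps the function easy to exercise without touching the real process environment.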
Stealth / bot circumvention
Firecrawl exposes a proxy mode parameter for bot circumvention (basic, stealth, or auto).
Clawdia always uses proxy: "auto" plus storeInCache: true for Firecrawl requests.
If proxy is omitted, Firecrawl defaults to auto. In auto mode, Firecrawl retries with stealth proxies if a basic attempt fails, which may use more credits
than basic-only scraping.
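As a rough sketch, the request options described above could be assembled like this. The `proxy` and `storeInCache` values come from this page. The helper name, the `formats` value, and the mapping of maxAgeMs onto a `maxAge` request field are assumptions, so check them against the Firecrawl API reference before relying on them.

```python
import json
from typing import Any, Dict


def build_scrape_payload(url: str, max_age_ms: int) -> Dict[str, Any]:
    """Hypothetical helper that builds a Firecrawl scrape request body."""
    return {
        "url": url,
        # Assumption: markdown output is requested; this page does not say.
        "formats": ["markdown"],
        # Documented above: always "auto", so Firecrawl may retry with
        # stealth proxies when a basic attempt fails.
        "proxy": "auto",
        # Documented above: results are always stored in Firecrawl's cache.
        "storeInCache": True,
        # Assumption: maxAgeMs is forwarded as the request's maxAge field.
        "maxAge": max_age_ms,
    }


if __name__ == "__main__":
    payload = build_scrape_payload("https://example.com", 172_800_000)
    print(json.dumps(payload, indent=2))
```

The sketch only builds the body and sends nothing, so it makes no claim about the endpoint or authentication headers.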
How web_fetch uses Firecrawl
web_fetch extraction order:
1. Readability (local)
2. Firecrawl (if configured)
3. Basic HTML cleanup (last fallback)
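The extraction order above amounts to a fallback chain. Here is a minimal sketch, assuming each extractor is a function that takes the URL and raw HTML and returns text or None. The names are hypothetical, and treating an empty result or an exception as the signal to move on is an assumption, since this page does not say what triggers a fallback.

```python
from typing import Callable, Optional, Sequence, Tuple

# Hypothetical extractor signature: (url, html) -> extracted text, or None.
Extractor = Callable[[str, str], Optional[str]]


def extract_with_fallback(
    url: str,
    html: str,
    extractors: Sequence[Tuple[str, Extractor]],
) -> Tuple[str, str]:
    """Try each extractor in order; return (name, text) for the first success."""
    for name, extractor in extractors:
        try:
            text = extractor(url, html)
        except Exception:
            # Assumption: a failing extractor falls through to the next one.
            text = None
        if text and text.strip():
            return name, text
    raise ValueError("no extractor produced content")


if __name__ == "__main__":
    # Stand-in extractors; real ones would call Readability, Firecrawl, etc.
    chain = [
        ("readability", lambda url, html: None),  # simulate a local miss
        ("firecrawl", lambda url, html: "# Example\n\nBody text"),
        ("basic", lambda url, html: "Example Body text"),
    ]
    name, text = extract_with_fallback("https://example.com", "<p>x</p>", chain)
    print(name)
```

When Firecrawl is not configured, the caller would leave its entry out of the list, so the chain goes straight from Readability to basic HTML cleanup.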
