---
summary: "Web search + fetch tools (Brave Search API, Perplexity direct/OpenRouter)"
read_when:
  - You want to enable web_search or web_fetch
  - You need Brave Search API key setup
  - You want to use Perplexity Sonar for web search
---

# Web tools

Clawdbot ships two lightweight web tools:

- `web_search`: Search the web via Brave Search API (default) or Perplexity Sonar (direct or via OpenRouter).
- `web_fetch`: HTTP fetch + readable extraction (HTML → markdown/text).

These are **not** browser automation. For JS-heavy sites or logins, use the
[Browser tool](/tools/browser).

## How it works

- `web_search` calls your configured provider and returns results.
  - **Brave** (default): returns structured results (title, URL, snippet).
  - **Perplexity**: returns AI-synthesized answers with citations from real-time web search.
- Results are cached by query for 15 minutes (configurable).
- `web_fetch` does a plain HTTP GET and extracts readable content
  (HTML → markdown/text). It does **not** execute JavaScript.
- `web_fetch` is enabled by default (unless explicitly disabled).

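To make the "cached by query for 15 minutes" behavior concrete, here is a minimal sketch of a time-limited cache keyed by query text. It is an illustration under stated assumptions, not Clawdbot's actual implementation: the `QueryCache` class, its method names, and the trim-and-lowercase key normalization are all hypothetical, and a real cache key would likely also include parameters such as `count` or `country`.

```javascript
// Hypothetical helper: a tiny in-memory cache keyed by normalized query text.
class QueryCache {
  constructor(ttlMinutes = 15, now = () => Date.now()) {
    this.ttlMs = ttlMinutes * 60 * 1000;
    this.now = now; // injectable clock, handy for tests
    this.entries = new Map();
  }

  // Assumption: queries match after trimming and lowercasing.
  static keyFor(query) {
    return query.trim().toLowerCase();
  }

  get(query) {
    const key = QueryCache.keyFor(query);
    const entry = this.entries.get(key);
    if (!entry) return undefined;
    if (this.now() >= entry.expiresAt) {
      this.entries.delete(key); // expired: drop it and report a miss
      return undefined;
    }
    return entry.value;
  }

  set(query, value) {
    this.entries.set(QueryCache.keyFor(query), {
      value,
      expiresAt: this.now() + this.ttlMs,
    });
  }
}
```

A caller would check `get(query)` before contacting the provider and call `set(query, results)` after a successful search. The TTL argument corresponds to `cacheTtlMinutes` in the config.
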
## Choosing a search provider

| Provider | Pros | Cons | API Key |
|----------|------|------|---------|
| **Brave** (default) | Fast, structured results, free tier | Traditional search results | `BRAVE_API_KEY` |
| **Perplexity** | AI-synthesized answers, citations, real-time | Requires Perplexity or OpenRouter access | `OPENROUTER_API_KEY` or `PERPLEXITY_API_KEY` |

See [Brave Search setup](/brave-search) and [Perplexity Sonar](/perplexity) for provider-specific details.

Set the provider in config:

```json5
{
  tools: {
    web: {
      search: {
        provider: "brave" // or "perplexity"
      }
    }
  }
}
```

Example: switch to Perplexity Sonar (direct API):

```json5
{
  tools: {
    web: {
      search: {
        provider: "perplexity",
        perplexity: {
          apiKey: "pplx-...",
          baseUrl: "https://api.perplexity.ai",
          model: "perplexity/sonar-pro"
        }
      }
    }
  }
}
```

## Getting a Brave API key

1) Create a Brave Search API account at https://brave.com/search/api/
2) In the dashboard, choose the **Data for Search** plan (not “Data for AI”) and generate an API key.
3) Run `clawdbot configure --section web` to store the key in config (recommended), or set `BRAVE_API_KEY` in your environment.

Brave provides a free tier plus paid plans; check the Brave API portal for the
current limits and pricing.

### Where to set the key (recommended)

**Recommended:** run `clawdbot configure --section web`. It stores the key in
`~/.clawdbot/clawdbot.json` under `tools.web.search.apiKey`.

**Environment alternative:** set `BRAVE_API_KEY` in the Gateway process
environment. For a gateway install, put it in `~/.clawdbot/.env` (or your
service environment). See [Env vars](/help/faq#how-does-clawdbot-load-environment-variables).

## Using Perplexity (direct or via OpenRouter)

Perplexity Sonar models have built-in web search capabilities and return AI-synthesized
answers with citations. You can call them through the Perplexity API directly or through
OpenRouter, which does not require a credit card (it supports crypto and prepaid credits).

### Getting an OpenRouter API key

1) Create an account at https://openrouter.ai/
2) Add credits (supports crypto, prepaid, or credit card)
3) Generate an API key in your account settings

### Setting up Perplexity search

```json5
{
  tools: {
    web: {
      search: {
        enabled: true,
        provider: "perplexity",
        perplexity: {
          // API key (optional if OPENROUTER_API_KEY or PERPLEXITY_API_KEY is set)
          apiKey: "sk-or-v1-...",
          // Base URL (key-aware default if omitted)
          baseUrl: "https://openrouter.ai/api/v1",
          // Model (defaults to perplexity/sonar-pro)
          model: "perplexity/sonar-pro"
        }
      }
    }
  }
}
```

**Environment alternative:** set `OPENROUTER_API_KEY` or `PERPLEXITY_API_KEY` in the Gateway
environment. For a gateway install, put it in `~/.clawdbot/.env`.

If no base URL is set, Clawdbot chooses a default based on the API key source:

- `PERPLEXITY_API_KEY` or `pplx-...` → `https://api.perplexity.ai`
- `OPENROUTER_API_KEY` or `sk-or-...` → `https://openrouter.ai/api/v1`
- Unknown key formats → OpenRouter (safe fallback)

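The key-aware default above can be sketched as a small pure function. This is a hedged illustration rather than Clawdbot's real code: the function name `resolvePerplexityBaseUrl`, its argument shape, and the lookup order (config key first, then `PERPLEXITY_API_KEY`, then `OPENROUTER_API_KEY`) are assumptions made for the example.

```javascript
const PERPLEXITY_BASE_URL = "https://api.perplexity.ai";
const OPENROUTER_BASE_URL = "https://openrouter.ai/api/v1";

// Hypothetical helper; the name, argument shape, and lookup order are assumptions.
function resolvePerplexityBaseUrl({ baseUrl, apiKey } = {}, env = {}) {
  if (baseUrl) return baseUrl; // an explicit base URL always wins

  // Assumed order: a key from config is inspected before environment variables.
  if (apiKey) {
    if (apiKey.startsWith("pplx-")) return PERPLEXITY_BASE_URL;
    return OPENROUTER_BASE_URL; // covers "sk-or-..." and unknown key formats
  }

  if (env.PERPLEXITY_API_KEY) return PERPLEXITY_BASE_URL;

  // OPENROUTER_API_KEY, or nothing recognizable: fall back to OpenRouter.
  return OPENROUTER_BASE_URL;
}
```

In a real process you would pass `process.env` as the second argument; the sketch takes a plain object so that it stays easy to test.
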
### Available Perplexity models

| Model | Description | Best for |
|-------|-------------|----------|
| `perplexity/sonar` | Fast Q&A with web search | Quick lookups |
| `perplexity/sonar-pro` (default) | Multi-step reasoning with web search | Complex questions |
| `perplexity/sonar-reasoning-pro` | Chain-of-thought analysis | Deep research |

## web_search

Search the web using your configured provider.

### Requirements

- `tools.web.search.enabled` must not be `false` (default: enabled)
- API key for your chosen provider:
  - **Brave**: `BRAVE_API_KEY` or `tools.web.search.apiKey`
  - **Perplexity**: `OPENROUTER_API_KEY`, `PERPLEXITY_API_KEY`, or `tools.web.search.perplexity.apiKey`

### Config

```json5
{
  tools: {
    web: {
      search: {
        enabled: true,
        apiKey: "BRAVE_API_KEY_HERE", // optional if BRAVE_API_KEY is set
        maxResults: 5,
        timeoutSeconds: 30,
        cacheTtlMinutes: 15
      }
    }
  }
}
```

### Tool parameters

- `query` (required)
- `count` (1–10; default from config)
- `country` (optional): 2-letter country code for region-specific results (e.g., "DE", "US"), or "ALL" for no region restriction. If omitted, Brave chooses its default region.
- `search_lang` (optional): ISO language code for search results (e.g., "de", "en", "fr")
- `ui_lang` (optional): ISO language code for UI elements

**Examples:**

```javascript
// German-specific search
await web_search({
  query: "TV online schauen",
  count: 10,
  country: "DE",
  search_lang: "de"
});

// French search with French UI
await web_search({
  query: "actualités",
  country: "FR",
  search_lang: "fr",
  ui_lang: "fr"
});
```

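The `count` rule from the parameter list (1–10, defaulting to the configured value) could be resolved along these lines. This is only a sketch: `resolveCount` is a hypothetical name, the fallback of 5 simply mirrors the `maxResults` value in the sample config, and clamping out-of-range values (rather than rejecting them) is an assumption.

```javascript
// Hypothetical helper: pick the effective result count for a web_search call.
// maxResults stands in for tools.web.search.maxResults from the config.
function resolveCount(count, maxResults = 5) {
  // Fall back to the configured default when count is missing or not an integer.
  const requested = Number.isInteger(count) ? count : maxResults;
  // Assumption: out-of-range values are clamped into 1..10 instead of rejected.
  return Math.min(10, Math.max(1, requested));
}
```
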
## web_fetch

Fetch a URL and extract readable content.

### Requirements

- `tools.web.fetch.enabled` must not be `false` (default: enabled)
- Optional Firecrawl fallback: set `tools.web.fetch.firecrawl.apiKey` or `FIRECRAWL_API_KEY`.

### Config

```json5
{
  tools: {
    web: {
      fetch: {
        enabled: true,
        maxChars: 50000,
        timeoutSeconds: 30,
        cacheTtlMinutes: 15,
        maxRedirects: 3,
        userAgent: "Mozilla/5.0 (Macintosh; Intel Mac OS X 14_7_2) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/122.0.0.0 Safari/537.36",
        readability: true,
        firecrawl: {
          enabled: true,
          apiKey: "FIRECRAWL_API_KEY_HERE", // optional if FIRECRAWL_API_KEY is set
          baseUrl: "https://api.firecrawl.dev",
          onlyMainContent: true,
          maxAgeMs: 86400000, // ms (1 day)
          timeoutSeconds: 60
        }
      }
    }
  }
}
```

### Tool parameters

- `url` (required, http/https only)
- `extractMode` (`markdown` | `text`)
- `maxChars` (truncate long pages)

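As an illustration of how these three parameters constrain a call, the sketch below validates them before any request is made. The helper names `normalizeFetchArgs` and `truncate` are hypothetical, the `markdown` default for `extractMode` is an assumption, and the 50000 default for `maxChars` is taken from the sample config rather than from a documented default.

```javascript
// Hypothetical validator for web_fetch arguments; names mirror the parameter list.
function normalizeFetchArgs({ url, extractMode = "markdown", maxChars = 50000 }) {
  const parsed = new URL(url); // throws a TypeError on malformed input
  if (parsed.protocol !== "http:" && parsed.protocol !== "https:") {
    throw new Error(`Unsupported protocol: ${parsed.protocol}`);
  }
  if (extractMode !== "markdown" && extractMode !== "text") {
    throw new Error(`Unsupported extractMode: ${extractMode}`);
  }
  return { url: parsed.href, extractMode, maxChars };
}

// Truncate extracted content to at most maxChars characters.
function truncate(content, maxChars) {
  return content.length > maxChars ? content.slice(0, maxChars) : content;
}
```
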
Notes:

- `web_fetch` uses Readability (main-content extraction) first, then Firecrawl (if configured). If both fail, the tool returns an error.
- Firecrawl requests use bot-circumvention mode and cache results by default.
- `web_fetch` sends a Chrome-like User-Agent and `Accept-Language` by default; override `userAgent` if needed.
- `web_fetch` blocks private/internal hostnames and re-checks redirects (limit with `maxRedirects`).
- `web_fetch` is best-effort extraction; some sites will need the browser tool.
- See [Firecrawl](/tools/firecrawl) for key setup and service details.
- Responses are cached (default 15 minutes) to reduce repeated fetches.
- If you use tool profiles/allowlists, add `web_search`/`web_fetch` or `group:web`.
- If the Brave key is missing, `web_search` returns a short setup hint with a docs link.

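The note about blocking private/internal hostnames and re-checking redirects can be pictured with the sketch below. It is deliberately simplified and not Clawdbot's actual guard: `isPrivateHostname` and `checkRedirectChain` are hypothetical names, the list of blocked suffixes and IPv4 ranges is an assumption, and a production check would also resolve DNS and handle IPv6 ranges. The chain is passed in as a ready-made array purely for illustration; a real fetcher would check each hop as it follows it.

```javascript
// Hypothetical check covering a few common private/internal cases only.
function isPrivateHostname(hostname) {
  const host = hostname.toLowerCase();
  if (host === "localhost" || host.endsWith(".localhost")) return true;
  if (host.endsWith(".internal") || host.endsWith(".local")) return true;
  if (host === "::1" || host === "[::1]") return true; // IPv6 loopback

  const m = host.match(/^(\d{1,3})\.(\d{1,3})\.(\d{1,3})\.(\d{1,3})$/);
  if (!m) return false; // not a literal IPv4 address
  const a = Number(m[1]);
  const b = Number(m[2]);
  return (
    a === 0 ||
    a === 10 ||
    a === 127 ||
    (a === 169 && b === 254) ||
    (a === 172 && b >= 16 && b <= 31) ||
    (a === 192 && b === 168)
  );
}

// Re-check every hop of a redirect chain and enforce the hop limit.
// urls[0] is the requested URL; each later entry is one redirect target.
function checkRedirectChain(urls, maxRedirects = 3) {
  if (urls.length - 1 > maxRedirects) {
    throw new Error(`Too many redirects (limit ${maxRedirects})`);
  }
  for (const u of urls) {
    const { hostname } = new URL(u);
    if (isPrivateHostname(hostname)) {
      throw new Error(`Blocked host: ${hostname}`);
    }
  }
  return urls[urls.length - 1]; // the final, allowed URL
}
```

Checking every hop matters because a public URL can redirect to an internal address; validating only the first URL would miss that case.
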