Jina AI MCP — 安裝 & 即時演示

為什麼要用

核心特性

First-party — official Jina AI MCP
read_url returns clean markdown — handles JS-rendered sites
Search across web, arXiv, SSRN, images, and BibTeX in one interface
Processing tools: rerank, classify, dedupe (text and images), extract_pdf

即時演示

實際使用效果

jina.replay ▶ 就緒

0/0

安裝

選擇你的客戶端

~/Library/Application Support/Claude/claude_desktop_config.json · Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "jina": {
      "command": "npx",
      "args": [
        "-y",
        "mcp-remote",
        "https://mcp.jina.ai/sse"
      ]
    }
  }
}

開啟 Claude Desktop → Settings → Developer → Edit Config。儲存後重啟應用。

~/.cursor/mcp.json · .cursor/mcp.json

{
  "mcpServers": {
    "jina": {
      "command": "npx",
      "args": [
        "-y",
        "mcp-remote",
        "https://mcp.jina.ai/sse"
      ]
    }
  }
}

Cursor 使用與 Claude Desktop 相同的 mcpServers 格式。專案級設定優先於全域。

VS Code → Cline → MCP Servers → Edit

{
  "mcpServers": {
    "jina": {
      "command": "npx",
      "args": [
        "-y",
        "mcp-remote",
        "https://mcp.jina.ai/sse"
      ]
    }
  }
}

點擊 Cline 側欄中的 MCP Servers 圖示，然後選 "Edit Configuration"。

~/.codeium/windsurf/mcp_config.json

{
  "mcpServers": {
    "jina": {
      "command": "npx",
      "args": [
        "-y",
        "mcp-remote",
        "https://mcp.jina.ai/sse"
      ]
    }
  }
}

格式與 Claude Desktop 相同。重啟 Windsurf 生效。

~/.continue/config.json

{
  "mcpServers": [
    {
      "name": "jina",
      "command": "npx",
      "args": [
        "-y",
        "mcp-remote",
        "https://mcp.jina.ai/sse"
      ]
    }
  ]
}

Continue 使用伺服器物件陣列，而非映射。

~/.config/zed/settings.json

{
  "context_servers": {
    "jina": {
      "command": {
        "path": "npx",
        "args": [
          "-y",
          "mcp-remote",
          "https://mcp.jina.ai/sse"
        ]
      }
    }
  }
}

加入 context_servers。Zed 儲存後熱重載。

claude mcp add jina -- npx -y mcp-remote https://mcp.jina.ai/sse

一行命令搞定。用 claude mcp list 驗證，claude mcp remove 移除。

使用場景

實戰用法： Jina AI

Digest recent arXiv papers on a topic

👤 Researchers, ML engineers staying current ⏱ ~20 min intermediate

何時使用： You want to know what's new on arXiv about your topic without reading 50 abstracts.

前置條件

Optional Jina API key — jina.ai → dashboard → API key (free tier works for light use)

步驟

Search arXiv

Use search_arxiv to find papers from the last 30 days about 'speculative decoding for LLM inference'. Return top 20.✓ 已複製

→ Paper list with titles, authors, abstracts
Rerank by relevance

Use sort_by_relevance to rerank against this query: 'practical speedups in production inference, not pure research'. Keep top 8.✓ 已複製

→ Reranked list
Summarize each

For the top 8, extract_pdf the paper, summarize in 3 bullets: contribution, method, reported speedup. Output as a markdown table.✓ 已複製

→ Digest-ready summary table

結果： A weekly research digest on your topic in 10 minutes.

注意事項

extract_pdf on every result is expensive — credits add up — Rerank first to cut candidates, only extract the top N

搭配使用： notion

Convert a batch of URLs into clean markdown for RAG

👤 AI engineers building retrieval systems ⏱ ~15 min intermediate

何時使用： You have a list of URLs to ingest. You want clean markdown, not raw HTML or a parsing pipeline.

步驟

Read URLs in parallel

Use parallel_read_url on this list [URLs]. Return markdown for each with original URL as key.✓ 已複製

→ Markdown per URL
Dedupe near-duplicates

Use deduplicate_strings at 0.9 similarity to drop near-duplicate pages (common for mirror docs).✓ 已複製

→ Deduped set with IDs of dropped pages
Save to disk

Save each to ./knowledge/<slug>.md where slug is derived from the URL path.✓ 已複製

→ Markdown files ready for embedding pipeline

結果： A clean corpus for your embedding/indexing step, without writing any scraping code.

注意事項

Paywalled or JS-auth-walled pages return blank/garbage — Spot check a few URLs — if the content is thin, fall back to playwright for auth flows

搭配使用： filesystem · firecrawl

Classify a batch of text with custom labels

👤 Data analysts, growth teams ⏱ ~15 min beginner

何時使用： You have N free-text items (tickets, reviews, survey responses) and want them bucketed into your taxonomy.

步驟

Define labels

My labels: ['bug', 'feature_request', 'question', 'praise', 'other']. Sample the first 10 items and sanity-check the labels fit.✓ 已複製

→ Labels validated against samples
Batch classify

Use classify_text on all items with those labels. Return {id, text, label, confidence}.✓ 已複製

→ Labelled dataset
Review low-confidence

Flag items where confidence < 0.6 for manual review. Summarize: distribution, outliers, likely missing labels.✓ 已複製

→ Review queue + taxonomy feedback

結果： A labeled dataset without fine-tuning a classifier or writing prompts per item.

注意事項

Labels are ambiguous and classifier flip-flops on near-ties — Make labels mutually exclusive; if items span categories, allow multi-label output

搭配使用： filesystem

組合

與其他 MCP 搭配，撬動十倍槓桿

jina + notion

Weekly research digest posted to Notion

Search arXiv for new 'agentic RAG' papers this week. Summarize each and create a Notion page in the Research Digest database.✓ 已複製

jina + firecrawl

Jina for single pages, Firecrawl for full crawls — same clean-markdown output

For the list of URLs, use parallel_read_url (Jina). For the 3 full docs sites, use Firecrawl crawl. Merge into one knowledge dir.✓ 已複製

jina + filesystem

Build a local markdown knowledge base from a reading list

Read each URL in urls.txt, dedupe, save to ./knowledge/<hash>.md. Overwrite only if content changed.✓ 已複製

工具

此 MCP 暴露的能力

工具	輸入參數	何時呼叫	成本
search_web	query, num_results?	General web search	credits per call
search_arxiv / search_ssrn / search_bibtex / search_images / search_jina_blog	query	Targeted searches	credits per call
parallel_search_web / parallel_search_arxiv / parallel_search_ssrn	query[]	Multi-query research in one call	credits × N queries
read_url	url	Clean content extraction from any URL	credits per page
parallel_read_url	url[]	Batch webpage ingestion	credits × N pages
capture_screenshot_url	url	Visual snapshot of a page	credits
sort_by_relevance	documents, query	Rerank after search for quality	credits
classify_text	texts, labels	Zero-shot classification	credits per text
deduplicate_strings / deduplicate_images	items, threshold	Remove near-duplicates from a corpus	credits
extract_pdf	url or file	Get structured content from PDFs	credits per PDF
expand_query / primer / guess_datetime_url	utility	Helpers around search tuning	credits (minor)

成本與限制

運行它的成本

API 配額: Free tier available with rate limits; paid tiers scale
每次呼叫 Token 數: Output is the bigger cost — PDFs and dedupes can return 10k+ tokens
費用: Jina API credits, typically measured per-request. See jina.ai/pricing.
提示: Rerank before extracting — extract_pdf is expensive. Cache read_url outputs locally; most pages don't change daily.

安全

權限、密鑰、影響範圍

憑證儲存： JINA_API_KEY env var (optional for many tools, required for heavy use)

資料出站： All calls to api.jina.ai / r.jina.ai / s.jina.ai — queries and URLs visible to Jina

故障排查

常見錯誤與修復

429 Too Many Requests

Free tier has low rate limits. Add a JINA_API_KEY env var and upgrade at jina.ai for burst capacity.

read_url returns empty markdown

Page may be auth-walled or bot-blocked. Try with different User-Agent via tool options, or fall back to playwright/firecrawl.

classify_text assigns everything to 'other'

Your labels may be too narrow or too similar. Add label descriptions ('bug: user reports something broken') for better zero-shot accuracy.

search_arxiv misses recent papers

arXiv index may lag; cross-check with a direct arxiv.org search. Use expand_query to broaden terms.

替代方案

Jina AI 對比其他方案

替代方案	何時用它替代	權衡
Firecrawl	You need full-site crawls or JSON-schema extraction	Crawl-focused; Jina's superpower is the breadth of processing tools beyond just reading
Exa Search MCP	You want semantic/neural web search as a primary interface	Stronger on semantic retrieval; narrower than Jina's toolbox
Brave Search MCP	You want independent search index + privacy	Search only, no reader/rerank/classify

Jina AI

為什麼要用

核心特性

即時演示

實際使用效果

安裝

選擇你的客戶端

使用場景

實戰用法： Jina AI

Digest recent arXiv papers on a topic

前置條件

步驟

注意事項

Convert a batch of URLs into clean markdown for RAG

步驟

注意事項

Classify a batch of text with custom labels

步驟

注意事項

組合

與其他 MCP 搭配，撬動十倍槓桿

工具

此 MCP 暴露的能力

成本與限制

運行它的成本

安全

權限、密鑰、影響範圍

故障排查

常見錯誤與修復

替代方案

Jina AI 對比其他方案

更多

資源