arXiv MCP — Installieren & Live-Demo

Warum nutzen

Hauptfunktionen

Search by query, category (cs.AI, cs.CL, stat.ML, etc.), date range, author
Download PDF to a local cache directory
Extract text from a downloaded paper for summarization or QA
List locally cached papers so you don't re-download
No API key — arXiv's query API is public

Live-Demo

In der Praxis

arxiv.replay ▶ bereit

0/0

Installieren

Wählen Sie Ihren Client

~/Library/Application Support/Claude/claude_desktop_config.json · Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "arxiv": {
      "command": "uvx",
      "args": [
        "arxiv-mcp-server"
      ]
    }
  }
}

Öffne Claude Desktop → Settings → Developer → Edit Config. Nach dem Speichern neu starten.

~/.cursor/mcp.json · .cursor/mcp.json

{
  "mcpServers": {
    "arxiv": {
      "command": "uvx",
      "args": [
        "arxiv-mcp-server"
      ]
    }
  }
}

Cursor nutzt das gleiche mcpServers-Schema wie Claude Desktop. Projektkonfiguration schlägt die globale.

VS Code → Cline → MCP Servers → Edit

{
  "mcpServers": {
    "arxiv": {
      "command": "uvx",
      "args": [
        "arxiv-mcp-server"
      ]
    }
  }
}

Klicken Sie auf das MCP-Servers-Symbol in der Cline-Seitenleiste, dann "Edit Configuration".

~/.codeium/windsurf/mcp_config.json

{
  "mcpServers": {
    "arxiv": {
      "command": "uvx",
      "args": [
        "arxiv-mcp-server"
      ]
    }
  }
}

Gleiche Struktur wie Claude Desktop. Windsurf neu starten zum Übernehmen.

~/.continue/config.json

{
  "mcpServers": [
    {
      "name": "arxiv",
      "command": "uvx",
      "args": [
        "arxiv-mcp-server"
      ]
    }
  ]
}

Continue nutzt ein Array von Serverobjekten statt einer Map.

~/.config/zed/settings.json

{
  "context_servers": {
    "arxiv": {
      "command": {
        "path": "uvx",
        "args": [
          "arxiv-mcp-server"
        ]
      }
    }
  }
}

In context_servers hinzufügen. Zed lädt beim Speichern neu.

claude mcp add arxiv -- uvx arxiv-mcp-server

Einzeiler. Prüfen mit claude mcp list. Entfernen mit claude mcp remove.

Anwendungsfälle

Praxisnahe Nutzung: arXiv

Build a mini literature survey on a niche topic

👤 Researchers, grad students, curious engineers ⏱ ~25 min intermediate

Wann einsetzen: You're starting work on a topic (e.g. 'speculative decoding') and want the 10 most relevant recent papers with summaries.

Voraussetzungen

Local cache dir writable — Default under user home; override via ARXIV_STORAGE_PATH

Ablauf

Search strategically

Search arXiv for 'speculative decoding' in cs.CL or cs.LG, last 12 months, sort by relevance. Top 20.✓ Kopiert

→ List of arxiv ids + titles + abstracts
Download the top candidates

Download the top 10 papers locally.✓ Kopiert

→ Papers cached; return local paths
Summarize each in one paragraph

For each downloaded paper, extract text and write a 4-line summary: problem, method, result, limitations. Preserve the arxiv id.✓ Kopiert

→ Structured summaries with citations

Ergebnis: A 10-paper survey table ready for a related-work section or blog post.

Fallstricke

arXiv relevance sort is weak; you'll miss important papers sorted elsewhere — Also search sorted by submittedDate desc; triangulate via Semantic Scholar / Google Scholar for citation counts

Kombinieren mit: filesystem · qdrant

Deep-read a single paper with Q&A

👤 Anyone reading a dense paper ⏱ ~20 min beginner

Wann einsetzen: You have one specific paper (say, the FlashAttention-3 paper) and want to understand it without reading the full PDF alone.

Ablauf

Download the paper

Download arxiv paper 2405.12345. Report number of pages and total word count.✓ Kopiert

→ File cached + stats
Summarize by section

Read the paper. Give me a section-by-section summary. For each section: goal, key points, any equations worth understanding (in plain English).✓ Kopiert

→ Structured walkthrough
Ask targeted questions

Specific Q: [your question]. Answer only from the paper; cite the section and any equation numbers.✓ Kopiert

→ Grounded answer with cites

Ergebnis: Paper-level understanding in 20 minutes instead of 2 hours.

Fallstricke

PDF extraction mangles equations and tables — For heavy-math papers, ask Claude to note 'equation extraction may be unreliable' and cross-check critical formulas against the PDF

Weekly digest of new papers in your field

👤 Academics, ML engineers tracking a subfield ⏱ ~15 min beginner

Wann einsetzen: Monday morning: 'what's new in cs.CL submitted in the last 7 days that's worth reading?'

Ablauf

Pull recent submissions

Search arXiv cs.CL submissions in the last 7 days. Return top 50 by relevance or arbitrary.✓ Kopiert

→ Recent papers list
Filter by keywords you care about

Keep only papers whose title or abstract mentions [your keywords]. Dedupe.✓ Kopiert

→ Narrowed shortlist
Abstract digest

For each kept paper, generate 2-line 'why it might matter' from the abstract. Mark 3 as must-reads.✓ Kopiert

→ Weekly digest

Ergebnis: A curated weekly reading list without doomscrolling arxiv-sanity.

Fallstricke

Abstracts oversell; 'must-read' tag can be wrong — Treat the tag as a prompt to read the abstract yourself, not as endorsement

Kombinieren mit: notion

Kombinationen

Mit anderen MCPs für 10-fache Wirkung

arxiv + qdrant

Build a searchable library of papers for semantic recall

Download the top 30 papers on 'mixture of experts'. Index each chunk into Qdrant collection papers_moe. Later answer: 'what tricks do MoE papers use for load balancing?'✓ Kopiert

arxiv + filesystem

Write a markdown survey file with inline citations

Download 10 papers on topic X, save summaries to /research/survey-X.md with [arxiv:id] links.✓ Kopiert

arxiv + notion

Post a weekly paper digest to a Notion research DB

Run the weekly digest for cs.CL, create a Notion page with the 5 must-reads as rows.✓ Kopiert

Werkzeuge

Was dieses MCP bereitstellt

Werkzeug	Eingaben	Wann aufrufen	Kosten
search_papers	query: str, category?, max_results?, date_range?	Discover relevant papers by query/category/date	free
download_paper	paper_id	Cache a PDF locally for extraction	free
read_paper	paper_id	Extract text from a cached paper for reading/QA	free
list_papers		See what's already downloaded to avoid re-fetch	free

Kosten & Limits

Was der Betrieb kostet

API-Kontingent: arXiv query API recommends ~1 req/3s; higher rates may get throttled
Tokens pro Aufruf: Search: 500–2000 tokens. Paper text: 5k–30k tokens per paper.
Kosten in €: Free
Tipp: Cache aggressively; re-reading a paper's extracted text is free once downloaded.

Sicherheit

Rechte, Secrets, Reichweite

Credential-Speicherung: None needed

Datenabfluss: Queries to export.arxiv.org; PDF downloads from arxiv.org

Respect arXiv's 1 req / 3s recommendation; don't parallelize aggressively.
Only cache papers with proper arXiv ids; do not mirror the full archive.

Fehlerbehebung

Häufige Fehler und Lösungen

Empty search results for a clearly existing topic

arXiv search is keyword-exact for quoted strings; try broader terms and the correct category prefix (cs.CL vs cs.AI).

Download failed / PDF unavailable

Very rare; some withdrawn papers 404. Confirm the id on arxiv.org/abs/<id>.

Extracted text is garbled

Some math-heavy papers have non-standard PDFs; try the source version if available, else note the limitation.

Alternativen

arXiv vs. andere

Alternative	Wann stattdessen	Kompromiss
Semantic Scholar MCP	You need citation counts and influence metrics	Not arXiv-specific; coverage varies
Papers with Code MCP	You want papers with code implementations linked	Smaller catalog, ML-focused

Mehr

Ressourcen

📖 Offizielle README auf GitHub lesen

🐙 Offene Issues ansehen

🔍 Alle 400+ MCP-Server und Skills durchsuchen