arXiv

by blazickjp · blazickjp/arxiv-mcp-server

Search arXiv, download papers, and let Claude read + summarize them — a lightweight research assistant for the latest preprints.

The arxiv-mcp-server lets Claude search arXiv by keyword/category/date, download PDFs, and extract their text for in-chat reading. No API key; arXiv is fully public. Ideal for literature surveys, paper summarization, and keeping up with fast-moving ML/physics/CS subfields.

Install

Choose your client

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json · Windows: %APPDATA%\Claude\claude_desktop_config.json
{
  "mcpServers": {
    "arxiv": {
      "command": "uvx",
      "args": [
        "arxiv-mcp-server"
      ]
    }
  }
}

Open Claude Desktop → Settings → Developer → Edit Config. Restart Claude Desktop after saving.
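If the config file already lists other servers, pasting the snippet above over the whole file would drop them. A minimal sketch of a safer merge, assuming the file (when it exists) holds a single JSON object; `add_arxiv_server` is a hypothetical helper, not something Claude Desktop or the server ships:

```python
import json
from pathlib import Path

# Server entry exactly as shown in the config snippet above.
ARXIV_ENTRY = {"command": "uvx", "args": ["arxiv-mcp-server"]}


def add_arxiv_server(config_path: Path) -> dict:
    """Add the arxiv entry to an MCP config file, keeping other keys intact."""
    config = {}
    if config_path.exists():
        config = json.loads(config_path.read_text(encoding="utf-8"))
    # setdefault keeps any servers that are already configured.
    config.setdefault("mcpServers", {})["arxiv"] = dict(ARXIV_ENTRY)
    config_path.parent.mkdir(parents=True, exist_ok=True)
    config_path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

Calling `add_arxiv_server(Path.home() / "Library/Application Support/Claude/claude_desktop_config.json")` would apply it to the macOS path listed above.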

~/.cursor/mcp.json · .cursor/mcp.json
{
  "mcpServers": {
    "arxiv": {
      "command": "uvx",
      "args": [
        "arxiv-mcp-server"
      ]
    }
  }
}

Cursor uses the same mcpServers schema as Claude Desktop. The project configuration overrides the global one.

VS Code → Cline → MCP Servers → Edit
{
  "mcpServers": {
    "arxiv": {
      "command": "uvx",
      "args": [
        "arxiv-mcp-server"
      ]
    }
  }
}

Click the MCP Servers icon in the Cline sidebar, then "Edit Configuration".

~/.codeium/windsurf/mcp_config.json
{
  "mcpServers": {
    "arxiv": {
      "command": "uvx",
      "args": [
        "arxiv-mcp-server"
      ]
    }
  }
}

Same structure as Claude Desktop. Restart Windsurf to apply the change.

~/.continue/config.json
{
  "mcpServers": [
    {
      "name": "arxiv",
      "command": "uvx",
      "args": [
        "arxiv-mcp-server"
      ]
    }
  ]
}

Continue uses an array of server objects instead of a map.
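The difference between the two shapes is mechanical: each map key becomes a `name` field. A small sketch, assuming only the two layouts shown on this page; `to_continue_servers` is a hypothetical helper name:

```python
def to_continue_servers(mcp_servers: dict) -> list:
    """Convert a name-keyed mcpServers map into the array-of-objects form."""
    return [{"name": name, **entry} for name, entry in mcp_servers.items()]


# Map form, as used by Claude Desktop, Cursor, Cline and Windsurf above.
claude_style = {"arxiv": {"command": "uvx", "args": ["arxiv-mcp-server"]}}
# Array form, as shown for Continue.
continue_style = {"mcpServers": to_continue_servers(claude_style)}
```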

~/.config/zed/settings.json
{
  "context_servers": {
    "arxiv": {
      "command": {
        "path": "uvx",
        "args": [
          "arxiv-mcp-server"
        ]
      }
    }
  }
}

Add the entry under context_servers. Zed reloads the configuration on save.

claude mcp add arxiv -- uvx arxiv-mcp-server

One-liner. Verify with claude mcp list. Remove with claude mcp remove arxiv.

Use cases

Practical usage: arXiv

Build a mini literature survey on a niche topic

👤 Researchers, grad students, curious engineers ⏱ ~25 min intermediate

When to use: You're starting work on a topic (e.g. 'speculative decoding') and want the 10 most relevant recent papers with summaries.

Prerequisites
  • Local cache dir writable: defaults to a directory under the user's home; override via ARXIV_STORAGE_PATH
Steps
  1. Search strategically
    Search arXiv for 'speculative decoding' in cs.CL or cs.LG, last 12 months, sort by relevance. Top 20.
    → List of arxiv ids + titles + abstracts
  2. Download the top candidates
    Download the top 10 papers locally.
    → Papers cached; return local paths
  3. Summarize each in one paragraph
    For each downloaded paper, extract text and write a 4-line summary: problem, method, result, limitations. Preserve the arxiv id.
    → Structured summaries with citations

Result: A 10-paper survey table ready for a related-work section or blog post.

Pitfalls
  • arXiv relevance sort is weak, so important papers can rank too low to show up: also search sorted by submittedDate desc, and triangulate via Semantic Scholar / Google Scholar for citation counts
Combine with: filesystem · qdrant
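Step 1 and the pitfall above come down to running the same query under two sort orders. A minimal sketch against the public arXiv query API: the endpoint and the `search_query`, `sortBy`, `sortOrder` and `max_results` parameters are arXiv's, while `build_search_url` is a hypothetical helper, and whether the MCP server builds its requests this way is an assumption.

```python
from datetime import date, timedelta
from urllib.parse import urlencode

ARXIV_API = "http://export.arxiv.org/api/query"


def build_search_url(phrase, categories, days_back, sort_by="relevance",
                     max_results=20, today=None):
    """Build an arXiv API URL for a quoted phrase within some categories."""
    today = today or date.today()
    start = today - timedelta(days=days_back)
    cats = " OR ".join(f"cat:{c}" for c in categories)
    # submittedDate takes a [YYYYMMDDHHMM TO YYYYMMDDHHMM] range in GMT.
    window = f"submittedDate:[{start:%Y%m%d}0000 TO {today:%Y%m%d}2359]"
    params = {
        "search_query": f'all:"{phrase}" AND ({cats}) AND {window}',
        "sortBy": sort_by,  # relevance | lastUpdatedDate | submittedDate
        "sortOrder": "descending",
        "max_results": max_results,
    }
    return f"{ARXIV_API}?{urlencode(params)}"


# Same query twice: once by relevance, once newest first.
by_relevance = build_search_url("speculative decoding", ["cs.CL", "cs.LG"], 365)
by_date = build_search_url("speculative decoding", ["cs.CL", "cs.LG"], 365,
                           sort_by="submittedDate")
```

Merging the two result lists by arXiv id gives a candidate pool that is less dependent on the relevance ranking alone.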

Deep-read a single paper with Q&A

👤 Anyone reading a dense paper ⏱ ~20 min beginner

When to use: You have one specific paper (say, the FlashAttention-3 paper) and want to understand it without reading the full PDF alone.

Steps
  1. Download the paper
    Download arxiv paper 2405.12345. Report number of pages and total word count.
    → File cached + stats
  2. Summarize by section
    Read the paper. Give me a section-by-section summary. For each section: goal, key points, any equations worth understanding (in plain English).
    → Structured walkthrough
  3. Ask targeted questions
    Specific Q: [your question]. Answer only from the paper; cite the section and any equation numbers.
    → Grounded answer with cites

Result: Paper-level understanding in 20 minutes instead of 2 hours.

Pitfalls
  • PDF extraction mangles equations and tables: for math-heavy papers, ask Claude to note 'equation extraction may be unreliable' and cross-check critical formulas against the PDF
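To decide which passages to cross-check, one option is to scan the extracted text for lines that look damaged. The following is a crude, hypothetical heuristic (not part of the server): it flags lines containing the Unicode replacement character, `(cid:N)` tokens that some PDF extractors emit for unmapped glyphs, or an unusually high share of symbols. The threshold and minimum length are arbitrary assumptions, so expect false positives.

```python
import re

CID_TOKEN = re.compile(r"\(cid:\d+\)")


def suspicious_lines(text, min_symbol_ratio=0.4, min_length=8):
    """Return (line_number, line) pairs that look like damaged math or tables."""
    flagged = []
    for number, line in enumerate(text.splitlines(), start=1):
        stripped = line.strip()
        if not stripped:
            continue
        # Count characters that are neither letters/digits nor whitespace.
        symbols = sum(1 for ch in stripped if not (ch.isalnum() or ch.isspace()))
        noisy = (len(stripped) >= min_length
                 and symbols / len(stripped) >= min_symbol_ratio)
        if "\ufffd" in stripped or CID_TOKEN.search(stripped) or noisy:
            flagged.append((number, line))
    return flagged
```

The flagged line numbers are only a reading list for manual comparison against the PDF, not a verdict on correctness.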

Weekly digest of new papers in your field

👤 Academics, ML engineers tracking a subfield ⏱ ~15 min beginner

When to use: Monday morning: 'what's new in cs.CL submitted in the last 7 days that's worth reading?'

Steps
  1. Pull recent submissions
    Search arXiv cs.CL submissions in the last 7 days. Return up to 50, ranked by relevance if a ranking is available, otherwise in any order.
    → Recent papers list
  2. Filter by keywords you care about
    Keep only papers whose title or abstract mentions [your keywords]. Dedupe.
    → Narrowed shortlist
  3. Abstract digest
    For each kept paper, generate 2-line 'why it might matter' from the abstract. Mark 3 as must-reads.
    → Weekly digest

Result: A curated weekly reading list without doomscrolling arxiv-sanity.

Pitfalls
  • Abstracts oversell, so the 'must-read' tag can be wrong: treat the tag as a prompt to read the abstract yourself, not as an endorsement
Combine with: notion
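Step 2 above (keyword filter plus dedupe) can be sketched in a few lines. This assumes each search result is a dict with `id`, `title` and `abstract` keys, which is an illustrative shape rather than the server's documented output; `filter_and_dedupe` is a hypothetical helper.

```python
import re


def filter_and_dedupe(papers, keywords):
    """Keep papers mentioning any keyword in title or abstract, one per arXiv id."""
    wanted = [k.lower() for k in keywords]
    seen, kept = set(), []
    for paper in papers:
        # Strip a trailing version suffix so v1 and v2 count as the same paper.
        base_id = re.sub(r"v\d+$", "", paper["id"])
        haystack = f"{paper['title']} {paper['abstract']}".lower()
        if base_id in seen or not any(k in haystack for k in wanted):
            continue
        seen.add(base_id)
        kept.append(paper)
    return kept
```

Matching is a plain case-insensitive substring test, so a keyword like 'rag' would also match 'fragment'; word-boundary matching is a reasonable refinement if that becomes noisy.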

Combinations

Pair with other MCPs for 10x impact

arxiv + qdrant

Build a searchable library of papers for semantic recall

Download the top 30 papers on 'mixture of experts'. Index each chunk into Qdrant collection papers_moe. Later answer: 'what tricks do MoE papers use for load balancing?'

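"Index each chunk" presupposes that the extracted text has been split first. A minimal sketch of overlapping word-window chunking; the chunk size and overlap are arbitrary assumptions, `chunk_words` is a hypothetical helper, and the embedding and Qdrant upsert steps are left out because they depend on the embedding model and client in use.

```python
def chunk_words(text, chunk_size=200, overlap=40):
    """Split text into overlapping word windows suitable for embedding."""
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be non-negative and smaller than chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # this window already reached the end of the text
    return chunks
```

The overlap keeps a sentence that straddles a boundary retrievable from at least one chunk, at the cost of storing some words twice.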
arxiv + filesystem

Write a markdown survey file with inline citations

Download 10 papers on topic X, save summaries to /research/survey-X.md with [arxiv:id] links.

arxiv + notion

Post a weekly paper digest to a Notion research DB

Run the weekly digest for cs.CL, create a Notion page with the 5 must-reads as rows.

Tools

What this MCP provides

Tool | Inputs | When to call | Cost
search_papers | query: str, category?, max_results?, date_range? | Discover relevant papers by query/category/date | free
download_paper | paper_id | Cache a PDF locally for extraction | free
read_paper | paper_id | Extract text from a cached paper for reading/QA | free
list_papers | (none) | See what's already downloaded to avoid re-fetch | free
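On the wire, an MCP client invokes these tools with JSON-RPC 2.0 `tools/call` requests. A sketch of what such messages could look like: the envelope follows the MCP specification, the tool and argument names come from the table above, and the argument values and exact formats are assumptions (the sample id is the one used in the deep-read example). `tool_call` is a hypothetical helper.

```python
import json


def tool_call(request_id, name, arguments):
    """Build an MCP tools/call request as a JSON-RPC 2.0 message."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


# Argument names follow the tools table; value formats are assumptions.
search = tool_call(1, "search_papers",
                   {"query": "speculative decoding", "category": "cs.CL",
                    "max_results": 20})
read = tool_call(2, "read_paper", {"paper_id": "2405.12345"})

print(json.dumps(search, indent=2))
```

In normal use the client (Claude Desktop, Cursor, and so on) builds these messages itself; the sketch is mainly useful when debugging the server by hand.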

Cost & limits

What it costs to run

API quota: the arXiv query API recommends about 1 request every 3 seconds; higher rates may be throttled
Tokens per call: search, 500–2000 tokens; paper text, 5k–30k tokens per paper
Cost in €: free
Tip: cache aggressively; once a paper is downloaded, re-reading its extracted text needs no further arXiv requests.
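For scripts that call the arXiv API directly, the recommended pacing can be enforced with a small throttle. A minimal sketch, assuming a single-threaded caller; `Throttle` is a hypothetical helper, and the clock and sleep functions are injectable only so the behavior can be checked without real waiting.

```python
import time


class Throttle:
    """Space out calls so consecutive ones are at least min_interval seconds apart."""

    def __init__(self, min_interval=3.0, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._last = None  # start time of the previous call, if any

    def wait(self):
        """Block until the next request is allowed, then record its start time."""
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now
```

Create one `Throttle()` and call `wait()` immediately before each request; the first call returns at once and later calls sleep only for whatever part of the 3 seconds has not yet elapsed.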

Security

Permissions, secrets, blast radius

Credential storage: None needed
Data egress: Queries to export.arxiv.org; PDF downloads from arxiv.org

Troubleshooting

Common errors and fixes

Empty search results for a clearly existing topic

arXiv search is keyword-exact for quoted strings; try broader terms and the correct category prefix (cs.CL vs cs.AI).

Download failed / PDF unavailable

Very rare; some withdrawn papers 404. Confirm the id on arxiv.org/abs/<id>.
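Before blaming the paper, it can help to rule out a malformed id. A small sketch that checks an id against arXiv's two identifier schemes and builds the abstract URL to open in a browser; `abs_url` is a hypothetical helper and the patterns are a deliberately loose approximation of the schemes.

```python
import re

# New-style ids (since April 2007): YYMM.NNNN or YYMM.NNNNN, optional version.
NEW_STYLE = re.compile(r"^\d{4}\.\d{4,5}(v\d+)?$")
# Old-style ids: archive[.SUBJECT]/YYMMNNN, optional version.
OLD_STYLE = re.compile(r"^[a-z-]+(\.[A-Z]{2})?/\d{7}(v\d+)?$")


def abs_url(paper_id):
    """Return the arxiv.org abstract URL for a well-formed id, else raise ValueError."""
    paper_id = paper_id.strip()
    if not (NEW_STYLE.match(paper_id) or OLD_STYLE.match(paper_id)):
        raise ValueError(f"not a recognizable arXiv id: {paper_id!r}")
    return f"https://arxiv.org/abs/{paper_id}"
```

A well-formed id can still point at nothing, so the URL check in the browser remains the actual confirmation.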

Extracted text is garbled

Some math-heavy papers have non-standard PDFs; try the source version if available, else note the limitation.

Alternatives

arXiv vs. others

Alternative | When to use instead | Trade-off
Semantic Scholar MCP | You need citation counts and influence metrics | Not arXiv-specific; coverage varies
Papers with Code MCP | You want papers with code implementations linked | Smaller catalog, ML-focused

More

Resources

📖 Read the official README on GitHub

🐙 View open issues
