prometheus-mcp-server — Установка & Живое демо

Зачем использовать

Ключевые функции

6 focused tools over the Prometheus API
PromQL instant + range queries with custom intervals
Metric discovery and metadata for agents that don't know your label space
Auth options: basic, bearer, mTLS, custom headers

Живое демо

Как выглядит на практике

prometheus.replay ▶ готово

0/0

Установка

Выберите клиент

~/Library/Application Support/Claude/claude_desktop_config.json · Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "prometheus": {
      "command": "uvx",
      "args": [
        "prometheus-mcp-server"
      ],
      "_inferred": true
    }
  }
}

Откройте Claude Desktop → Settings → Developer → Edit Config. Перезапустите после сохранения.

~/.cursor/mcp.json · .cursor/mcp.json

{
  "mcpServers": {
    "prometheus": {
      "command": "uvx",
      "args": [
        "prometheus-mcp-server"
      ],
      "_inferred": true
    }
  }
}

Cursor использует ту же схему mcpServers, что и Claude Desktop. Конфиг проекта приоритетнее глобального.

VS Code → Cline → MCP Servers → Edit

{
  "mcpServers": {
    "prometheus": {
      "command": "uvx",
      "args": [
        "prometheus-mcp-server"
      ],
      "_inferred": true
    }
  }
}

Щёлкните значок MCP Servers на боковой панели Cline, затем "Edit Configuration".

~/.codeium/windsurf/mcp_config.json

{
  "mcpServers": {
    "prometheus": {
      "command": "uvx",
      "args": [
        "prometheus-mcp-server"
      ],
      "_inferred": true
    }
  }
}

Тот же формат, что и Claude Desktop. Перезапустите Windsurf для применения.

~/.continue/config.json

{
  "mcpServers": [
    {
      "name": "prometheus",
      "command": "uvx",
      "args": [
        "prometheus-mcp-server"
      ]
    }
  ]
}

Continue использует массив объектов серверов, а не map.

~/.config/zed/settings.json

{
  "context_servers": {
    "prometheus": {
      "command": {
        "path": "uvx",
        "args": [
          "prometheus-mcp-server"
        ]
      }
    }
  }
}

Добавьте в context_servers. Zed перезагружается автоматически.

claude mcp add prometheus -- uvx prometheus-mcp-server

Однострочная команда. Проверить: claude mcp list. Удалить: claude mcp remove.

Сценарии использования

Реальные сценарии: prometheus-mcp-server

How to diagnose a latency spike with Prometheus + Claude

👤 On-call SREs ⏱ ~10 min intermediate

Когда использовать: A service p99 alert fires — you need context without memorizing PromQL.

Предварительные требования

Prometheus URL reachable — Set PROMETHEUS_URL in the MCP config; add auth if protected

Поток

Scope the spike

Query http request p99 latency for service X in the last hour, 30-second resolution. Compare to the last 7 days baseline.✓ Скопировано

→ Range query result showing the spike
Find correlated metrics

For the spike window, what other metrics for service X moved >2 sigma? CPU, memory, GC, queue depth?✓ Скопировано

→ Candidate culprit metrics
Narrow by label

Break down the spike by pod/host labels. Is it one pod or fleet-wide?✓ Скопировано

→ Per-label decomposition

Итог: A hypothesis tied to specific metrics in under 5 minutes.

Подводные камни

Query returns no data — Check label names with list_metrics — label casing and delimiters vary between exporters

Сочетать с: kubectl

Generate a weekly SLO compliance report from Prometheus

👤 SRE leads ⏱ ~25 min intermediate

Когда использовать: Friday SLO review — you want numbers not vibes.

Поток

Define the SLIs

For service X, compute this week's availability (success/total ratio) and latency (requests under threshold / total) as numbers.✓ Скопировано

→ Two ratios with burn rate
Compare to SLO

Availability SLO = 99.9%, latency SLO = 95%. Am I above or below? Project error budget exhaustion.✓ Скопировано

→ Verdict + days of budget remaining

Итог: Defensible SLO report with numbers not 'mostly fine'.

Сочетать с: google-sheets

Audit Prometheus scrape target health with Claude

👤 Platform engineers ⏱ ~15 min intermediate

Когда использовать: You suspect half your targets are down but haven't checked.

Поток

Get targets

Call get_targets. Group by job; which have any DOWN instances?✓ Скопировано

→ Table of job → up/down counts
Investigate

For the worst offender, show the lastError for the DOWN instances. Likely cause?✓ Скопировано

→ Actionable cause per target

Итог: Rescued scrapes in minutes.

Комбинации

Сочетайте с другими MCP — эффект x10

prometheus + kubectl

Pair metric anomalies with pod state

For the service with the latency spike, correlate Prometheus data with kubectl describe on its pods.✓ Скопировано

prometheus + sentry

Metric spike + error spike correlation

Sentry shows errors doubled at 14:00 — what Prometheus metrics moved at the same time?✓ Скопировано

Инструменты

Что предоставляет этот MCP

Инструмент	Входные данные	Когда вызывать	Стоимость
health_check		Verify connectivity	1 API call
execute_query	query: promql, time?	Instant snapshot	1 query
execute_range_query	query, start, end, step	Time-series analysis	1 query (may be expensive)
list_metrics	match?: str	Discovery when you don't know the metric name	1 API call
get_metric_metadata	metric: str	Understand units before computing	1 API call
get_targets		Scrape health	1 API call

Стоимость и лимиты

Во что обходится

Квота API: Prometheus scales with your server; expensive queries can stress it
Токенов на вызов: Range queries with many series can hit 10k+ tokens
Деньги: Free
Совет: Use step wisely on range queries; 10s resolution over 24h is 8640 samples per series

Безопасность

Права, секреты, радиус поражения

Минимальные скоупы: read-only access to Prometheus API

Хранение учётных данных: Bearer token or basic auth in env; mTLS cert paths if used

Исходящий трафик: Your Prometheus URL only

Устранение неполадок

Частые ошибки и исправления

Query returns empty with no error

Metric/label name doesn't exist. Use list_metrics with a match prefix to verify

Range query times out

Reduce time range or increase step. Prometheus query engine has per-query resource limits

401 with bearer token

Token lacks read permission on /api/v1; check reverse proxy if Prometheus is behind one

Проверить: curl -H 'Authorization: Bearer $T' $PROMETHEUS_URL/api/v1/status/config

Альтернативы

prometheus-mcp-server в сравнении

Альтернатива	Когда использовать	Компромисс
Grafana MCP	You already visualize in Grafana and want dashboard/alert ops	Heavier; more features than you may need
Datadog MCP	Datadog is your metrics store	Paid; different query language

Ещё

Ресурсы

📖 Читать официальный README на GitHub

🐙 Открытые задачи

🔍 Все 400+ MCP-серверов и Skills