SciAgent-Skills

为什么要用

核心特性

197 specialized life-science skills
Coverage: RNA-seq, single-cell (scRNA), drug discovery, proteomics, variant calling
BixBench accuracy 92.0% — higher than generic agents
Canonical tool invocation (samtools, bcftools, salmon, scanpy, Seurat, etc.)
Reproducibility patterns (conda/mamba envs, config files)
Powers OmicsHorizon platform

实时演示

实际使用效果

sciagent-skill.replay ▶ 就绪

0/0

安装

选择你的客户端

~/Library/Application Support/Claude/claude_desktop_config.json · Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "sciagent-skill": {
      "command": "git",
      "args": [
        "clone",
        "https://github.com/jaechang-hits/SciAgent-Skills",
        "~/.claude/skills/SciAgent-Skills"
      ],
      "_inferred": true
    }
  }
}

打开 Claude Desktop → Settings → Developer → Edit Config。保存后重启应用。

~/.cursor/mcp.json · .cursor/mcp.json

{
  "mcpServers": {
    "sciagent-skill": {
      "command": "git",
      "args": [
        "clone",
        "https://github.com/jaechang-hits/SciAgent-Skills",
        "~/.claude/skills/SciAgent-Skills"
      ],
      "_inferred": true
    }
  }
}

Cursor 使用与 Claude Desktop 相同的 mcpServers 格式。项目级配置优先于全局。

VS Code → Cline → MCP Servers → Edit

{
  "mcpServers": {
    "sciagent-skill": {
      "command": "git",
      "args": [
        "clone",
        "https://github.com/jaechang-hits/SciAgent-Skills",
        "~/.claude/skills/SciAgent-Skills"
      ],
      "_inferred": true
    }
  }
}

点击 Cline 侧栏中的 MCP Servers 图标，然后选 "Edit Configuration"。

~/.codeium/windsurf/mcp_config.json

{
  "mcpServers": {
    "sciagent-skill": {
      "command": "git",
      "args": [
        "clone",
        "https://github.com/jaechang-hits/SciAgent-Skills",
        "~/.claude/skills/SciAgent-Skills"
      ],
      "_inferred": true
    }
  }
}

格式与 Claude Desktop 相同。重启 Windsurf 生效。

~/.continue/config.json

{
  "mcpServers": [
    {
      "name": "sciagent-skill",
      "command": "git",
      "args": [
        "clone",
        "https://github.com/jaechang-hits/SciAgent-Skills",
        "~/.claude/skills/SciAgent-Skills"
      ]
    }
  ]
}

Continue 使用服务器对象数组，而非映射。

~/.config/zed/settings.json

{
  "context_servers": {
    "sciagent-skill": {
      "command": {
        "path": "git",
        "args": [
          "clone",
          "https://github.com/jaechang-hits/SciAgent-Skills",
          "~/.claude/skills/SciAgent-Skills"
        ]
      }
    }
  }
}

加入 context_servers。Zed 保存后热重载。

claude mcp add sciagent-skill -- git clone https://github.com/jaechang-hits/SciAgent-Skills ~/.claude/skills/SciAgent-Skills

一行命令搞定。用 claude mcp list 验证，claude mcp remove 卸载。

使用场景

实战用法： SciAgent-Skills

How to run a bulk RNA-seq differential expression analysis

👤 Biologists and bioinformaticians with FASTQ files and a need for DE results ⏱ ~120 min advanced

何时使用： You have paired-end FASTQ, a reference, and a condition table.

前置条件

Skill installed — git clone https://github.com/jaechang-hits/SciAgent-Skills ~/.claude/skills/sciagent-skills
Conda/mamba + compute — Use a Linux server or HPC; macOS fine for small datasets

步骤

Plan the pipeline

Plan a bulk RNA-seq DE workflow for 2 conditions x 3 reps with salmon + tximport + DESeq2. Output expected files per step.✓ 已复制

→ Step list with tools and intermediate outputs
Run quantification

Run salmon quant for each sample; produce a script.✓ 已复制

→ Bash script with salmon invocations
DE analysis

Load quants via tximport, run DESeq2, produce MA + volcano plots and a top-50 gene table.✓ 已复制

→ R script + output files

结果： A DE table + plots you can hand to a PI or collaborator.

注意事项

Mis-matched reference versus annotation GTF — Check release numbers explicitly; salmon and DESeq2 can silently run on mismatched IDs
Low-count filtering removes biological signal — Use independent filtering or raise threshold gradually; don't blanket filter

Single-cell RNA-seq QC and clustering baseline

👤 Researchers starting scRNA analysis from 10x output ⏱ ~90 min advanced

何时使用： You have a cellranger output directory and need a first-pass Seurat or Scanpy analysis.

步骤

Load + QC

Load 10x data from cellranger output, do QC (pct.mt, counts, features), filter, normalize.✓ 已复制

→ QC plots + filtered object
Cluster

Run PCA, find neighbors, cluster at resolution 0.5, and UMAP.✓ 已复制

→ Cluster labels + UMAP plot
Marker genes

Find cluster markers, produce a top-5-per-cluster heatmap.✓ 已复制

→ Marker table + heatmap

结果： An annotated UMAP and marker table ready for biological interpretation.

注意事项

Over-clustering at the default resolution — Try a sweep 0.2-1.0 and pick based on silhouette + biological plausibility

Germline variant calling from WGS

👤 Genomics teams calling SNPs and indels ⏱ ~180 min advanced

何时使用： You have aligned BAMs and need a VCF using best-practices (GATK).

步骤

Plan pipeline

Plan GATK best-practices germline pipeline from aligned BAMs: BQSR, HaplotypeCaller, joint genotyping with GenomicsDBImport.✓ 已复制

→ Pipeline outline with expected runtimes
Generate scripts

Produce Snakemake rules for each step.✓ 已复制

→ Snakefile with rules and config

结果： A reproducible Snakemake pipeline ready to run on your cluster.

注意事项

Skipping BQSR on small cohorts — Still do it — GATK's downstream filters assume recalibrated quality scores

Triage small-molecule hits from a screen

👤 Med-chem groups triaging HTS results ⏱ ~60 min advanced

何时使用： You have a hit list and want to rank by drug-likeness + novelty.

步骤

Filter by Lipinski + PAINS

Compute Lipinski and PAINS flags on this SMILES list, output a filtered table.✓ 已复制

→ RDKit-based script + filtered CSV
Similarity to known drugs

For remaining hits, compute Tanimoto similarity to ChEMBL approved drugs; flag >0.85 as known-scaffold.✓ 已复制

→ Similarity table with flags

结果： A triaged hit list prioritized for follow-up.

组合

与其他 MCP 搭配，撬动十倍杠杆

sciagent-skill + filesystem

Read local FASTQ/BAM files and have SciAgent plan the analysis pipeline on actual data

List the FASTQ files in ~/data/rnaseq/, then use SciAgent to plan a salmon + DESeq2 pipeline for these samples.✓ 已复制

sciagent-skill + github

Search nf-core or Bioconductor repos for reference implementations, then adapt with SciAgent

Search nf-core/rnaseq on GitHub for how they handle trimming, then use SciAgent to adapt that step for my pipeline.✓ 已复制

工具

此 MCP 暴露的能力

工具	输入参数	何时调用	成本
Bulk RNA-seq workflow	FASTQ + design	Standard DE analysis	compute
scRNA-seq workflow	10x output	Single-cell baseline	compute
Variant calling	BAMs	WGS/WES cohorts	compute
Proteomics analysis	MS data	MS-based proteomics	compute
Drug discovery triage	SMILES list	Hit triage	compute

成本与限制

运行它的成本

API 配额: None at skill level
每次调用 Token 数: 10-50k per pipeline design
费用: Free skills; compute costs depend on dataset size
提示: Plan the pipeline as a script first, then run; don't keep Claude in the loop during long compute.

安全

权限、密钥、影响范围

凭据存储： No credentials

数据出站： Pipeline designs and snippets go to Claude. Actual patient data should stay on HIPAA-compliant compute.

Do not paste patient identifiers or raw sequence data that could be re-identified into prompts.
Always review generated variant-calling parameters — wrong caller settings can produce misleading results.

故障排查

常见错误与修复

tximport fails on missing transcript-to-gene mapping

Regenerate tx2gene from the same GTF you used for the salmon index; mismatch is the usual cause.

Seurat object too large for available RAM

Subsample or switch to on-disk storage with BPCells or DelayedArray-backed Seurat

Conda environment conflicts during pipeline setup

Use mamba instead of conda for faster resolution; pin exact versions in environment.yml rather than floating constraints

验证： mamba env create -f environment.yml --dry-run

Generated script references wrong genome build (hg19 vs hg38)

Explicitly state the genome build in your first prompt; SciAgent defaults may not match your data

替代方案

SciAgent-Skills 对比其他方案

替代方案	何时用它替代	权衡
Galaxy / nf-core	You want audited, community pipelines rather than LLM-generated scripts	Less conversational; slower to customize

为什么要用

核心特性

实时演示

实际使用效果

安装

选择你的客户端

使用场景

实战用法： SciAgent-Skills

How to run a bulk RNA-seq differential expression analysis

前置条件

步骤

注意事项

Single-cell RNA-seq QC and clustering baseline

步骤

注意事项

Germline variant calling from WGS

步骤

注意事项

Triage small-molecule hits from a screen

步骤

组合

与其他 MCP 搭配，撬动十倍杠杆

工具

此 MCP 暴露的能力

成本与限制

运行它的成本

安全

权限、密钥、影响范围

故障排查

常见错误与修复

替代方案

SciAgent-Skills 对比其他方案

更多

资源