The first bioinformatics-native AI agent skill library.
Built on OpenClaw (180k+ GitHub stars). Local-first. Privacy-focused. Reproducible.
A community contributor built a nutrigenomics skill and ran it β from raw genetic data to personalised nutrition report with radar charts, heatmaps, and reproducibility bundle:
https://github.com/ClawBio/ClawBio/releases/download/v0.2.0/david-nutrigx-demo.mp4
PharmGx Reporter: 12 genes, 51 drugs, under 1 second
You read a paper. You want to reproduce Figure 3. So you:
/home/jsmith/data/.Now imagine the same paper published a skill:
python ancestry_pca.py --demo --output fig3
# Figure 3 reproduced. Identical. SHA-256 verified. 30 seconds.
Thatβs ClawBio. Every figure in your paper should be one command away from reproduction.
A skill is a domain expertβs knowledge β frozen into code β that an AI agent executes correctly every time.
ChatGPT / Claude = a smart generalist who guesses at bioinformatics
π¦ ClawBio skill = a domain expert's proven pipeline that the AI executes
commands.sh, environment.yml, and SHA-256 checksums. Anyone can reproduce it without the agent.SKILL.md + Python scripts) that plugs into the orchestrator.Ask Claude to βprofile my pharmacogenes from this 23andMe file.β Itβll write plausible Python. But:
ClawBio encodes the correct bioinformatics decisions so the agent gets it right first time, every time.
Every ClawBio analysis ships with a reproducibility bundle β not as an afterthought, but as part of the output:
report/
βββ report.md # Full analysis with figures and tables
βββ figures/ # Publication-quality PNGs
βββ tables/ # CSV data tables
βββ commands.sh # Exact commands to reproduce
βββ environment.yml # Conda environment snapshot
βββ checksums.sha256 # SHA-256 of every input and output file
Why this matters: a reviewer can re-run your analysis in 30 seconds. A collaborator can reproduce your Figure 3 without emailing you. Future-you can regenerate results two years later from the same bundle.
| Skill | Status | Description |
|---|---|---|
| Bio Orchestrator | MVP | Routes bioinformatics requests to the right specialist skill |
| PharmGx Reporter | MVP | Pharmacogenomic report: 12 genes, 51 drugs, CPIC guidelines |
| Ancestry PCA | MVP | PCA decomposition vs SGDP (345 samples, 164 global populations) |
| Semantic Similarity | MVP | Semantic Isolation Index for 175 GBD diseases from 13.1M PubMed abstracts |
| Equity Scorer | Planned | HEIM diversity metrics from VCF/ancestry data |
| VCF Annotator | Planned | Variant annotation with VEP, ClinVar, gnomAD + ancestry context |
| Lit Synthesizer | Planned | PubMed/bioRxiv search with LLM summarisation and citation graphs |
| scRNA Orchestrator | Planned | Scanpy automation: QC, clustering, DE analysis, visualisation |
| Struct Predictor | Planned | AlphaFold/Boltz local structure prediction |
| Repro Enforcer | Planned | Export any analysis as Conda env + Singularity + Nextflow pipeline |
Generates a pharmacogenomic report from consumer genetic data (23andMe, AncestryDNA):
python pharmgx_reporter.py --input demo_patient.txt --output report
Demo result: CYP2D6 *4/*4 (Poor Metabolizer) β 10 drugs AVOID (codeine, tramadol, 7 TCAs, tamoxifen), 20 caution, 21 standard.
~7% of people are CYP2D6 Poor Metabolizers β codeine gives them zero pain relief. ~0.5% carry DPYD variants where standard 5-FU dose can be lethal. This skill catches both.
Runs principal component analysis on your cohort against the SGDP reference panel (345 samples, 164 global populations):
python ancestry_pca.py --demo --output ancestry_report
Demo result: 736 Peruvian samples across 28 indigenous populations. Amazonian groups (Matzes, Awajun, Candoshi) sit in genetic space that no SGDP population occupies β genuinely underrepresented, not just in GWAS, but in the reference panels themselves.
Computes a Semantic Isolation Index for diseases using 13.1M PubMed abstracts and PubMedBERT embeddings (768-dim):
python semantic_sim.py --demo --output sem_report
Key finding: Neglected tropical diseases are +38% more semantically isolated (P < 0.0001, Cohenβs d = 0.84). 14 of the 25 most isolated diseases are Global South priority conditions. Knowledge silos kill innovation β a malaria immunology breakthrough could help leishmaniasis, but the literatures donβt talk to each other.
Corpas et al. (2026). HEIM: Health Equity Index for Measuring structural bias in biomedical research. Under review.
# Install a skill
openclaw install skills/pharmgx-reporter
# Run with natural language
openclaw "Profile the pharmacogenes in my 23andMe file at data/raw_genotype.txt"
# Or run directly
python skills/pharmgx-reporter/pharmgx_reporter.py --input data/raw_genotype.txt --output report
Every skill includes demo data so you can try it immediately without your own files.
User: "Analyse the diversity in my VCF file"
β
ββββββββΌβββββββ
β Bio β β routes by file type + keywords
β Orchestratorβ
ββββββββ¬βββββββ
β
ββββββββΌβββββββββββββββββββββββββββββββββββββββββββ
β β
PharmGx Ancestry Semantic Equity VCF
Reporter PCA Similarity Scorer Annotator ...
β β
ββββββββ¬βββββββββββββββββββββββββββββββββββββββββββ
β
ββββββββΌβββββββ
β Markdown β β report + figures + checksums
β Report β + reproducibility bundle
βββββββββββββββ
Each skill is standalone β the orchestrator routes to the right one, but every skill also works independently.
See docs/architecture.md for the full design.
We want skills from the bioinformatics community. If you work with genomics, proteomics, metabolomics, imaging, or clinical data β wrap your pipeline as a skill.
| Skill | What | Your expertise |
|---|---|---|
| claw-gwas | PLINK/REGENIE automation | Statistical genetics |
| claw-metagenomics | Kraken2/MetaPhlAn wrapper | Microbiome |
| claw-acmg | Clinical variant classification | Clinical genomics |
| claw-pathway | GO/KEGG enrichment | Functional genomics |
| claw-phylogenetics | IQ-TREE/RAxML automation | Evolutionary biology |
| claw-proteomics | MaxQuant/DIA-NN | Proteomics |
| claw-spatial | Visium/MERFISH | Spatial transcriptomics |
See CONTRIBUTING.md for the submission process and templates/SKILL-TEMPLATE.md for the skill template.
ClawBio was announced at the London Bioinformatics Meetup on 26 February 2026.
If you use ClawBio in your research, please cite:
@software{clawbio_2026,
author = {Corpas, Manuel},
title = {ClawBio: An Open-Source Library of AI Agent Skills for Reproducible Bioinformatics},
year = {2026},
url = {https://github.com/ClawBio/ClawBio}
}
MIT β clone it, run it, build a skill, submit a PR. π¦