PepFold

Dataset

PGx Atlas

Pre-computed peptide candidates for 50 clinically validated pharmacogenomic variants

What’s included

  • 50 variants from CPIC Level A, FDA-labeled biomarkers, and PharmGKB VIP genes
  • ClinVar annotation and clinical significance for every variant
  • UniProt protein targets mapped per variant
  • Ranked peptide candidates with full 10-dimensional pharmacogenomic scores
  • ESMFold-predicted 3D structures for each candidate peptide
  • Fmoc-SPPS synthesis protocols ready for bench validation

JSON

Structured, machine-readable. Ideal for pipeline integration.

CSV

Flat tabular format. Import directly into R, Python, or Excel.

HTML

Human-readable reports with embedded 3D viewers and protocol details.

Sample variants

10 of 50 variants included in the atlas

rsIDGeneVariantClinical RelevanceTier
rs1801133MTHFRC677TFolate metabolismA
rs429358APOEε4Alzheimer’s riskB
rs4244285CYP2C19*2Clopidogrel responseA
rs1799853CYP2C9*2Warfarin dosingA
rs9923231VKORC1-1639G>AWarfarin sensitivityA
rs3745274CYP2B6*6Efavirenz metabolismA
rs1065852CYP2D6*10Codeine metabolismA
rs1800460TPMT*3CThiopurine toxicityA
rs4149056SLCO1B1*5Statin myopathyA
rs1799971OPRM1A118GOpioid responseB

Tier A = CPIC Level A evidence. Tier B = FDA-labeled or PharmGKB VIP.

Licensing

Sample

Free

5 variants, JSON only

Most popular

Research License

5,000EUR

50 variants, all formats

Commercial License

15,000EUR / year

50 variants + quarterly updates

Custom Panel

Custom

Your variants, your formats

Use cases

Drug discovery pipeline enrichment

Inject pre-scored peptide candidates into your screening workflow. Skip months of variant annotation and peptide generation.

Academic pharmacogenomics research

Cite curated PGx data in publications and grant applications. Ready-made datasets for cohort studies.

CRO screening panel design

Build client-facing panels from validated variant-peptide pairs. Each entry includes synthesis protocols.

Clinical decision support prototyping

Prototype CDS tools with real pharmacogenomic data. 10D scores map directly to clinical relevance tiers.

How scores are computed

Each peptide candidate is scored across 10 pharmacogenomic dimensions: binding affinity, Grant 12D structural projection, structural stability, clinical relevance, protease resistance, ADMET properties, permeability, aggregation propensity, novelty, and predicted half-life. Read the full methodology

Need a custom variant panel?

Submit your own rsID lists, gene panels, or patient cohort data. We generate the atlas for your specific research needs.

CPIC & PharmGKB sourcedESMFold validated structuresUpdated quarterly