What is an rsID?
Definition
An rsID (reference SNP cluster ID) is a unique identifier assigned by the NCBI dbSNP database to a specific genetic variant, typically a single nucleotide polymorphism (SNP). The format is 'rs' followed by a number — for example, rs429358 identifies the APOE4-defining variant. rsIDs serve as the universal language for referencing genetic variants across research, clinical testing, and bioinformatics tools.
Detailed Explanation
The dbSNP database, maintained by the National Center for Biotechnology Information (NCBI), has cataloged over 660 million human SNPs as of 2024. Each submission is assigned an rsID that remains stable across database builds, making it a reliable cross-reference. When a new variant is discovered and submitted, dbSNP checks whether it overlaps with an existing entry; if so, the existing rsID is used. This prevents duplication and ensures that the same variant always has the same identifier regardless of which research group discovered it or which genotyping platform detected it.
rsIDs are essential in clinical pharmacogenomics. Genetic test results from companies like 23andMe, AncestryDNA, or clinical pharmacogenomic panels report genotypes using rsIDs. A result like 'rs3892097 AG' tells a clinician that the patient is heterozygous for the CYP2D6*4 allele, which has implications for metabolizing drugs like codeine and tamoxifen. Clinical guidelines from CPIC and DPWG reference specific rsIDs when providing genotype-to-phenotype translations and dosing recommendations.
PepFold accepts rsIDs as its primary input format. Users enter between 1 and 20 rsIDs, and the pipeline resolves each one through ClinVar and UniProt to build a complete pharmacogenomic profile. The rsID-first approach means researchers and clinicians can go directly from a genetic test result to a peptide design report without manual variant annotation. Each rsID in the system links to a dedicated page with variant details, gene context, condition associations, and peptide design relevance.
Related Terms
A single nucleotide polymorphism (SNP, pronounced 'snip') is a variation at a single position in a DNA sequence among individuals. SNPs are the most common type of genetic variation in humans, with approximately 4-5 million SNPs per individual genome and over 660 million cataloged in the dbSNP database.
What is ClinVar?ClinVar is a freely accessible public database maintained by the National Center for Biotechnology Information (NCBI) that aggregates information about the relationships between human genetic variants and observed health conditions (phenotypes). Submitters — including clinical laboratories, research groups, and expert panels — classify variants using a standardized five-tier system: pathogenic, likely pathogenic, uncertain significance (VUS), likely benign, and benign.
What is Pharmacogenomics?Pharmacogenomics (PGx) is the study of how an individual's genetic makeup influences their response to medications. It combines pharmacology (the science of drugs) and genomics (the study of genes and their functions) to develop effective, personalized drug therapies based on a patient's DNA.
Related SNPs
Apply This Knowledge with PepFold
Submit rsIDs and get ranked peptide candidates with 3D structures and Fmoc-SPPS synthesis protocols in under 2 minutes.