TAGster: Single or Multi-Population Tag SNPs Selection, Evaluation and Visualization
National Institute of Environmental Health
Division of Intramural Research
Molecular & Genetic Epidemiology Group
TAGster is a tool to select, evaluate and visualize LD tag SNPs for single or multiple populations. The input files can be a set of dumped genotype files from
International HapMap Project (Hapmap format) or
Seattle SNPs (Prettybase format). The output is a set of LD tag SNPs for single or multiple populations.
The TAGster also creates figures for LD structure and genotype data with many customized tracks to facilitate investigators visually checking and optimizing tag SNPs.
The TAGster uses composite linkage disequilibrium (CLD) coefficient or r2 to measure patterns of linkage disequilibrium between polymorphic sites in a genome region within each of multiple populations, and employs 3 methods to select LD tag SNPs:
- A greedy algorithm to select a set containing a near-minimal number of LD tag SNPs for a single or multiple populations
- A computation efficient exhaustive method to select minimal number of tag SNPs for a single population
- A two-stage optimal method to select a set of near-minimal number of tag SNPs for multiple populations
- Windows Binary: tagster_win.zip(1MB)
- Linux Binary: tagster_linux.zip(1MB)
- Macintosh Binary: tagster_mac.zip(2MB)
- Software Manual: tagster_manual.pdf(215KB)
- Detailed Algorithms: tagster_algorithm.pdf(70KB)
- Evaluation Results: tagster_evaluation.pdf(133KB)
- Tagster Tutorial: tagster_tutorial.pdf(52KB)
- Source code: please contact
The TAGster was developed in and is maintained at the NIEHS by Zongli Xu, Norman L. Kaplan and Jack A Taylor.
Xu Z, Kaplan NL, Taylor JA. TAGster: efficient selection of LD tag SNPs in single or multiple populations. Bioinformatics. 2007 23:3254-5. Epub 2007 Sep 7.