SPSmart: adapting population based SNP genotype databases for fast and comprehensive web access
Loading...
Identifiers
Publication date
Advisors
Tutors
Editors
Journal Title
Journal ISSN
Volume Title
Publisher
BMC
Abstract
Background: In the last five years large online resources of human variability have appeared,
notably HapMap, Perlegen and the CEPH foundation. These databases of genotypes with population
information act as catalogues of human diversity, and are widely used as reference sources for
population genetics studies. Although many useful conclusions may be extracted by querying
databases individually, the lack of flexibility for combining data from within and between each
database does not allow the calculation of key population variability statistics.
Results: We have developed a novel tool for accessing and combining large-scale genomic
databases of single nucleotide polymorphisms (SNPs) in widespread use in human population
genetics: SPSmart (SNPs for Population Studies). A fast pipeline creates and maintains a data mart
from the most commonly accessed databases of genotypes containing population information: data
is mined, summarized into the standard statistical reference indices, and stored into a relational
database that currently handles as many as 4 × 109 genotypes and that can be easily extended to
new database initiatives. We have also built a web interface to the data mart that allows the
browsing of underlying data indexed by population and the combining of populations, allowing
intuitive and straightforward comparison of population groups. All the information served is
optimized for web display, and most of the computations are already pre-processed in the data
mart to speed up the data browsing and any computational treatment requested.
Conclusion: In practice, SPSmart allows populations to be combined into user-defined groups,
while multiple databases can be accessed and compared in a few simple steps from a single query.
It performs the queries rapidly and gives straightforward graphical summaries of SNP population
variability through visual inspection of allele frequencies outlined in standard pie-chart format. In
addition, full numerical description of the data is output in statistical results panels that include
common population genetics metrics such as heterozygosity, Fst and In.
Description
Keywords
Bibliographic citation
Amigo, J., Salas, A., Phillips, C. et al. SPSmart: adapting population based SNP genotype databases for fast and comprehensive web access. BMC Bioinformatics 9, 428 (2008). https://doi.org/10.1186/1471-2105-9-428
Relation
Has part
Has version
Is based on
Is part of
Is referenced by
Is version of
Requires
Publisher version
https://doi.org/10.1186/1471-2105-9-428Sponsors
The grants from the Xunta de Galicia (PGIDIT06PXIB208079PR) and Fundación de Investigación Médica Mutua Madrileña awarded to AS partially
supported this project
Rights
© 2008 Amigo et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited








