The variant call format and VCFtools.

Publication Dbxref
PMID:21653522
Structured Abstract Part
  • SUMMARY
    The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API.

  • AVAILABILITY
    http://vcftools.sourceforge.net

Title
The variant call format and VCFtools.
Publication Type
Journal Article
Additional Publication Type(s)
  • Research Support, N.I.H., Extramural
  • Research Support, N.I.H., Intramural
  • Research Support, Non-U.S. Gov't
Series Name
Bioinformatics (Oxford, England)
Volume
27
Publication Year
2011
Issue
15
Page Numbers
2156-8
DOI
10.1093/bioinformatics/btr330
Journal Abbreviation
Bioinformatics
EISSN
1367-4811
Publication Date
2011 Aug 01
Unique Local Identifier

Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, McVean G, Durbin R, 1000 Genomes Project Analysis Group. The variant call format and VCFtools.. Bioinformatics (Oxford, England). 2011 Aug 01; 27(15):2156-8.

Citation
Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, McVean G, Durbin R, 1000 Genomes Project Analysis Group. The variant call format and VCFtools.. Bioinformatics (Oxford, England). 2011 Aug 01; 27(15):2156-8.
ISSN
1367-4811
Language Abbr
eng
Publication Model
Print-Electronic
Authors
Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, McVean G, Durbin R, 1000 Genomes Project Analysis Group
Language
English
Elocation
10.1093/bioinformatics/btr330
Journal Country
England
Abstract

SUMMARY
The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API.

AVAILABILITY
http://vcftools.sourceforge.net

Database Reference Annotations
Is Obsolete
False