You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output.
Click here to find out more.
Mendeley readers
Chapter title |
Overview of Sequence Data Formats
|
---|---|
Chapter number | 1 |
Book title |
Statistical Genomics
|
Published in |
Methods in molecular biology, January 2016
|
DOI | 10.1007/978-1-4939-3578-9_1 |
Pubmed ID | |
Book ISBNs |
978-1-4939-3576-5, 978-1-4939-3578-9
|
Authors |
Hongen Zhang, Zhang, Hongen |
Editors |
Ewy Mathé, Sean Davis |
Abstract |
Next-generation sequencing experiment can generate billions of short reads for each sample and processing of the raw reads will add more information. Various file formats have been introduced/developed in order to store and manipulate this information. This chapter presents an overview of the file formats including FASTQ, FASTA, SAM/BAM, GFF/GTF, BED, and VCF that are commonly used in analysis of next-generation sequencing data. |
Mendeley readers
The data shown below were compiled from readership statistics for 66 Mendeley readers of this research output. Click here to see the associated Mendeley record.
Geographical breakdown
Country | Count | As % |
---|---|---|
Sweden | 1 | 2% |
Unknown | 65 | 98% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Master | 11 | 17% |
Student > Bachelor | 8 | 12% |
Student > Doctoral Student | 8 | 12% |
Student > Ph. D. Student | 6 | 9% |
Researcher | 3 | 5% |
Other | 8 | 12% |
Unknown | 22 | 33% |
Readers by discipline | Count | As % |
---|---|---|
Biochemistry, Genetics and Molecular Biology | 15 | 23% |
Computer Science | 10 | 15% |
Agricultural and Biological Sciences | 9 | 14% |
Engineering | 3 | 5% |
Unspecified | 2 | 3% |
Other | 6 | 9% |
Unknown | 21 | 32% |