You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output.
Click here to find out more.
Mendeley readers
Chapter title |
Overview of Sequence Data Formats
|
---|---|
Chapter number | 1 |
Book title |
Statistical Genomics
|
Published in |
Methods in molecular biology, January 2016
|
DOI | 10.1007/978-1-4939-3578-9_1 |
Pubmed ID | |
Book ISBNs |
978-1-4939-3576-5, 978-1-4939-3578-9
|
Authors |
Hongen Zhang, Zhang, Hongen |
Editors |
Ewy Mathé, Sean Davis |
Abstract |
Next-generation sequencing experiment can generate billions of short reads for each sample and processing of the raw reads will add more information. Various file formats have been introduced/developed in order to store and manipulate this information. This chapter presents an overview of the file formats including FASTQ, FASTA, SAM/BAM, GFF/GTF, BED, and VCF that are commonly used in analysis of next-generation sequencing data. |
Mendeley readers
The data shown below were compiled from readership statistics for 62 Mendeley readers of this research output. Click here to see the associated Mendeley record.
Geographical breakdown
Country | Count | As % |
---|---|---|
Sweden | 1 | 2% |
Unknown | 61 | 98% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Master | 11 | 18% |
Student > Bachelor | 8 | 13% |
Student > Ph. D. Student | 6 | 10% |
Student > Doctoral Student | 3 | 5% |
Researcher | 3 | 5% |
Other | 7 | 11% |
Unknown | 24 | 39% |
Readers by discipline | Count | As % |
---|---|---|
Biochemistry, Genetics and Molecular Biology | 15 | 24% |
Agricultural and Biological Sciences | 9 | 15% |
Computer Science | 5 | 8% |
Engineering | 3 | 5% |
Immunology and Microbiology | 2 | 3% |
Other | 5 | 8% |
Unknown | 23 | 37% |