Chapter title | Sequence Assembly |
---|---|
Chapter number | 2 |
Book title | Bioinformatics |
Published in | Methods in molecular biology, January 2017 |
DOI | 10.1007/978-1-4939-6622-6_2 |
Pubmed ID | |
Book ISBNs | 978-1-4939-6620-2, 978-1-4939-6622-6 |
Authors | Xiaoqiu Huang |
Editors | Jonathan M. Keith |
Abstract | We describe an efficient method for assembling short reads into long sequences. In this method, a hashing technique is used to compute overlaps between short reads, allowing base mismatches in the overlaps. An overlap graph is then constructed, with each vertex representing a read and each edge representing an overlap. The overlap graph is explored by graph algorithms to find unique paths of reads representing contigs. The consensus sequence of each contig is constructed by computing alignments of multiple reads without gaps. This strategy has been implemented as a short read assembly program called PCAP.Solexa. We also describe how to use PCAP.Solexa to assemble short reads. |
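
The four steps named in the abstract (hash-based overlap detection, overlap graph, unique paths, gapless consensus) can be illustrated with a small Python sketch. This is a toy under stated assumptions, not the PCAP.Solexa implementation: the function names, the parameters `k`, `min_overlap` and `max_mismatches`, and the rule that the first `k` bases of an overlap must match exactly are all hypothetical choices made for illustration. Reverse complements, contained reads, cycles, and removal of transitive edges are ignored.

```python
from collections import Counter, defaultdict


def find_overlaps(reads, k=4, min_overlap=5, max_mismatches=1):
    """Return {(i, j): offset}: reads[j] starts at `offset` inside reads[i].

    Assumption: the first k bases of reads[j] must match exactly (hash seed);
    mismatches are tolerated only in the rest of the overlap.
    """
    index = defaultdict(list)  # k-mer -> [(read index, position)]
    for i, read in enumerate(reads):
        for p in range(len(read) - k + 1):
            index[read[p:p + k]].append((i, p))
    overlaps = {}
    for j, read in enumerate(reads):
        for i, p in index[read[:k]]:
            length = len(reads[i]) - p
            # Skip self hits, containments (p == 0 or read j too short), short overlaps.
            if i == j or p == 0 or length < min_overlap or length > len(read):
                continue
            mismatches = sum(a != b for a, b in zip(reads[i][p:], read))
            if mismatches <= max_mismatches:
                overlaps[(i, j)] = min(p, overlaps.get((i, j), p))
    return overlaps


def unique_paths(n_reads, overlaps):
    """Chain reads while the link is the only out-edge and the only in-edge."""
    succ, pred = defaultdict(list), defaultdict(list)
    for i, j in overlaps:
        succ[i].append(j)
        pred[j].append(i)

    def unique_next(i):
        ok = len(succ[i]) == 1 and len(pred[succ[i][0]]) == 1
        return succ[i][0] if ok else None

    def unique_prev(j):
        ok = len(pred[j]) == 1 and len(succ[pred[j][0]]) == 1
        return pred[j][0] if ok else None

    paths, seen = [], set()
    for start in range(n_reads):
        if start in seen or unique_prev(start) is not None:
            continue  # not the first read of a path
        path, nxt = [start], unique_next(start)
        seen.add(start)
        while nxt is not None and nxt not in seen:
            path.append(nxt)
            seen.add(nxt)
            nxt = unique_next(nxt)
        paths.append(path)
    return paths


def consensus(path, reads, overlaps):
    """Gapless consensus: stack reads at their offsets, majority vote per column."""
    columns = defaultdict(Counter)
    start = 0
    for idx, r in enumerate(path):
        if idx > 0:
            start += overlaps[(path[idx - 1], r)]
        for c, base in enumerate(reads[r]):
            columns[start + c][base] += 1
    return "".join(columns[c].most_common(1)[0][0] for c in range(len(columns)))


def assemble(reads, **kwargs):
    """Run the three steps and return one consensus string per contig."""
    overlaps = find_overlaps(reads, **kwargs)
    return [consensus(p, reads, overlaps) for p in unique_paths(len(reads), overlaps)]
```

For example, `assemble(["ATGGCGTACC", "GTACCTGAAT", "TGAATCGGAT"])` chains the three reads through two 5-base overlaps into a single 20-base contig. Because adjacent reads here overlap but non-adjacent ones do not, the graph is already a simple path; denser coverage would need the transitive-edge handling that this sketch leaves out.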
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
United States | 2 | 2% |
Vietnam | 1 | 1% |
Brazil | 1 | 1% |
Portugal | 1 | 1% |
Canada | 1 | 1% |
India | 1 | 1% |
China | 1 | 1% |
Poland | 1 | 1% |
Unknown | 84 | 90% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Researcher | 22 | 24% |
Student > Ph. D. Student | 18 | 19% |
Student > Master | 17 | 18% |
Student > Bachelor | 9 | 10% |
Student > Postgraduate | 5 | 5% |
Other | 14 | 15% |
Unknown | 8 | 9% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 45 | 48% |
Computer Science | 15 | 16% |
Biochemistry, Genetics and Molecular Biology | 13 | 14% |
Engineering | 3 | 3% |
Business, Management and Accounting | 1 | 1% |
Other | 7 | 8% |
Unknown | 9 | 10% |