↓ Skip to main content

Research in Computational Molecular Biology

Overview of attention for book
Cover of 'Research in Computational Molecular Biology'

Table of Contents

  1. Altmetric Badge
    Book Overview
  2. Altmetric Badge
    Chapter 1 Tractatus: An Exact and Subquadratic Algorithm for Inferring Identical-by-Descent Multi-shared Haplotype Tracts
  3. Altmetric Badge
    Chapter 2 HapTree: A Novel Bayesian Framework for Single Individual Polyplotyping Using NGS Data
  4. Altmetric Badge
    Chapter 3 Changepoint Analysis for Efficient Variant Calling
  5. Altmetric Badge
    Chapter 4 On the Representation of de Bruijn Graphs
  6. Altmetric Badge
    Chapter 5 Exact Learning of RNA Energy Parameters from Structure
  7. Altmetric Badge
    Chapter 6 An Alignment-Free Regression Approach for Estimating Allele-Specific Expression Using RNA-Seq Data
  8. Altmetric Badge
    Chapter 7 The Generating Function Approach for Peptide Identification in Spectral Networks
  9. Altmetric Badge
    Chapter 8 Decoding coalescent hidden Markov models in linear time.
  10. Altmetric Badge
    Chapter 9 AptaCluster – A Method to Cluster HT-SELEX Aptamer Pools and Lessons from Its Application
  11. Altmetric Badge
    Chapter 10 Learning Sequence Determinants of Protein: Protein Interaction Specificity with Sparse Graphical Models
  12. Altmetric Badge
    Chapter 11 On Sufficient Statistics of Least-Squares Superposition of Vector Sets
  13. Altmetric Badge
    Chapter 12 IDBA-MTP: A Hybrid MetaTranscriptomic Assembler Based on Protein Information
  14. Altmetric Badge
    Chapter 13 MRFalign: Protein Homology Detection through Alignment of Markov Random Fields
  15. Altmetric Badge
    Chapter 14 An Integrated Model of Multiple-Condition ChIP-Seq Data Reveals Predeterminants of Cdx2 Binding
  16. Altmetric Badge
    Chapter 15 PASTA: Ultra-Large Multiple Sequence Alignment
  17. Altmetric Badge
    Chapter 16 Fast Flux Module Detection Using Matroid Theory
  18. Altmetric Badge
    Chapter 17 Building a Pangenome Reference for a Population
  19. Altmetric Badge
    Chapter 18 CSAX: Characterizing Systematic Anomalies in eXpression Data
  20. Altmetric Badge
    Chapter 19 WhatsHap: Haplotype Assembly for Future-Generation Sequencing Reads
  21. Altmetric Badge
    Chapter 20 Simultaneous Inference of Cancer Pathways and Tumor Progression from Cross-Sectional Mutation Data
  22. Altmetric Badge
    Chapter 21 dipSPAdes: Assembler for Highly Polymorphic Diploid Genomes
  23. Altmetric Badge
    Chapter 22 An Exact Algorithm to Compute the DCJ Distance for Genomes with Duplicate Genes
  24. Altmetric Badge
    Chapter 23 HIT’nDRIVE: Multi-driver Gene Prioritization Based on Hitting Time
  25. Altmetric Badge
    Chapter 24 Modeling Mutual Exclusivity of Cancer Mutations
  26. Altmetric Badge
    Chapter 25 Viral Quasispecies Assembly via Maximal Clique Enumeration
  27. Altmetric Badge
    Chapter 26 Correlated Protein Function Prediction via Maximization of Data-Knowledge Consistency
  28. Altmetric Badge
    Chapter 27 Bayesian Multiple Protein Structure Alignment
  29. Altmetric Badge
    Chapter 28 Gene-Gene Interactions Detection Using a Two-Stage Model
  30. Altmetric Badge
    Chapter 29 A Geometric Clustering Algorithm and Its Applications to Structural Data
  31. Altmetric Badge
    Chapter 30 A Spatial-Aware Haplotype Copying Model with Applications to Genotype Imputation
  32. Altmetric Badge
    Chapter 31 Traversing the k-mer Landscape of NGS Read Datasets for Quality Score Sparsification
  33. Altmetric Badge
    Chapter 32 Reconstructing Breakage Fusion Bridge Architectures Using Noisy Copy Numbers
  34. Altmetric Badge
    Chapter 33 Reconciliation with Non-binary Gene Trees Revisited
  35. Altmetric Badge
    Chapter 34 Learning Protein-DNA Interaction Landscapes by Integrating Experimental Data through Computational Models
  36. Altmetric Badge
    Chapter 35 Imputation of Quantitative Genetic Interactions in Epistatic MAPs by Interaction Propagation Matrix Completion
Attention for Chapter 31: Traversing the k-mer Landscape of NGS Read Datasets for Quality Score Sparsification
Altmetric Badge

About this Attention Score

  • In the top 25% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (89th percentile)
  • High Attention Score compared to outputs of the same age and source (97th percentile)

Mentioned by

blogs
1 blog
twitter
4 X users

Citations

dimensions_citation
8 Dimensions

Readers on

mendeley
10 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Chapter title
Traversing the k-mer Landscape of NGS Read Datasets for Quality Score Sparsification
Chapter number 31
Book title
Research in Computational Molecular Biology
Published in
Lecture notes in computer science, April 2014
DOI 10.1007/978-3-319-05269-4_31
Pubmed ID
Book ISBNs
978-3-31-905268-7, 978-3-31-905269-4
Authors

Y. William Yu, Deniz Yorukoglu, Bonnie Berger, Yu, Y. William, Yorukoglu, Deniz, Berger, Bonnie

Abstract

It is becoming increasingly impractical to indefinitely store raw sequencing data for later processing in an uncompressed state. In this paper, we describe a scalable compressive framework, Read-Quality-Sparsifier (RQS), which substantially outperforms the compression ratio and speed of other de novo quality score compression methods while maintaining SNP-calling accuracy. Surprisingly, RQS also improves the SNP-calling accuracy on a gold-standard, real-life sequencing dataset (NA12878) using a k-mer density profile constructed from 77 other individuals from the 1000 Genomes Project. This improvement in downstream accuracy emerges from the observation that quality score values within NGS datasets are inherently encoded in the k-mer landscape of the genomic sequences. To our knowledge, RQS is the first scalable sequence based quality compression method that can efficiently compress quality scores of terabyte-sized and larger sequencing datasets. An implementation of our method, RQS, is available for download at: http://rqs.csail.mit.edu/.

X Demographics

X Demographics

The data shown below were collected from the profiles of 4 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 10 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
United States 2 20%
Unknown 8 80%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 3 30%
Student > Doctoral Student 2 20%
Researcher 2 20%
Professor 1 10%
Student > Master 1 10%
Other 1 10%
Readers by discipline Count As %
Computer Science 6 60%
Agricultural and Biological Sciences 2 20%
Medicine and Dentistry 1 10%
Unknown 1 10%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 14. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 29 January 2015.
All research outputs
#2,205,205
of 22,747,498 outputs
Outputs from Lecture notes in computer science
#439
of 8,126 outputs
Outputs of similar age
#23,708
of 225,521 outputs
Outputs of similar age from Lecture notes in computer science
#2
of 88 outputs
Altmetric has tracked 22,747,498 research outputs across all sources so far. Compared to these this one has done particularly well and is in the 90th percentile: it's in the top 10% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 8,126 research outputs from this source. They receive a mean Attention Score of 5.0. This one has done particularly well, scoring higher than 94% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 225,521 tracked outputs that were published within six weeks on either side of this one in any source. This one has done well, scoring higher than 89% of its contemporaries.
We're also able to compare this research output to 88 others from the same source and published within six weeks on either side of this one. This one has done particularly well, scoring higher than 97% of its contemporaries.