Document Type

Article

Publication Date

2013

DOI

10.1186/1471-2164-14-349

Publication Title

BMC Genomics

Volume

Pages

349 (12 pages)

Abstract

Background: Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) can locate transcription factor binding sites on genomic scale. Although many models and programs are available to call peaks, none has dominated its competition in comparison studies.

Results: We propose a rigorous statistical model, the normal-exponential two-peak (NEXT-peak) model, which parallels the physical processes generating the empirical data, and which can naturally incorporate mappability information. The model therefore estimates total strength of binding (even if some binding locations do not map uniquely into a reference genome, effectively censoring them); it also assigns an error to an estimated binding location. The comparison study with existing programs on real ChIP-seq datasets (STAT1, NRSF, and ZNF143) demonstrates that the NEXT-peak model performs well both in calling peaks and locating them. The model also provides a goodness-of-fit test, to screen out spurious peaks and to infer multiple binding events in a region.

Conclusions: The NEXT-peak program calls peaks on any test dataset about as accurately as any other, but provides unusual accuracy in the estimated location of the peaks it calls. NEXT-peak is based on rigorous statistics, so its model also provides a principled foundation for a more elaborate statistical analysis of ChIP-seq data.

Comments

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Original Publication Citation

Kim, N. K., Jayatillake, R. V., & Spouge, J. L. (2013). NEXT-peak: A normal-exponential two-peak model for peak-calling in ChIP-seq data. BMC Genomics, 14, 349. doi:10.1186/1471-2164-14-349

Repository Citation

Kim, Nak-Kyeong; Jayatillake, Rasika V.; and Spouge, John L., "NEXT-Peak: A Normal-Exponential Two-Peak Model for Peak-Calling in ChIP-seq Data" (2013). Mathematics & Statistics Faculty Publications. 50.
https://digitalcommons.odu.edu/mathstat_fac_pubs/50

Download

Included in

Biotechnology Commons, Genetics Commons, Microbiology Commons, Statistics and Probability Commons

COinS

ODU Digital Commons

Mathematics & Statistics Faculty Publications

NEXT-Peak: A Normal-Exponential Two-Peak Model for Peak-Calling in ChIP-seq Data

Document Type

Publication Date

DOI

Publication Title

Volume

Pages

Abstract

Comments

Original Publication Citation

Repository Citation

Included in

Search

Browse

Contribute

Links

Contact Us

ODU Digital Commons

Mathematics & Statistics Faculty Publications

NEXT-Peak: A Normal-Exponential Two-Peak Model for Peak-Calling in ChIP-seq Data

Authors

Document Type

Publication Date

DOI

Publication Title

Volume

Pages

Abstract

Comments

Original Publication Citation

Repository Citation

Included in

Share

Search

Browse

Contribute

Links

Contact Us