PDEGEM: Modeling non-uniform read distribution in RNA-Seq data
Xia, Yuchao ; Wang, Fugui ; Qian, Minping ; Qin, Zhaohui ; Deng, Minghua
2015
Keywords: SHORT OLIGONUCLEOTIDE MICROARRAYS GENE-EXPRESSION GENOME CELL ARRAYS
Abstract: Background: RNA-Seq is a powerful new technology for comprehensively analyzing the transcriptome of any given cell. An important task in RNA-Seq data analysis is quantifying the expression levels of all transcripts. Although many methods have been introduced and much progress has been made, a satisfactory solution remains elusive. Results: In this article, we borrow the idea of the Positional Dependent Nearest Neighborhood (PDNN) model, originally developed for analyzing microarray data, to model the non-uniformity of read distribution in RNA-Seq data. We propose a robust nonlinear regression model named PDEGEM, a Positional Dependent Energy Guided Expression Model, to estimate the abundance of transcripts. PDEGEM fits the data better than mseq in all three real datasets we tested. The expression measures obtained using PDEGEM also show higher correlation with those obtained from alternative assays for quantifying gene and isoform expression. Conclusions: Based on these results, we believe that PDEGEM can improve the accuracy of modeling and estimating transcript abundance and isoform expression in RNA-Seq data. Additionally, although the stacking energy and positional weight of PDEGEM depend to some extent on the sequencing platform and species, they share some common trends, which indicates that PDEGEM could partly reflect the mechanism of DNA binding between the template strand and the newly synthesized read. The PDEGEM model can be freely downloaded at: http://www.math.pku.edu.cn/teachers/dengmh/PDEGEM.
SCI(E); CPCI-S(ISTP); dengmh@pku.edu.cn; 8
Language: English
Source: SCI
Publisher: 4th Annual Translational Bioinformatics Conference (TBC) / 8th International Conference on Systems Biology (ISB)
Content type: Other
Source URL: http://hdl.handle.net/20.500.11897/493711
Collections: School of Mathematical Sciences
School of Life Sciences
Recommended citation
GB/T 7714
Xia, Yuchao, Wang, Fugui, Qian, Minping, et al. PDEGEM: Modeling non-uniform read distribution in RNA-Seq data. 2015-01-01.