激光生物学报摘要, 更新时间: 2007年4月29日
  由美国生科集团 (BVTech, Inc.) 主办
  
一种预测细胞凋亡蛋白的亚细胞定位的新方法
激光生物学报摘要 2007-2

张振慧1,王勇献2,王正华2 (1.国防科技大学理学院数学与系统科学系,长沙,410073; 2.国防科技大学并行与分布处理国家重点实验室,长沙,410073)

摘 要:细胞凋亡蛋白对生物体的发育、维持内环境稳定及人们理解细胞凋亡机制非常重要。文中提出了一种新的蛋白质序列特征提取方法——三肽离散源方法。计算了蛋白质序列中紧邻三联体的出现个数,利用离散增量极小化对凋亡蛋白进行定位预测;同时推广了张春霆等提出的内容平衡精度指数[13],使其能评估任意类的分类问题。实验结果表明:在凋亡蛋白定位预测研究中,三肽离散源方法在提高总体预测精度的同时,能够较好的解决样本不均衡问题;而内容平衡精度指数能比传统的总体预测精度更准确的评估预测算法的预测能力,有效的反映预测算法对样本不均衡问题的相容能力。


A New Method for the Subcellular Location Prediction of Apoptosis Proteins

ZHANG Zhen-hui1,WANG Yong-xian2, WANG Zheng-hua2* ( 1.Institute of science, National University of Defense Technology, Changsha, 410073, China; 2.National Laboratory for Parallel and Distributed Processing, National University of Defense Technology, Changsha, 410073, China)

Abstract: Apoptosis proteins have a central role in the development and homeostasis of an organism. These proteins are very important for understanding the mechanism of programmed cell death. A new encoding method based on tri-polypeptide composition is presented. By use of adjacent triune residues contents in the protein primary sequences, the increment of diversity is calculated to predict the subcellular location of apoptosis proteins. The content-balancing accuracy index presented by Zhang CT is extended to solve any classification problem. The experiment results show that for apoptosis protein subcellular location prediction, the method of tri-polypeptide diversity source can not only improve the overall prediction accuracy, but also solve the imbalance problem of samples. While the content-balancing accuracy index is much superior to the widely used overall prediction accuracy for evaluate prediction algorithms.


 

Bioinformatics, sequence analysis; GCG; Life Science News; Drug Discovery.
生命科学进展新闻网
生科新闻网是反映生命科学研究进展的新闻网站。致力于为学术界和工业界的生命科学研究人员和大专院校师生提供及时, 准确的新闻和研究动态。为促进科技交流,尽其绵薄之力。
This is a web site of life science news in Chinese. It provides news and research trens in life sciences and drug discovery. This site is updated continuously throughout the day.