CyclinPred: A SVM-based method for predicting cyclin protein sequences

Mridul K. Kalita, Umesh K. Nandal, Ansuman Pattnaik, Anandhan Sivalingam, Gowthaman Ramasamy, Manish Kumar, Gajendra P.S. Raghava, Dinesh Gupta

Research output: Contribution to journal › Article › peer-review

23 Scopus citations

Abstract

Functional annotation of protein sequences with low similarity to well-characterized protein sequences is a major challenge of computational biology in the post-genomic era. The cyclin protein family is one such important family: its members share low sequence similarity, which makes discovering novel cyclins and establishing orthologous relationships among them difficult. The currently identified cyclin motifs and cyclin-associated domains do not represent all of the identified and characterized cyclin sequences. We describe a Support Vector Machine (SVM) based classifier, CyclinPred, which can predict cyclin sequences with high efficiency. The SVM classifier was trained with features of selected cyclin and non-cyclin protein sequences. The training features include amino acid composition, dipeptide composition, secondary structure composition and PSI-BLAST generated Position Specific Scoring Matrix (PSSM) profiles. Results obtained from leave-one-out cross-validation (jackknife test), self-consistency and holdout tests show that the SVM classifier trained with PSSM profile features was more accurate than the classifiers based on any of the other features alone or on hybrids of these features. A cyclin prediction server, CyclinPred, has been set up based on the SVM model trained with PSSM profiles. The prediction results indicate that the method may be used as a cyclin prediction tool, complementing conventional cyclin prediction methods.
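
The two simplest feature types named in the abstract, amino acid composition and dipeptide composition, can be illustrated with a minimal sketch. This is not the authors' code: the function names, the choice to express composition as fractions rather than percentages, and the handling of non-standard residues (skipped) are all assumptions made for illustration.

```python
from itertools import product

# The 20 standard amino acids, in a fixed order that defines the feature layout.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 400 ordered residue pairs, in the same fixed order.
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def amino_acid_composition(seq):
    """Return a 20-element vector: fraction of each standard residue in seq.

    Assumption: non-standard symbols (e.g. X, B, Z) are ignored.
    """
    residues = [r for r in seq.upper() if r in AMINO_ACIDS]
    if not residues:
        raise ValueError("sequence contains no standard amino acids")
    total = len(residues)
    return [residues.count(a) / total for a in AMINO_ACIDS]


def dipeptide_composition(seq):
    """Return a 400-element vector: fraction of each adjacent residue pair.

    Assumption: a pair is counted only if both residues are standard,
    and fractions are taken over the counted pairs.
    """
    seq = seq.upper()
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    for a, b in zip(seq, seq[1:]):
        if a in AMINO_ACIDS and b in AMINO_ACIDS:
            counts[a + b] += 1
            total += 1
    if total == 0:
        raise ValueError("sequence contains no standard dipeptides")
    return [counts[d] / total for d in DIPEPTIDES]


if __name__ == "__main__":
    toy = "ACAC"  # toy sequence, not a real cyclin
    aac = amino_acid_composition(toy)
    dpc = dipeptide_composition(toy)
    print(len(aac), len(dpc))
```

Vectors of this kind (20 and 400 values per sequence) are the sort of fixed-length input an SVM can be trained on; the PSSM-based features that the abstract reports as most accurate require a PSI-BLAST run and are not sketched here.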

Original language: English
Article number: e2605
Journal: PLoS ONE
Volume: 3
Issue number: 7
DOIs
State: Published - Jul 2 2008
Externally published: Yes
