Splice site detection using principal component analysis and case-based reasoning with support vector machine
Srabanti Maji*1 and Haripada Bhunia2
1 Computer Science Department
Sri Guru Harkrishan College of Management and Technology, Raipur, Bahadurgarh;
District: Patiala, Punjab, India
2 Department of Chemical Engineering
Thapar University, Patiala-147004, India
*Address Correspondence to this author at
Dr. Srabanti Maji
Computer Science Department,
Sri Guru Harkrishan College of Management and Technology, Raipur, Bahadurgarh;
District: Patiala, Punjab, India
E-mail address: srabantiindia@gmail.com, srabanti9@gmail.com
Tel: +91-9356006454
ABSTRACT
Identification of coding regions from a genomic DNA sequence is the foremost step
…
feature selection; and the final stage, in which a support vector machine (SVM) with a polynomial kernel is used for the final classification. In comparison with other methods, the proposed SpliceCombo model outperforms existing prediction models, achieving 97.25% sensitivity and 97.46% specificity for donor splice site prediction and 96.51% sensitivity and 94.48% specificity for acceptor splice site prediction.
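As an illustration of the staged design described above, the following minimal sketch chains PCA-based feature extraction into a polynomial-kernel SVM on synthetic one-hot encoded splice-site windows. It is not the authors' implementation: the window length, component count and kernel degree are assumptions for demonstration, and the intermediate case-based reasoning stage is omitted.

```python
# Illustrative sketch only (not the SpliceCombo implementation):
# PCA feature extraction followed by a polynomial-kernel SVM classifier.
import numpy as np
from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
n_samples, window_len = 500, 40                 # 40-nt window around a candidate site (assumed)
X = rng.integers(0, 2, size=(n_samples, window_len * 4)).astype(float)  # one-hot A/C/G/T
y = rng.integers(0, 2, size=n_samples)          # 1 = true splice site, 0 = decoy

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

model = make_pipeline(
    PCA(n_components=20),                       # stage 1: dimensionality reduction
    SVC(kernel="poly", degree=3, C=1.0),        # final stage: polynomial-kernel SVM
)
model.fit(X_train, y_train)
print("held-out accuracy:", model.score(X_test, y_test))
```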
Keywords: Gene Identification; Splice Site; Principal Component Analysis (PCA); Case-Based Reasoning (CBR); Support Vector Machine (SVM)
1. INTRODUCTION
Advances in genome sequencing technology have been generating an enormous amount of genomic sequence data, and a main objective of analysing these data is gene identification. In eukaryotes, prediction of coding regions depends on recognition of exon–intron structures, which is very challenging because of their structural complexity and great length. Analyses of the human genome indicate nearly 20,000–25,000 protein-coding genes [1], although earlier estimates placed the number close to 100,000, which suggests that a large number of genes may still be unidentified [2,3]. Most of the computational techniques
The vital components and techniques of gene cloning are as follows. The DNA sequence that contains the desired gene (EZH2) is amplified by the polymerase chain reaction (PCR). PCR, established by Kary Mullis in 1985, is widely used to amplify target DNA sequences (here EZH2) a billion-fold within several hours using a thermophilic polymerase (Taq), primers and other cofactors (Sambrook and Russell, 2001). Three crucial steps are involved: denaturation (at 95 °C), annealing of the forward and reverse primers (55–65 °C) and, lastly, primer extension (at 72 °C). After amplification, the desired sequence is integrated into a circular vector (pBluescript), forming the recombinant molecule. To make the insert and vector compatible, both were digested with EcoRI so that the same cohesive ends are generated in both, making them easier to ligate. EcoRI is a restriction enzyme of the type II endonuclease class that cuts within dsDNA at its recognition site "GAATTC" (Clark, 2010; Sambrook and Russell, 2001).
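The three-step cycle described above can be written out as a simple thermocycler programme, as sketched below. Only the temperatures come from the text; the hold times and the cycle count of 30 are assumptions for illustration.

```python
# Illustrative sketch only: the denaturation/annealing/extension cycle as a
# thermocycler programme. Hold times and cycle count are assumed values.
PCR_PROGRAMME = {
    "denaturation": {"temp_c": "95", "seconds": 30},
    "annealing":    {"temp_c": "55-65", "seconds": 30},   # primer-dependent
    "extension":    {"temp_c": "72", "seconds": 60},
    "cycles": 30,
}

def run_programme(prog: dict) -> None:
    """Print the steps a thermocycler would execute for this programme."""
    for cycle in range(1, prog["cycles"] + 1):
        for step in ("denaturation", "annealing", "extension"):
            p = prog[step]
            print(f"cycle {cycle:02d}: {step:<12} {p['temp_c']} degC for {p['seconds']} s")

if __name__ == "__main__":
    run_programme(PCR_PROGRAMME)
```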
Observe that the structure in (1) can be interpreted as a cluster-wise low-dimensional representation of each instance; accordingly, it can be readily adapted to our requirement. Specifically, we solve (3) for the entire dataset and use the resulting structure W0 as the new representation, feeding it into the data partition module, with the subscript "0" denoting the whole dataset. Note that we favour W0 over conventional label transformation strategies such as CPLST [32] for the following reasons: 1) the proposed procedure does not rely on the actual label matrix Y, as the training of CPLST does, and 2) correlations between instances can be explicitly introduced, which is appropriate for data partitioning. Our approach makes no particular assumptions about the choice of partition algorithm, so various procedures can be considered, including k-means clustering, locality-sensitive hashing (LSH), and adaptive methods such as Affinity Propagation clustering or ISODATA if sufficient prior knowledge is available. In our implementation, we use k-means clustering for its simplicity and
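A minimal sketch of the k-means partitioning step mentioned above is given below; it is not the paper's code, and the placeholder feature matrix and cluster count are assumptions for illustration.

```python
# Minimal sketch: partition instances with k-means and keep the cluster
# assignments, standing in for the data partition module described above.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 16))            # placeholder low-dimensional representation

kmeans = KMeans(n_clusters=8, n_init=10, random_state=0).fit(X)
partition = kmeans.labels_                 # cluster index assigned to each instance
print("instances per cluster:", np.bincount(partition))
```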
The goal of feature extraction and selection is to reduce the dimensionality of the data. In this experiment the dimensions of the AVIRIS and HYDICE images were reduced to 20 from 220 and 191, respectively, using PCA. The PCA analysis shows that the image of principal component 1 is the brightest and sharpest of the PCA images, as illustrated in Figure 2.
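The sketch below shows this kind of PCA-based band reduction on a hyperspectral cube; it is not the original experiment, and the image dimensions and random data are placeholders where the real AVIRIS scene would be loaded.

```python
# Minimal sketch: reduce a 220-band hyperspectral cube to 20 principal components.
import numpy as np
from sklearn.decomposition import PCA

rows, cols, bands = 145, 145, 220          # AVIRIS-like cube (placeholder values)
cube = np.random.default_rng(0).normal(size=(rows, cols, bands))

pixels = cube.reshape(-1, bands)           # one spectrum per pixel
pca = PCA(n_components=20)
reduced = pca.fit_transform(pixels)        # shape: (rows*cols, 20)

pc1_image = reduced[:, 0].reshape(rows, cols)   # first principal-component image
print("explained variance of PC1: %.3f" % pca.explained_variance_ratio_[0])
```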
(PCR), which isolates small fragments of DNA that have a high degree of variability from
• Use these data to construct a map of these three genes in two steps.
Finally, it was found that a total of 62.1% to about 74.7% of the human genome was covered by either processed or primary transcripts.
Two different DNA sequencing techniques were used in this study. Sanger sequencing is a form of DNA sequencing in which the target DNA is copied many times, producing fragments of varying lengths. The ends of the fragments are marked with fluorescent chain-terminator nucleotides that indicate where each fragment ends. The fragments are then aligned according to the overlapping segments they share, which allows larger regions of DNA to be sequenced via capillary gel electrophoresis (Khan Academy, n.d.). This method was used to sequence the human genome, but it has since become one of the more expensive and less efficient ways to sequence.
Specific methodologies are implemented in DNA fingerprinting to obtain reliable and valid results. The main steps are isolation, cutting, sorting and analysis of the DNA.
Sophisticated software compared these parts against existing proteins of the human genome to determine the actual proteins in the samples. They found that the Maiden's profile of
Agarose gel for the detection of genomic DNA was prepared by adding 1 g of agarose to 100 ml of 1X TBE buffer and dissolving it by heating to boiling. The agarose was then left to cool to 55 °C before being poured into a casting plate to solidify. A comb was placed near one edge of the gel, and the gel was left to set. 1X TBE was poured into the gel tank and the gel plate was placed horizontally in the electrophoresis tank. The DNA samples were prepared by mixing 5 µl of DNA sample with 1 µl of loading buffer, and the samples were then added carefully to individual wells. The power was turned on at 45 V for 15 minutes and then 85 V for 1 hour (or at 5–8 V/cm) to run the DNA. Agarose gels were stained with ethidium bromide by immersing
The TARGet website allowed us to determine the flanking sequence and DNA sequence of the homologue, which we then transferred into Benchling. The Benchling website helped us determine the introns and exons present in our DNA sequence, as presented in Figure 3. Using Benchling, we also designed primers that cover as many exons and introns as possible. Once we had determined the forward and reverse primers to be used, we ordered them and, when they were received, added resuspension buffer according to the accompanying instructions. We then diluted the forward and reverse primers, created another PCR table as shown in Figure 4, prepared another master mix and ran electrophoresis on the products. Our results can be seen in Figure 5. We ran the same dilutions a second time to clarify the results of the first PCR reaction. Those results can be seen in Figure
The data are divided into training sets and test sets, and the training data are used to build a classification model that assigns each test sample to one category or the other. The SVM algorithm has been widely applied in the biological
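The sketch below shows this train/test protocol with an SVM, reporting sensitivity and specificity from the confusion matrix (the metrics quoted in the abstract). The data are synthetic placeholders, not the datasets used in this work.

```python
# Minimal sketch: train/test split, SVM classification, and sensitivity/
# specificity computed from the resulting confusion matrix.
import numpy as np
from sklearn.svm import SVC
from sklearn.model_selection import train_test_split
from sklearn.metrics import confusion_matrix

rng = np.random.default_rng(1)
X = rng.normal(size=(400, 10))
y = rng.integers(0, 2, size=400)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=1)
y_pred = SVC(kernel="poly").fit(X_train, y_train).predict(X_test)

tn, fp, fn, tp = confusion_matrix(y_test, y_pred).ravel()
print("sensitivity:", tp / (tp + fn))
print("specificity:", tn / (tn + fp))
```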
The Encyclopedia of DNA Elements (ENCODE) is a project designed to compare and contrast the repertoire of RNAs produced by human cells and to cross-verify the findings with other methods such as NGS. Five years after the start of the ENCODE project, just 1% of the human genome had been examined, and what was achieved was largely a confirmation of the results of previous studies.
Raw data were processed and normalized by the Robust Multichip Averaging (RMA) method (Irizarry et al., 2003) using the affy package of R (v. 3.1.3) (Team, 2004). The linear models for microarray data package, limma (Smyth, 2004), was used to classify chips into groups, and the Benjamini–Hochberg method (Benjamini and Hochberg, 1995) was used to correct for multiple testing. |logFC| > 2 and P-value < 0.05 were used as cutoffs to identify genes that are differentially expressed in
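The cutoff step can be written as a simple filter, as sketched below on a placeholder results table; the column names are assumptions, and the actual analysis described above was carried out in R with the affy and limma packages.

```python
# Minimal sketch of the differential-expression cutoff: keep genes with
# |logFC| > 2 and P-value < 0.05 from a placeholder results table.
import pandas as pd

results = pd.DataFrame({
    "gene":   ["G1", "G2", "G3", "G4"],
    "logFC":  [2.5, -3.1, 0.4, 2.2],
    "pvalue": [0.01, 0.002, 0.2, 0.08],
})

deg = results[(results["logFC"].abs() > 2) & (results["pvalue"] < 0.05)]
print(deg)   # genes passing both cutoffs
```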
A small-scale automobile parts manufacturing company produces a large number of products. A multi-product manufacturing facility has a wide-ranging process that involves a large number of variables, such as quality characteristics. Every product has different quality characteristics, which are measured on the manufacturing line. When multiple products are manufactured on a single processing line in a multi-product facility, individual process-monitoring strategies are used to monitor the process for each part. The challenge, therefore, is to develop statistical process control charts that detect faults using these quality characteristics.