Εμφάνιση απλής εγγραφής

dc.creatorZintzaras, E.en
dc.creatorKowald, A.en
dc.date.accessioned2015-11-23T10:55:14Z
dc.date.available2015-11-23T10:55:14Z
dc.date.issued2010
dc.identifier10.1016/j.compbiomed.2010.03.006
dc.identifier.issn0010-4825
dc.identifier.urihttp://hdl.handle.net/11615/34965
dc.description.abstractClassification into multiple classes when the measured variables are outnumbered is a major methodological challenge in -omics studies. Two algorithms that overcome the dimensionality problem are presented: the forest classification tree (FCT) and the forest support vector machines (FSVM). In FCT, a set of variables is randomly chosen and a classification tree (CT) is grown using a forward classification algorithm. The process is repeated and a forest of CTs is derived. Finally, the most frequent variables from the trees with the smallest apparent misclassification rate (AMR) are used to construct a productive tree. In FSVM, the CTs are replaced by SVMs. The methods are demonstrated using prostate gene expression data for classifying tissue samples into four tumor types. For threshold split value 0.001 and utilizing 100 markers the productive CT consisted of 29 terminal nodes and achieved perfect classification (AMR=0). When the threshold value was set to 0.01, a tree with 17 terminal nodes was constructed based on 15 markers (AMR=7%). In FSVM, reducing the fraction of the forest that was used to construct the best classifier from the top 80% to the top 20% reduced the misclassification to 25% (when using 200 markers). The proposed methodologies may be used for identifying important variables in high dimensional data. Furthermore, the FT allows exploring the data structure and provides a decision rule. (C) 2010 Elsevier Ltd. All rights reserved.en
dc.source.uri<Go to ISI>://WOS:000278258500006
dc.subjectClassification treesen
dc.subjectSupport vector machinesen
dc.subjectForesten
dc.subjectClassificationen
dc.subjectAlgorithmsen
dc.subjectMicroarraysen
dc.subjectGene expressionen
dc.subjectGENE-EXPRESSION DATAen
dc.subjectCANCER CLASSIFICATIONen
dc.subjectPATTERN-RECOGNITIONen
dc.subjectSTATISTICAL-METHODSen
dc.subjectSAMPLE-SIZEen
dc.subjectSELECTIONen
dc.subjectASSOCIATIONen
dc.subjectPERFORMANCEen
dc.subjectVALIDATIONen
dc.subjectPREDICTIONen
dc.subjectBiologyen
dc.subjectComputer Science, Interdisciplinary Applicationsen
dc.subjectEngineering,en
dc.subjectBiomedicalen
dc.subjectMathematical & Computational Biologyen
dc.titleForest classification trees and forest support vector machines algorithms: Demonstration using microarray dataen
dc.typejournalArticleen


Αρχεία σε αυτό το τεκμήριο

ΑρχείαΜέγεθοςΤύποςΠροβολή

Δεν υπάρχουν αρχεία που να σχετίζονται με αυτό το τεκμήριο.

Αυτό το τεκμήριο εμφανίζεται στις ακόλουθες συλλογές

Εμφάνιση απλής εγγραφής