Uppsala University Publications (uu.se)
A Statistical Method for Determining Importance of Variables in an Information System
Uppsala University, Teknisk-naturvetenskapliga vetenskapsområdet, Faculty of Science and Technology, Biology, The Linnaeus Centre for Bioinformatics. Department of Cell and Molecular Biology, Bioinformatics.
Uppsala University, Teknisk-naturvetenskapliga vetenskapsområdet, Faculty of Science and Technology, Biology, The Linnaeus Centre for Bioinformatics. Department of Cell and Molecular Biology, Bioinformatics.
2006 (English). In: Lecture Notes in Computer Science: Rough Sets and Current Trends in Computing, ISSN 0302-9743, Vol. 4259/2006. Article in journal (Refereed). Published.
Abstract [en]

A new method for estimation of attributes’ importance for supervised classification, based on the random forest approach, is presented. Essentially, an iterative scheme is applied, with each step consisting of several runs of the random forest program. Each run is performed on a suitably modified data set: values of each attribute found unimportant at earlier steps are randomly permuted between objects. At each step, apparent importance of an attribute is calculated and the attribute is declared unimportant if its importance is not uniformly better than that of the attributes earlier found unimportant. The procedure is repeated until only attributes scoring better than the randomized ones are retained. Statistical significance of the results so obtained is verified. This method has been applied to 12 data sets of biological origin. The method was shown to be more reliable than that based on standard application of a random forest to assess attributes’ importance.
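The iterative scheme described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the importance score is a simple stand-in (the gap between class means) rather than a random forest, a pure-noise probe attribute seeds the initial "unimportant" set because the abstract does not say how the first step starts, and "uniformly better" is read as "better in every run than every randomized attribute in any run". The names `score`, `screen_attributes`, and `_probe` are hypothetical.

```python
import random
import statistics


def score(column, labels):
    """Stand-in importance (NOT a random forest): absolute gap between the
    two class means, in units of the column's standard deviation."""
    pos = [v for v, c in zip(column, labels) if c == 1]
    neg = [v for v, c in zip(column, labels) if c == 0]
    spread = statistics.pstdev(column) or 1.0
    return abs(statistics.fmean(pos) - statistics.fmean(neg)) / spread


def screen_attributes(columns, labels, runs=20, seed=0):
    """Return the names of attributes that outscore every randomized one.

    columns: dict mapping attribute name -> list of numeric values.
    labels:  list of 0/1 class labels, one per object.
    """
    rng = random.Random(seed)
    n = len(labels)
    # Assumption: a pure-noise probe seeds the "unimportant" set.
    data = dict(columns, _probe=[rng.random() for _ in range(n)])
    unimportant = {"_probe"}
    while True:
        scores = {name: [] for name in data}
        for _ in range(runs):
            for name, col in data.items():
                if name in unimportant:
                    # Permute this attribute's values between objects.
                    col = rng.sample(col, n)
                scores[name].append(score(col, labels))
        # Highest score any randomized attribute reached in any run.
        bar = max(max(scores[name]) for name in unimportant)
        rejected = {name for name in data
                    if name not in unimportant and min(scores[name]) <= bar}
        if not rejected:
            # Only attributes scoring above the randomized ones remain.
            return sorted(set(data) - unimportant)
        unimportant |= rejected


if __name__ == "__main__":
    demo_rng = random.Random(1)
    y = [i % 2 for i in range(200)]
    X = {
        "signal": [3 * c + demo_rng.gauss(0, 1) for c in y],
        "noise": [demo_rng.gauss(0, 1) for _ in y],
    }
    print(screen_attributes(X, y))
```

With a real random forest, an attribute's score would also change when other attributes are permuted and would vary from run to run, which is why each step uses several runs. The stand-in scorer here is univariate, so only the permuted attributes fluctuate. The final significance check mentioned in the abstract is not covered by this sketch.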

Place, publisher, year, edition, pages
2006. Vol. 4259/2006
Identifiers
URN: urn:nbn:se:uu:diva-15867
DOI: doi:10.1007/11908029
OAI: oai:DiVA.org:uu-15867
DiVA: diva2:43638
Available from: 2008-03-11 Created: 2008-03-11 Last updated: 2011-01-11

Open Access in DiVA

No full text

Other links

Publisher's full text: http://www.springerlink.com/content/75111w4k11h44808/

Authority records BETA

Kierczak, Marcin M.
