An Empirical Evaluation of Feature Selection Stability and Classification Accuracy

dc.contributor.author Mustafa Buyukkececi
dc.contributor.author Mehmet Cudi Okur
dc.contributor.author Büyükkeçeci, Mustafa
dc.contributor.author Okur, Mehmet
dc.date.accessioned 2025-10-06T16:20:28Z
dc.date.issued 2024
dc.description.abstract The performance of inductive learners can be negatively affected by high -dimensional datasets. To address this issue feature selection methods are used. Selecting relevant features and reducing data dimensions is essential for having accurate machine learning models. Stability is an important criterion in feature selection. Stable feature selection algorithms maintain their feature preferences even when small variations exist in the training set. Studies have emphasized the importance of stable feature selection particularly in cases where the number of samples is small and the dimensionality is high. In this study we evaluated the relationship between stability measures as well as feature selection stability and classification accuracy using the Pearson 's Correlation Coefficient (also known as Pearson 's Product -Moment Correlation Coefficient or simply Pearson's r ). We conducted an extensive series of experiments using five filter and two wrapper feature selection methods three classifiers for subset and classification performance evaluation and eight real -world datasets taken from two different data repositories. We measured the stability of feature selection methods using a total of twelve stability metrics. Based on the results of correlation analyses we have found that there is a lack of substantial evidence supporting a linear relationship between feature selection stability and classification accuracy. However a strong positive correlation has been observed among several stability metrics.
dc.identifier.doi 10.35378/gujs.998964
dc.identifier.issn 2147-1762
dc.identifier.scopus 2-s2.0-85196141205
dc.identifier.uri http://dx.doi.org/10.35378/gujs.998964
dc.identifier.uri https://gcris.yasar.edu.tr/handle/123456789/6395
dc.identifier.uri https://doi.org/10.35378/gujs.998964
dc.identifier.uri https://search.trdizin.gov.tr/en/yayin/detay/1303226
dc.language.iso English
dc.publisher GAZI UNIV
dc.relation.ispartof Gazi University Journal of Science
dc.rights info:eu-repo/semantics/openAccess
dc.source GAZI UNIVERSITY JOURNAL OF SCIENCE
dc.subject Feature selection, Selection stability, Classification accuracy, Filter methods, Wrapper methods
dc.subject ALGORITHMS, BIAS
dc.subject Filter Methods
dc.subject Selection Stability
dc.subject Bilgisayar Bilimleri, Yazılım Mühendisliği
dc.subject İstatistik Ve Olasılık
dc.subject Classification Accuracy
dc.subject Feature Selection
dc.subject Wrapper Methods
dc.title An Empirical Evaluation of Feature Selection Stability and Classification Accuracy
dc.type Article
dspace.entity.type Publication
gdc.author.id 0000-0002-0096-9087
gdc.author.id 0000-0002-1970-8952
gdc.author.scopusid 56342800400
gdc.author.scopusid 55190894600
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department
gdc.description.departmenttemp [Buyukkececi, Mustafa] Univerlist, Izmir, Turkiye; [Okur, Mehmet Cudi] Yasar Univ, Fac Engn, Dept Software Engn, Izmir, Turkiye
gdc.description.endpage 620
gdc.description.issue 2
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
gdc.description.startpage 606
gdc.description.volume 37
gdc.description.woscitationindex Emerging Sources Citation Index
gdc.identifier.openalex W4388873525
gdc.identifier.trdizinid 1303226
gdc.identifier.wos WOS:001240215800005
gdc.index.type WoS
gdc.index.type TR-Dizin
gdc.index.type Scopus
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.impulse 1.0
gdc.oaire.influence 2.418663E-9
gdc.oaire.isgreen false
gdc.oaire.keywords Engineering
gdc.oaire.keywords Mühendislik
gdc.oaire.keywords Feature selection;Selection stability;Classification accuracy;Filter methods;Wrapper methods
gdc.oaire.popularity 2.9699079E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 03 medical and health sciences
gdc.oaire.sciencefields 0302 clinical medicine
gdc.oaire.sciencefields 0202 electrical engineering, electronic engineering, information engineering
gdc.oaire.sciencefields 02 engineering and technology
gdc.openalex.collaboration National
gdc.openalex.fwci 0.1737
gdc.openalex.normalizedpercentile 0.59
gdc.opencitations.count 1
gdc.plumx.mendeley 5
gdc.plumx.scopuscites 1
gdc.scopus.citedcount 1
gdc.virtual.author Okur, Mehmet Cudi
gdc.virtual.author Büyükkeçeci, Mustafa
gdc.wos.citedcount 1
oaire.citation.endPage 620
oaire.citation.startPage 606
publicationissue.issueNumber 2
publicationvolume.volumeNumber 37
relation.isAuthorOfPublication 58ef0d78-7bea-46de-9b83-ebe11bcc5dd5
relation.isAuthorOfPublication 057d206c-0a44-4fac-b6d0-24462ee56f2d
relation.isAuthorOfPublication.latestForDiscovery 58ef0d78-7bea-46de-9b83-ebe11bcc5dd5
relation.isOrgUnitOfPublication ac5ddece-c76d-476d-ab30-e4d3029dee37
relation.isOrgUnitOfPublication.latestForDiscovery ac5ddece-c76d-476d-ab30-e4d3029dee37

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
10.35378-gujs.998964-1988005.pdf
Size:
1.16 MB
Format:
Adobe Portable Document Format