Comparing filter and wrapper approaches for feature selection in handwritten character recognition

IRIS

It is generally agreed that the selection of an appropriate set of features is a fundamental process in the development of any pattern recognition system. Its purpose is to identify the truly distinctive subset of features to reduce the size of the search space, without decreasing the classification performance. This problem is particularly relevant in the field of handwriting recognition, due to the enormous variability of character shape, which has led to the development of a large variety of feature sets that are becoming increasingly larger in terms of the number of attributes. While promising, the results achieved so far have several limitations, which include, among others, the computational complexity of selecting and evaluat-ing feature subsets and the difficulty in evaluating the interactions among features. In a previous study, we tried to overcome some of the above limitations by adopting a feature-ranking-based technique: a large study was carried out considering different filter-based techniques for feature subset evaluation. The aim of this work is to extend the previous study by presenting a broad comparison between fil-ter and wrapper techniques for feature selection in the field of handwritten character recognition. In the experiments, we analysed one of the most effective and widely used set of features in handwriting recognition, applied to standard real-word databases of handwritten characters. The experimental results confirmed that filter and wrapper approaches achieve similar performances, with the former selecting fewer features at a lower computational cost.(c) 2023 Elsevier B.V. All rights reserved.

Comparing filter and wrapper approaches for feature selection in handwritten character recognition

Cilia, ND;D'Alessandro, T;De Stefano, C;Fontanella, F;Freca, ASD

2023-01-01

Abstract

It is generally agreed that the selection of an appropriate set of features is a fundamental process in the development of any pattern recognition system. Its purpose is to identify the truly distinctive subset of features to reduce the size of the search space, without decreasing the classification performance. This problem is particularly relevant in the field of handwriting recognition, due to the enormous variability of character shape, which has led to the development of a large variety of feature sets that are becoming increasingly larger in terms of the number of attributes. While promising, the results achieved so far have several limitations, which include, among others, the computational complexity of selecting and evaluat-ing feature subsets and the difficulty in evaluating the interactions among features. In a previous study, we tried to overcome some of the above limitations by adopting a feature-ranking-based technique: a large study was carried out considering different filter-based techniques for feature subset evaluation. The aim of this work is to extend the previous study by presenting a broad comparison between fil-ter and wrapper techniques for feature selection in the field of handwritten character recognition. In the experiments, we analysed one of the most effective and widely used set of features in handwriting recognition, applied to standard real-word databases of handwritten characters. The experimental results confirmed that filter and wrapper approaches achieve similar performances, with the former selecting fewer features at a lower computational cost.(c) 2023 Elsevier B.V. All rights reserved.

Scheda breve

Scheda completa

Scheda completa (DC)

Anno

2023

Appare nelle tipologie:

1.1 Articolo in rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11387/158565

Citazioni

ND

21

17

social impact