Considerable progress has been made in the theory of covering-based rough sets. However, there has been a lack of research on their application to classification tasks, especially for nominal data. In this paper, we propose a representative-based classification approach for nominal data using covering-based rough sets. The classifier training task considers three issues. First, we define the neighborhood of an instance. The size of the neighborhood is determined by a similarity threshold θ. Second, we determine the maximal neighborhood of each instance in the positive region by computing its minimal θ value. These neighborhoods form a covering of the positive region. Third, we employ two covering reduction techniques to select a minimal set of instances called representatives. To classify a new instance, we compute its similarity with each representative. The similarity and minimal θ of the representative determine the distance. Representatives with the minimal distance are employed to obtain the class label. Experimental results on different datasets indicate that the classifier is comparable with or better than the ID3, C4.5, NEC, and NCR algorithms.
Zhang, B., Min, F., Ciucci, D. (2015). Representative-based classification through covering-based neighborhood rough sets. APPLIED INTELLIGENCE, 43(4), 840-854 [10.1007/s10489-015-0687-5].
Representative-based classification through covering-based neighborhood rough sets
CIUCCI, DAVIDE ELIOUltimo
2015
Abstract
Considerable progress has been made in the theory of covering-based rough sets. However, there has been a lack of research on their application to classification tasks, especially for nominal data. In this paper, we propose a representative-based classification approach for nominal data using covering-based rough sets. The classifier training task considers three issues. First, we define the neighborhood of an instance. The size of the neighborhood is determined by a similarity threshold θ. Second, we determine the maximal neighborhood of each instance in the positive region by computing its minimal θ value. These neighborhoods form a covering of the positive region. Third, we employ two covering reduction techniques to select a minimal set of instances called representatives. To classify a new instance, we compute its similarity with each representative. The similarity and minimal θ of the representative determine the distance. Representatives with the minimal distance are employed to obtain the class label. Experimental results on different datasets indicate that the classifier is comparable with or better than the ID3, C4.5, NEC, and NCR algorithms.File | Dimensione | Formato | |
---|---|---|---|
Representativet_0416.pdf
accesso aperto
Descrizione: Post-print
Dimensione
621.67 kB
Formato
Adobe PDF
|
621.67 kB | Adobe PDF | Visualizza/Apri |
2015-AppInt.pdf
Solo gestori archivio
Tipologia di allegato:
Publisher’s Version (Version of Record, VoR)
Dimensione
1.93 MB
Formato
Adobe PDF
|
1.93 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.