Multivariate temporal, or time, series classification is, in a way, the temporal generalization of (numeric) classification, as every instance is described by multiple time series instead of multiple values. Symbolic classification is the machine learning strategy to extract explicit knowledge from a data set, and the problem of symbolic classification of multivariate temporal series requires the design, implementation, and test of ad-hoc machine learning algorithms, such as, for example, algorithms for the extraction of temporal versions of decision trees. One of the most well-known algorithms for decision tree extraction from categorical data is Quinlan’s ID3, which was later extended to deal with numerical attributes, resulting in an algorithm known as C4.5, and implemented in many open-sources data mining libraries, including the so-called Weka, which features an implementation of C4.5 called J48. ID3 was recently generalized to deal with temporal data in form of timelines, which can be seen as discrete (categorical) versions of multivariate time series, and such a generalization, based on the interval temporal logic HS, is known as Temporal ID3. In this paper we introduce Temporal C4.5, that allows the extraction of temporal decision trees from undiscretized multivariate time series, describe its implementation, called Temporal J48, and discuss the outcome of a set of experiments with the latter on a collection of public data sets, comparing the results with those obtained by other, classical, multivariate time series classification methods.

Sciavicco, G., Stan, I. (2020). Knowledge extraction with interval temporal logic decision trees. In Leibniz International Proceedings in Informatics, LIPIcs (pp.1-16). Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing [10.4230/LIPIcs.TIME.2020.9].

Knowledge extraction with interval temporal logic decision trees

Stan, IE
2020

Abstract

Multivariate temporal, or time, series classification is, in a way, the temporal generalization of (numeric) classification, as every instance is described by multiple time series instead of multiple values. Symbolic classification is the machine learning strategy to extract explicit knowledge from a data set, and the problem of symbolic classification of multivariate temporal series requires the design, implementation, and test of ad-hoc machine learning algorithms, such as, for example, algorithms for the extraction of temporal versions of decision trees. One of the most well-known algorithms for decision tree extraction from categorical data is Quinlan’s ID3, which was later extended to deal with numerical attributes, resulting in an algorithm known as C4.5, and implemented in many open-sources data mining libraries, including the so-called Weka, which features an implementation of C4.5 called J48. ID3 was recently generalized to deal with temporal data in form of timelines, which can be seen as discrete (categorical) versions of multivariate time series, and such a generalization, based on the interval temporal logic HS, is known as Temporal ID3. In this paper we introduce Temporal C4.5, that allows the extraction of temporal decision trees from undiscretized multivariate time series, describe its implementation, called Temporal J48, and discuss the outcome of a set of experiments with the latter on a collection of public data sets, comparing the results with those obtained by other, classical, multivariate time series classification methods.
paper
Decision Trees; Explainable AI; Interval Temporal Logic; Time series;
English
27th International Symposium on Temporal Representation and Reasoning, TIME 2020 - 23 September 2020 through 25 September 2020
2020
Munoz-Velasco, E; Ozaki, A; Theobald, M
Leibniz International Proceedings in Informatics, LIPIcs
9783959771672
2020
178
1
16
9
https://drops.dagstuhl.de/opus/volltexte/2020/12977/
open
Sciavicco, G., Stan, I. (2020). Knowledge extraction with interval temporal logic decision trees. In Leibniz International Proceedings in Informatics, LIPIcs (pp.1-16). Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing [10.4230/LIPIcs.TIME.2020.9].
File in questo prodotto:
File Dimensione Formato  
Sciavicco-2020-Leibniz International Proceedings in Informatics, LIPIcs-VoR.pdf

accesso aperto

Descrizione: licensed under Creative Commons License CC-BY
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 497.29 kB
Formato Adobe PDF
497.29 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/524145
Citazioni
  • Scopus 24
  • ???jsp.display-item.citation.isi??? ND
Social impact