A fundamental feature of the software process consists in its own stochastic nature. A convenient approach for extracting the stochastic dynamics of a process from log data is that of modelling the process as a Markov model: in this way the discovery of the short/medium range dynamics of the process is cast in terms of the learning of Markov models of different orders, i.e. in terms of learning the corresponding transition matrices. In this paper we show that the use of a full Bayesian approach in the learning process helps providing robustness against statistical noise and over-fitting, as the size of a transition matrix grows exponentially with the order of the model. We give a specific model-model similarity definition and the corresponding calculation procedure to be used in model-to-sequence or sequence-to-sequence conformance assessment, this similarity definition could also be applied to other inferential tasks, such as unsupervised process learning.

Colombo, A., Damiani, E., Gianini, G. (2006). Discovering the software process by means of stochastic workflow analysis. JOURNAL OF SYSTEMS ARCHITECTURE, 52(11), 684-692 [10.1016/j.sysarc.2006.06.012].

Discovering the software process by means of stochastic workflow analysis

Gianini, G
2006

Abstract

A fundamental feature of the software process consists in its own stochastic nature. A convenient approach for extracting the stochastic dynamics of a process from log data is that of modelling the process as a Markov model: in this way the discovery of the short/medium range dynamics of the process is cast in terms of the learning of Markov models of different orders, i.e. in terms of learning the corresponding transition matrices. In this paper we show that the use of a full Bayesian approach in the learning process helps providing robustness against statistical noise and over-fitting, as the size of a transition matrix grows exponentially with the order of the model. We give a specific model-model similarity definition and the corresponding calculation procedure to be used in model-to-sequence or sequence-to-sequence conformance assessment, this similarity definition could also be applied to other inferential tasks, such as unsupervised process learning.
Articolo in rivista - Articolo scientifico
Bayesian methods; Machine learning; Markov chains; Similarity measures; Software process; Stochastic dynamics; Time-sequence analysis; Workflow management;
English
2006
52
11
684
692
none
Colombo, A., Damiani, E., Gianini, G. (2006). Discovering the software process by means of stochastic workflow analysis. JOURNAL OF SYSTEMS ARCHITECTURE, 52(11), 684-692 [10.1016/j.sysarc.2006.06.012].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/455398
Citazioni
  • Scopus 10
  • ???jsp.display-item.citation.isi??? 7
Social impact