New and evolving Artificial Intelligence (AI) applications span across the full spectrum of computing resources, seamlessly integrating across Edge and Cloud platforms. Deploying applications at the network edge reduces latency, while cloud computing ensures a higher processing power. Managing resources in such a dynamic and diverse environment requires a strategic approach to satisfy Quality of Service requirements and minimize costs. To address this challenge, we propose FIGARO (reinForcement learnInG mAnagement acRoss computing cOntinuum), which exploits offline training and imitation learning to speed up the training of reinforcement learning-based agents able to control resources in the full cloud continuum stack. By extending our framework, we designed a hierarchical system structure and tested agents that only need to manage one computational layer at a time. This approach enables the system to efficiently manage multiple application components in complex AI pipelines. The results demonstrate the effectiveness of the hierarchical method, as the local agents dynamically scale computational resources, limiting QoS constraint violations to a maximum of 1.4% in the reference use-case application.

Cavadini, R., Sedghani, H., Filippini, F., Ardagna, D. (2026). Runtime Management of Artificial Intelligence Applications Through Hierarchical Reinforcement Learning. In Performance Evaluation Methodologies and Tools 17th EAI International Conference, Valuetools 2024, Milan, Italy, December 12–13, 2024, Proceedings (pp.252-273). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-032-06818-7_14].

Runtime Management of Artificial Intelligence Applications Through Hierarchical Reinforcement Learning

Filippini F.;
2026

Abstract

New and evolving Artificial Intelligence (AI) applications span across the full spectrum of computing resources, seamlessly integrating across Edge and Cloud platforms. Deploying applications at the network edge reduces latency, while cloud computing ensures a higher processing power. Managing resources in such a dynamic and diverse environment requires a strategic approach to satisfy Quality of Service requirements and minimize costs. To address this challenge, we propose FIGARO (reinForcement learnInG mAnagement acRoss computing cOntinuum), which exploits offline training and imitation learning to speed up the training of reinforcement learning-based agents able to control resources in the full cloud continuum stack. By extending our framework, we designed a hierarchical system structure and tested agents that only need to manage one computational layer at a time. This approach enables the system to efficiently manage multiple application components in complex AI pipelines. The results demonstrate the effectiveness of the hierarchical method, as the local agents dynamically scale computational resources, limiting QoS constraint violations to a maximum of 1.4% in the reference use-case application.
paper
Artificial Intelligence; Computing Continuum; Reinforcement Learning;
English
17th EAI International Conference on Performance Evaluation Methodologies and Tools, Valuetools 2024 - December 12–13, 2024
2024
Gribaudo, M; Iacono, M; Sarvestani, SS
Performance Evaluation Methodologies and Tools 17th EAI International Conference, Valuetools 2024, Milan, Italy, December 12–13, 2024, Proceedings
9783032068170
2026
663
252
273
embargoed_20270102
Cavadini, R., Sedghani, H., Filippini, F., Ardagna, D. (2026). Runtime Management of Artificial Intelligence Applications Through Hierarchical Reinforcement Learning. In Performance Evaluation Methodologies and Tools 17th EAI International Conference, Valuetools 2024, Milan, Italy, December 12–13, 2024, Proceedings (pp.252-273). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-032-06818-7_14].
File in questo prodotto:
File Dimensione Formato  
Cavadini et al-2026-EAI-AAM.pdf

embargo fino al 02/01/2027

Tipologia di allegato: Author’s Accepted Manuscript, AAM (Post-print)
Licenza: Licenza open access specifica dell’editore
Dimensione 4.5 MB
Formato Adobe PDF
4.5 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/601084
Citazioni
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
Social impact