New and evolving Artificial Intelligence (AI) applications span across the full spectrum of computing resources, seamlessly integrating across Edge and Cloud platforms. Deploying applications at the network edge reduces latency, while cloud computing ensures a higher processing power. Managing resources in such a dynamic and diverse environment requires a strategic approach to satisfy Quality of Service requirements and minimize costs. To address this challenge, we propose FIGARO (reinForcement learnInG mAnagement acRoss computing cOntinuum), which exploits offline training and imitation learning to speed up the training of reinforcement learning-based agents able to control resources in the full cloud continuum stack. By extending our framework, we designed a hierarchical system structure and tested agents that only need to manage one computational layer at a time. This approach enables the system to efficiently manage multiple application components in complex AI pipelines. The results demonstrate the effectiveness of the hierarchical method, as the local agents dynamically scale computational resources, limiting QoS constraint violations to a maximum of 1.4% in the reference use-case application.
Cavadini, R., Sedghani, H., Filippini, F., Ardagna, D. (2026). Runtime Management of Artificial Intelligence Applications Through Hierarchical Reinforcement Learning. In Performance Evaluation Methodologies and Tools 17th EAI International Conference, Valuetools 2024, Milan, Italy, December 12–13, 2024, Proceedings (pp.252-273). Springer Science and Business Media Deutschland GmbH [10.1007/978-3-032-06818-7_14].
Runtime Management of Artificial Intelligence Applications Through Hierarchical Reinforcement Learning
Filippini F.;
2026
Abstract
New and evolving Artificial Intelligence (AI) applications span across the full spectrum of computing resources, seamlessly integrating across Edge and Cloud platforms. Deploying applications at the network edge reduces latency, while cloud computing ensures a higher processing power. Managing resources in such a dynamic and diverse environment requires a strategic approach to satisfy Quality of Service requirements and minimize costs. To address this challenge, we propose FIGARO (reinForcement learnInG mAnagement acRoss computing cOntinuum), which exploits offline training and imitation learning to speed up the training of reinforcement learning-based agents able to control resources in the full cloud continuum stack. By extending our framework, we designed a hierarchical system structure and tested agents that only need to manage one computational layer at a time. This approach enables the system to efficiently manage multiple application components in complex AI pipelines. The results demonstrate the effectiveness of the hierarchical method, as the local agents dynamically scale computational resources, limiting QoS constraint violations to a maximum of 1.4% in the reference use-case application.| File | Dimensione | Formato | |
|---|---|---|---|
|
Cavadini et al-2026-EAI-AAM.pdf
embargo fino al 02/01/2027
Tipologia di allegato:
Author’s Accepted Manuscript, AAM (Post-print)
Licenza:
Licenza open access specifica dell’editore
Dimensione
4.5 MB
Formato
Adobe PDF
|
4.5 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


