Large-scale image annotation remains a critical bottleneck in training high-performing machine learning models, particularly for multi-class classification tasks. Existing annotation tools often lack dedicated support for classification and fail to integrate human-in-the-loop strategies that balance automation with expert supervision. In this paper, we introduce PyTAGIT, an open-source annotation framework designed for fast, scalable, and interactive multi-class labeling. PyTAGIT combines AI-assisted classification with intuitive user interactions such as drag-and-drop assignment, t-SNE-based exploration, and iterative refinement through confidence thresholds. The tool operates efficiently even on mid-range laptops and supports datasets with thousands of samples and dozens of classes. Extensive experiments on seven diverse datasets demonstrate that PyTAGIT achieves the best annotation accuracy on six datasets and second-best on the remaining one, significantly outperforming both traditional manual tagging and vision-language baselines (CLIP, LLM+CLIP, BLIP-2). Furthermore, PyTAGIT consistently completes full dataset annotation within a one-hour time budget, drastically reducing annotation time compared to manual and automatic alternatives.

Piccoli, F., Rota, C., Kumar, R., Ciocca, G. (2026). PyTAGIT: a scalable and interactive human-in-the-loop tool for fast image annotation. NEURAL COMPUTING & APPLICATIONS, 38(7) [10.1007/s00521-025-11725-1].

PyTAGIT: a scalable and interactive human-in-the-loop tool for fast image annotation

Piccoli, Flavio
;
Rota, Claudio;Kumar, Rajesh;Ciocca, Gianluigi
2026

Abstract

Large-scale image annotation remains a critical bottleneck in training high-performing machine learning models, particularly for multi-class classification tasks. Existing annotation tools often lack dedicated support for classification and fail to integrate human-in-the-loop strategies that balance automation with expert supervision. In this paper, we introduce PyTAGIT, an open-source annotation framework designed for fast, scalable, and interactive multi-class labeling. PyTAGIT combines AI-assisted classification with intuitive user interactions such as drag-and-drop assignment, t-SNE-based exploration, and iterative refinement through confidence thresholds. The tool operates efficiently even on mid-range laptops and supports datasets with thousands of samples and dozens of classes. Extensive experiments on seven diverse datasets demonstrate that PyTAGIT achieves the best annotation accuracy on six datasets and second-best on the remaining one, significantly outperforming both traditional manual tagging and vision-language baselines (CLIP, LLM+CLIP, BLIP-2). Furthermore, PyTAGIT consistently completes full dataset annotation within a one-hour time budget, drastically reducing annotation time compared to manual and automatic alternatives.
Articolo in rivista - Articolo scientifico
Human-in-the-loop annotation, Interactive image labeling, AI-assisted classification, Large-scale dataset annotation, Pseudo-labeling
English
25-mar-2026
2026
38
7
226
none
Piccoli, F., Rota, C., Kumar, R., Ciocca, G. (2026). PyTAGIT: a scalable and interactive human-in-the-loop tool for fast image annotation. NEURAL COMPUTING & APPLICATIONS, 38(7) [10.1007/s00521-025-11725-1].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/598962
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact