Machine learning (ML) now plays a direct role in decisions that shape people’s access to healthcare, credit, employment, and justice. When these systems learn from data that carries social bias, they can unintentionally reinforce inequality rather than help reduce it. This thesis focuses on improving fairness through in-processing strategies while keeping decision processes understandable to those affected by them. The central idea is to blend the transparency of fuzzy rules with the predictive strengths of neural networks (NNs). In doing so, the work explores how sensitive information influences model behaviour and proposes mechanisms to keep that influence under control. The thesis unfolds in three main stages. The first contribution introduces an in-processing learning strategy that uses attribution signals to identify when sensitive features dominate predictions and suppresses such behaviour during training. The second contribution updates the Adaptive Neuro-Fuzzy Inference System (ANFIS) by integrating kernel functions to increase flexibility and by injecting attribution feedback to improve stability and rule usefulness. The third contribution builds on this idea by allowing both the rule base and membership functions to evolve while the network trains, making the system more responsive to how bias appears across different data contexts. These developments are later unified into a single framework that works with commonly used attribution techniques, such as Integrated Gradients (IGs) and SmoothGrad. The framework reports fairness indicators alongside classification performance and offers interpretable, rule-level insights into where harmful bias may arise. The methods are evaluated on simulated datasets and public benchmarks, including Adult, Correctional Offender Management Profiling for alternative sanctions (COMPAS), and German Credit. Across all cases, the proposed models reduce group-level disparity measures such as statistical parity difference and disparate impact, while maintaining accuracy comparable to standard baselines. The results confirm that it is possible to improve both interpretability and fairness at the same time when neuro-fuzzy reasoning is embedded into the core of the learning process.

Machine learning (ML) now plays a direct role in decisions that shape people’s access to healthcare, credit, employment, and justice. When these systems learn from data that carries social bias, they can unintentionally reinforce inequality rather than help reduce it. This thesis focuses on improving fairness through in-processing strategies while keeping decision processes understandable to those affected by them. The central idea is to blend the transparency of fuzzy rules with the predictive strengths of neural networks (NNs). In doing so, the work explores how sensitive information influences model behaviour and proposes mechanisms to keep that influence under control. The thesis unfolds in three main stages. The first contribution introduces an in-processing learning strategy that uses attribution signals to identify when sensitive features dominate predictions and suppresses such behaviour during training. The second contribution updates the Adaptive Neuro-Fuzzy Inference System (ANFIS) by integrating kernel functions to increase flexibility and by injecting attribution feedback to improve stability and rule usefulness. The third contribution builds on this idea by allowing both the rule base and membership functions to evolve while the network trains, making the system more responsive to how bias appears across different data contexts. These developments are later unified into a single framework that works with commonly used attribution techniques, such as Integrated Gradients (IGs) and SmoothGrad. The framework reports fairness indicators alongside classification performance and offers interpretable, rule-level insights into where harmful bias may arise. The methods are evaluated on simulated datasets and public benchmarks, including Adult, Correctional Offender Management Profiling for alternative sanctions (COMPAS), and German Credit. Across all cases, the proposed models reduce group-level disparity measures such as statistical parity difference and disparate impact, while maintaining accuracy comparable to standard baselines. The results confirm that it is possible to improve both interpretability and fairness at the same time when neuro-fuzzy reasoning is embedded into the core of the learning process.

Shah, S (2026). In-Processing Neuro-Fuzzy Approaches for Fairness-Aware and Explainable Machine Learning. (Tesi di dottorato, , 2026).

In-Processing Neuro-Fuzzy Approaches for Fairness-Aware and Explainable Machine Learning

SHAH, SAHAR
2026

Abstract

Machine learning (ML) now plays a direct role in decisions that shape people’s access to healthcare, credit, employment, and justice. When these systems learn from data that carries social bias, they can unintentionally reinforce inequality rather than help reduce it. This thesis focuses on improving fairness through in-processing strategies while keeping decision processes understandable to those affected by them. The central idea is to blend the transparency of fuzzy rules with the predictive strengths of neural networks (NNs). In doing so, the work explores how sensitive information influences model behaviour and proposes mechanisms to keep that influence under control. The thesis unfolds in three main stages. The first contribution introduces an in-processing learning strategy that uses attribution signals to identify when sensitive features dominate predictions and suppresses such behaviour during training. The second contribution updates the Adaptive Neuro-Fuzzy Inference System (ANFIS) by integrating kernel functions to increase flexibility and by injecting attribution feedback to improve stability and rule usefulness. The third contribution builds on this idea by allowing both the rule base and membership functions to evolve while the network trains, making the system more responsive to how bias appears across different data contexts. These developments are later unified into a single framework that works with commonly used attribution techniques, such as Integrated Gradients (IGs) and SmoothGrad. The framework reports fairness indicators alongside classification performance and offers interpretable, rule-level insights into where harmful bias may arise. The methods are evaluated on simulated datasets and public benchmarks, including Adult, Correctional Offender Management Profiling for alternative sanctions (COMPAS), and German Credit. Across all cases, the proposed models reduce group-level disparity measures such as statistical parity difference and disparate impact, while maintaining accuracy comparable to standard baselines. The results confirm that it is possible to improve both interpretability and fairness at the same time when neuro-fuzzy reasoning is embedded into the core of the learning process.
ZOPPIS, ITALO FRANCESCO
BIANCO, SIMONE
Machine Learning; Fairness-Aware; XAI; Neuro-Fuzzy Systems; In-Processing
Machine Learning; Fairness-Aware; XAI; Neuro-Fuzzy Systems; In-Processing
English
3-mar-2026
38
2024/2025
open
Shah, S (2026). In-Processing Neuro-Fuzzy Approaches for Fairness-Aware and Explainable Machine Learning. (Tesi di dottorato, , 2026).
File in questo prodotto:
File Dimensione Formato  
phd_unimib_899134.pdf

accesso aperto

Descrizione: In-Processing Neuro-Fuzzy Approaches for Fairness-Aware and Explainable Machine Learning
Tipologia di allegato: Doctoral thesis
Dimensione 2.14 MB
Formato Adobe PDF
2.14 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/610653
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
Social impact