Bicocca Open Archive

Machine learning (ML) now plays a direct role in decisions that shape people’s access to healthcare, credit, employment, and justice. When these systems learn from data that carries social bias, they can unintentionally reinforce inequality rather than help reduce it. This thesis focuses on improving fairness through in-processing strategies while keeping decision processes understandable to those affected by them. The central idea is to blend the transparency of fuzzy rules with the predictive strengths of neural networks (NNs). In doing so, the work explores how sensitive information influences model behaviour and proposes mechanisms to keep that influence under control. The thesis unfolds in three main stages. The first contribution introduces an in-processing learning strategy that uses attribution signals to identify when sensitive features dominate predictions and suppresses such behaviour during training. The second contribution updates the Adaptive Neuro-Fuzzy Inference System (ANFIS) by integrating kernel functions to increase flexibility and by injecting attribution feedback to improve stability and rule usefulness. The third contribution builds on this idea by allowing both the rule base and membership functions to evolve while the network trains, making the system more responsive to how bias appears across different data contexts. These developments are later unified into a single framework that works with commonly used attribution techniques, such as Integrated Gradients (IGs) and SmoothGrad. The framework reports fairness indicators alongside classification performance and offers interpretable, rule-level insights into where harmful bias may arise. The methods are evaluated on simulated datasets and public benchmarks, including Adult, Correctional Offender Management Profiling for alternative sanctions (COMPAS), and German Credit. Across all cases, the proposed models reduce group-level disparity measures such as statistical parity difference and disparate impact, while maintaining accuracy comparable to standard baselines. The results confirm that it is possible to improve both interpretability and fairness at the same time when neuro-fuzzy reasoning is embedded into the core of the learning process.

Machine learning (ML) now plays a direct role in decisions that shape people’s access to healthcare, credit, employment, and justice. When these systems learn from data that carries social bias, they can unintentionally reinforce inequality rather than help reduce it. This thesis focuses on improving fairness through in-processing strategies while keeping decision processes understandable to those affected by them. The central idea is to blend the transparency of fuzzy rules with the predictive strengths of neural networks (NNs). In doing so, the work explores how sensitive information influences model behaviour and proposes mechanisms to keep that influence under control. The thesis unfolds in three main stages. The first contribution introduces an in-processing learning strategy that uses attribution signals to identify when sensitive features dominate predictions and suppresses such behaviour during training. The second contribution updates the Adaptive Neuro-Fuzzy Inference System (ANFIS) by integrating kernel functions to increase flexibility and by injecting attribution feedback to improve stability and rule usefulness. The third contribution builds on this idea by allowing both the rule base and membership functions to evolve while the network trains, making the system more responsive to how bias appears across different data contexts. These developments are later unified into a single framework that works with commonly used attribution techniques, such as Integrated Gradients (IGs) and SmoothGrad. The framework reports fairness indicators alongside classification performance and offers interpretable, rule-level insights into where harmful bias may arise. The methods are evaluated on simulated datasets and public benchmarks, including Adult, Correctional Offender Management Profiling for alternative sanctions (COMPAS), and German Credit. Across all cases, the proposed models reduce group-level disparity measures such as statistical parity difference and disparate impact, while maintaining accuracy comparable to standard baselines. The results confirm that it is possible to improve both interpretability and fairness at the same time when neuro-fuzzy reasoning is embedded into the core of the learning process.

Shah, S (2026). In-Processing Neuro-Fuzzy Approaches for Fairness-Aware and Explainable Machine Learning. (Tesi di dottorato, , 2026).

In-Processing Neuro-Fuzzy Approaches for Fairness-Aware and Explainable Machine Learning

SHAH, SAHAR

2026

Abstract

Machine learning (ML) now plays a direct role in decisions that shape people’s access to healthcare, credit, employment, and justice. When these systems learn from data that carries social bias, they can unintentionally reinforce inequality rather than help reduce it. This thesis focuses on improving fairness through in-processing strategies while keeping decision processes understandable to those affected by them. The central idea is to blend the transparency of fuzzy rules with the predictive strengths of neural networks (NNs). In doing so, the work explores how sensitive information influences model behaviour and proposes mechanisms to keep that influence under control. The thesis unfolds in three main stages. The first contribution introduces an in-processing learning strategy that uses attribution signals to identify when sensitive features dominate predictions and suppresses such behaviour during training. The second contribution updates the Adaptive Neuro-Fuzzy Inference System (ANFIS) by integrating kernel functions to increase flexibility and by injecting attribution feedback to improve stability and rule usefulness. The third contribution builds on this idea by allowing both the rule base and membership functions to evolve while the network trains, making the system more responsive to how bias appears across different data contexts. These developments are later unified into a single framework that works with commonly used attribution techniques, such as Integrated Gradients (IGs) and SmoothGrad. The framework reports fairness indicators alongside classification performance and offers interpretable, rule-level insights into where harmful bias may arise. The methods are evaluated on simulated datasets and public benchmarks, including Adult, Correctional Offender Management Profiling for alternative sanctions (COMPAS), and German Credit. Across all cases, the proposed models reduce group-level disparity measures such as statistical parity difference and disparate impact, while maintaining accuracy comparable to standard baselines. The results confirm that it is possible to improve both interpretability and fairness at the same time when neuro-fuzzy reasoning is embedded into the core of the learning process.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tutor afferente a Bicocca
	
				ZOPPIS, ITALO FRANCESCO
BIANCO, SIMONE
			
	Parole chiave
	
				Machine Learning; Fairness-Aware; XAI; Neuro-Fuzzy Systems; In-Processing
			
	Parole chiave
	
				Machine Learning; Fairness-Aware; XAI; Neuro-Fuzzy Systems; In-Processing
			
	* Lingua del contenuto
	
				English
			
	* Data di discussione
	
				3-mar-2026
			
	* Ciclo di dottorato
	
				38
			
	* Anno accademico di conseguimento titolo
	
				2024/2025
			
	Fulltext
	
				open
			
	Citazione
	
				Shah, S (2026). In-Processing Neuro-Fuzzy Approaches for Fairness-Aware and Explainable Machine Learning. (Tesi di dottorato, , 2026).
			
	Appare nelle tipologie:
	
				07 - Tesi di dottorato Bicocca post 2009

File in questo prodotto:

File	Dimensione	Formato
phd_unimib_899134.pdf accesso aperto Descrizione: In-Processing Neuro-Fuzzy Approaches for Fairness-Aware and Explainable Machine Learning Tipologia di allegato: Doctoral thesis Dimensione 2.14 MB Formato Adobe PDF Visualizza/Apri	2.14 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/610653

Citazioni

ND

ND

Social impact