"Motivations for methods in explainable artificial intelligence (XAI) often include detecting, quantifying and mitigating bias, and contributing to making machine learning models fairer. However, exactly how an XAI method can help in combating biases is often left unspecified. In this paper, we briefly review trends in explainability and fairness in NLP research, identify the current
practices in which explainability methods are applied to detect and mitigate bias, and investigate the barriers preventing XAI methods from being used more widely in tackling fairness issues."
A very relevant paper, as large language models are raising growing AI-safety concerns.
As the paper describes, explainability methods have so far been applied to ML fairness only in narrow settings: feature-based bias understanding and hate speech detection. Although mitigating bias is one of the stated motivations for model transparency, NLP fairness and interpretability research still struggle to find common ground.
NLP fairness work mostly targets outcome fairness, i.e., predictions that are invariant across demographic groups, whereas most XAI methods produce local, instance-level explanations. XAI is instead well suited to probing procedural fairness: whether the model's reasoning is biased across groups. We still struggle to generalize local explanations into global claims, to identify biases without human supervision, and to quantify how biases change.
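To make the outcome-fairness notion concrete, here is a minimal sketch (not from the paper) of a counterfactual invariance check: swap the identity term in a template and see whether a classifier's score moves. The identity list and `toy_score` are illustrative stand-ins for a real hate-speech model.

```python
# Hedged sketch of a counterfactual outcome-fairness check.
# `toy_score` is a hypothetical classifier, deliberately biased so the check fires.

IDENTITY_TERMS = ["women", "men", "muslims", "christians"]  # illustrative only

def toy_score(text: str) -> float:
    """Stand-in classifier: flags texts containing 'hate'."""
    t = text.lower()
    base = 0.6 if "hate" in t else 0.1
    # A deliberately biased bump for one group, so the gap below is nonzero.
    return min(1.0, base + (0.3 if "muslims" in t else 0.0))

def counterfactual_gap(template: str) -> float:
    """Max score difference when the identity slot is swapped across groups."""
    scores = [toy_score(template.format(group=g)) for g in IDENTITY_TERMS]
    return max(scores) - min(scores)

if __name__ == "__main__":
    gap = counterfactual_gap("some people just hate {group}")
    print(f"counterfactual score gap: {gap:.2f}")  # a large gap signals outcome unfairness
```

Note this only tests outcomes; it says nothing about whether the model's internal reasoning differs across groups, which is exactly the gap XAI is supposed to fill.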
The issue of "fairwashing" is also becoming increasingly concerning, since we have no guarantee that current explanation methods actually reflect the inner workings of the model.
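One partial safeguard is a faithfulness sanity check in the spirit of the model-randomization test of Adebayo et al. (2018): if an explanation barely changes when the model's parameters are randomized, it cannot be tracking the model's reasoning. Below is a hedged sketch on a toy linear scorer; all names and the setup are illustrative assumptions, not the paper's method.

```python
# Sanity-check sketch: compare gradient-x-input saliency under trained vs.
# randomized weights for a linear scorer f(x) = w @ x. A high rank correlation
# would mean the explanation is insensitive to the model -- a red flag.

import numpy as np

rng = np.random.default_rng(0)

def saliency(w: np.ndarray, x: np.ndarray) -> np.ndarray:
    # Gradient-x-input attribution; for a linear model the gradient is just w.
    return w * x

def rank_corr(a: np.ndarray, b: np.ndarray) -> float:
    # Spearman correlation computed via ranks.
    ra, rb = a.argsort().argsort(), b.argsort().argsort()
    return float(np.corrcoef(ra, rb)[0, 1])

x = rng.normal(size=50)          # one input
w_trained = rng.normal(size=50)  # stand-in for learned weights
w_random = rng.normal(size=50)   # same model with randomized parameters

corr = rank_corr(np.abs(saliency(w_trained, x)), np.abs(saliency(w_random, x)))
print(f"saliency rank correlation after randomization: {corr:.2f}")
# Near-zero correlation is the expected, healthy outcome here.
```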
At the end of the day, more representative and less biased datasets remain the key to AI fairness.