77.svg

Overview

<aside> 💡 AI bias is a real concern in generative AI systems. The source of the bias is the training data itself, and even from the reinforcement learning as well.

Even before the launch of chatGPT, in 2016, Microsoft’s Tay chatbot was a cautionary tale. Gemini’s image generator misfire in early 2024 showed how even well-intentioned ‘tuners’ of AI could make a situation worse.

And, there’s the Waluigi Effect. It’s a well-known principle amongst LLM designers that says, if an AI has been trained on some form of ethical/ discourse, it is more likely for it to be able to do the opposite; i.e., training it on the ‘right thing’ makes it more possible for it to do the ‘wrong thing’.

</aside>

Known Dangers

Mitigation Strategies

To address these dangers, several strategies can be employed:


Sources: