
Hi all! My name is Simone Van Taylor, and I am a data scientist with a deep passion for designing language models that don’t perpetuate harms like racism and misogyny.

For my 2024 AI Safety Capstone Project, I aggregated, unified, and analyzed existing open-source red-teaming datasets aimed at identifying stereotypes, discrimination, hate speech, and other representation harms in text-based LLMs.

Check out the final dataset 🤗

Learn more about all the datasets considered 🤗

Explore sample prompts 🚨

Why?

At A Glance

Key Takeaways

Bar chart showing prevalence and intersectionality by category

Dig into curated prompt examples in the analysis section!