REPO: Detoxifying LLMs via Representation Erasure-based Preference Optimization

Publication
CATS Workshop at ICML, Poster
Date