Identifying and Manipulating Personality Traits in LLMs Through Activation Engineering
We introduce Abliterator, a framework for studying and manipulating personality traits in large language models (LLMs) through activation engineering.
Experienced AI Engineer and Data Scientist with a strong background in machine learning, natural language processing, and data analytics. My interdisciplinary interests extend to classics, philosophy, and music.
We introduce Abliterator, a framework for studying and manipulating personality traits in large language models (LLMs) through activation engineering.