Improving Language Model Personas via Rationalization with Psychological Scaffolds
Authors: Brihi Joshi†, Xiang Ren†, Swabha Swayamdipta†, Rik Koncel-Kedziorski, Tim Paek
Language models prompted with a user description or persona are being used to predict the user's preferences and opinions. However, existing approaches to building personas mostly rely on a user's demographic attributes and/or prior judgments, not on any underlying reasoning behind those judgments. We introduce PB&J (Psychology of Behavior and Judgments), a framework that improves LM personas by incorporating potential rationales for why the user could have made a certain judgment. Our rationales are generated by a language model to explicitly reason about a user's behavior on the basis of their experiences, personality traits, or beliefs. Our method employs psychological scaffolds: structured frameworks such as the Big 5 Personality Traits or Primal World Beliefs that help ground the generated rationales in existing theories. Experiments on public opinion and movie preference prediction tasks demonstrate that language model personas augmented with PB&J rationales consistently outperform personas conditioned only on user demographics and/or judgments, including those that use the model's default chain-of-thought, which is not grounded in psychological theories. Additionally, our PB&J personas perform competitively with those using human-written rationales, suggesting the potential of synthetic rationales guided by existing theories.
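To make the pipeline concrete, here is a minimal sketch of how scaffold-grounded rationalization could be wired up. This is an illustration, not the authors' implementation: the function names, prompt wording, and the `generate` callable (any text-in, text-out LM interface) are assumptions; only the scaffold names come from the abstract, and the dimension lists are the standard ones from the underlying theories.

```python
# Minimal sketch of a PB&J-style persona pipeline (illustrative, not the
# paper's actual code). `generate` is assumed to be any text-in/text-out
# language-model interface supplied by the caller.
from typing import Callable, List

# Scaffolds named in the abstract; dimensions are the standard ones from
# the underlying psychological theories, not taken from the paper itself.
SCAFFOLDS = {
    "big5": ["openness", "conscientiousness", "extraversion",
             "agreeableness", "neuroticism"],
    "primals": ["the world is safe", "the world is enticing",
                "the world is alive"],
}

def generate_rationales(generate: Callable[[str], str],
                        demographics: str,
                        judgments: List[str],
                        scaffold: str = "big5") -> List[str]:
    """Ask an LM to rationalize each prior judgment, grounded in a scaffold."""
    dims = ", ".join(SCAFFOLDS[scaffold])
    rationales = []
    for judgment in judgments:
        prompt = (
            f"User demographics: {demographics}\n"
            f"Observed judgment: {judgment}\n"
            f"In one sentence, explain why this user might hold this judgment, "
            f"referring to their experiences, personality traits, or beliefs "
            f"along these dimensions: {dims}."
        )
        rationales.append(generate(prompt))
    return rationales

def build_persona(demographics: str, judgments: List[str],
                  rationales: List[str]) -> str:
    """Assemble the augmented persona: demographics + judgments + rationales."""
    lines = [f"Demographics: {demographics}"]
    lines += [f"Judgment: {j}\nRationale: {r}"
              for j, r in zip(judgments, rationales)]
    return "\n".join(lines)

def predict(generate: Callable[[str], str], persona: str, question: str) -> str:
    """Condition the LM on the augmented persona to predict a new judgment."""
    return generate(f"{persona}\n\nPredict this user's answer to: {question}")
```

The key design point the abstract emphasizes is the scaffold: rather than letting the model free-associate a chain-of-thought, the rationale prompt constrains its reasoning to dimensions from an established psychological theory.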
PrimeX: A Dataset of Worldview, Opinion, and Explanation
October 27, 2025. Research areas: Data Science and Annotation; Speech and Natural Language Processing. Conference: EMNLP.
As the adoption of language models advances, so does the need to better represent individual users to the model. Are there aspects of an individual’s belief system that a language model can utilize for improved alignment? Following prior research, we investigate this question in the domain of opinion prediction by developing PrimeX, a dataset of public opinion survey data from 858 US residents with two additional sources of belief information:…
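As a rough illustration of the dataset's shape, a per-respondent record might look like the sketch below. The field names are hypothetical, and the two additional belief sources are left as generic placeholders because the excerpt above is truncated before naming them.

```python
# Hypothetical sketch of a per-respondent PrimeX record. Field names are
# illustrative assumptions, not the dataset's actual schema; the two extra
# belief sources stay generic since the abstract excerpt is truncated.
from dataclasses import dataclass, field
from typing import Dict, List

@dataclass
class Respondent:
    respondent_id: str
    demographics: Dict[str, str]   # e.g., {"age": "34", "state": "CA"}
    opinions: Dict[str, str]       # survey question id -> response
    belief_source_1: List[str] = field(default_factory=list)  # placeholder
    belief_source_2: List[str] = field(default_factory=list)  # placeholder
```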
PersonaTeaming: Exploring How Introducing Personas Can Improve Automated AI Red-Teaming
September 26, 2025. Research areas: Human-Computer Interaction; Methods and Algorithms. Workshop at NeurIPS.
This paper was accepted at the Workshop on Regulatable ML (ReML) at NeurIPS 2025.
Recent developments in AI governance and safety research have called for red-teaming methods that can effectively surface potential risks posed by AI models. Many of these calls have emphasized how the identities and backgrounds of red-teamers can shape their red-teaming strategies, and thus the kinds of risks they are likely to uncover. While automated red-teaming…