As the adoption of language models advances, so does the need to better represent individual users to the model. Are there aspects of an individual’s belief system that a language model can utilize for improved alignment? Following prior research, we investigate this question in the domain of opinion prediction by developing PrimeX, a dataset of public opinion survey data from 858 US residents with two additional sources of belief information: written explanations from the respondents for why they hold specific opinions, and the Primal World Belief survey for assessing respondent worldview. We provide an extensive initial analysis of our data and show the value of belief explanations and worldview for personalizing language models. Our results demonstrate how the additional belief information in PrimeX can benefit both the NLP and psychological research communities, opening up avenues for further study.
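To make the personalization setup concrete, below is a minimal sketch of how a respondent record combining the three belief sources described above (survey opinions, written explanations, and worldview scores) might be assembled into an opinion-prediction prompt. The `Respondent` schema, field names, and prompt format are illustrative assumptions, not PrimeX's actual data format.

```python
# Hypothetical sketch: building a personalized opinion-prediction prompt
# from a PrimeX-style respondent record. All field names are assumptions.
from dataclasses import dataclass


@dataclass
class Respondent:
    """One survey respondent with three sources of belief information."""
    respondent_id: str
    opinions: dict[str, str]      # question id -> recorded answer
    explanations: dict[str, str]  # question id -> written rationale
    primals: dict[str, float]     # worldview dimension -> score


def build_prompt(person: Respondent, target_question: str) -> str:
    """Condition the model on the respondent's stated opinions,
    explanations, and worldview before asking for a prediction
    on a held-out question."""
    lines = ["You will predict a survey respondent's opinion."]

    lines.append("\nWorldview (primal world belief scores):")
    for dim, score in person.primals.items():
        lines.append(f"- {dim}: {score:.1f}")

    lines.append("\nPreviously stated opinions and explanations:")
    for qid, answer in person.opinions.items():
        rationale = person.explanations.get(qid, "")
        lines.append(f"- Q: {qid} | A: {answer} | Why: {rationale}")

    lines.append(f"\nPredict this respondent's answer to: {target_question}")
    return "\n".join(lines)


if __name__ == "__main__":
    demo = Respondent(
        respondent_id="r001",
        opinions={"gun_policy": "stricter laws"},
        explanations={"gun_policy": "I worry about safety in my community."},
        primals={"Safe": 2.1, "Good": 3.4},
    )
    print(build_prompt(demo, "Should college be tuition-free?"))
```

Under this framing, the held-out question plays the role of the prediction target, and the two additional belief sources simply extend the conditioning context; whether to include explanations, worldview scores, or both is exactly the kind of ablation such a dataset supports.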
