Systematic Evaluation of GPT-3 for Zero-Shot Personality Estimation

Ganesan, Adithya V; Lal, Yash Kumar; Nilsson, August; Schwartz, H. Andrew

Ganesan, Adithya V; Lal, Yash Kumar; Nilsson, August; Schwartz, H. Andrew

Chapter, Peer reviewed, Conference object

Published version

Åpne

2023.wassa-1.34.pdf (1.132Mb)

Permanent lenke

https://hdl.handle.net/11250/3120496

Utgivelsesdato

2023

Sammendrag

Very large language models (LLMs) perform extremely well on a spectrum of NLP tasks in a zero-shot setting. However, little is known about their performance on human-level NLP problems which rely on understanding psychological concepts, such as assessing personality traits. In this work, we investigate the zero-shot ability of GPT-3 to estimate the Big 5 personality traits from users’ social media posts. Through a set of systematic experiments, we find that zero-shot GPT-3 performance is somewhat close to an existing pre-trained SotA for broad classification upon injecting knowledge about the trait in the prompts. However, when prompted to provide fine-grained classification, its performance drops to close to a simple most frequent class (MFC) baseline. We further analyze where GPT-3 performs better, as well as worse, than a pretrained lexical model, illustrating systematic errors that suggest ways to improve LLMs on human-level NLP tasks. The code for this project is available on Github1

Utgiver

Association for Computational Linguistics

Serie

ACL Anthology;

Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal