On Using Self-Report Studies to Analyze Language Models
DOI: https://doi.org/10.3384/nejlt.2000-1533.2024.5000
Abstract
We are at a curious point in time where our ability to build language models (LMs) has outpaced our ability to analyze them. We do not really know how to reliably determine their capabilities, biases, dangers, knowledge, and so on. The benchmarks we have are often overly specific, do not generalize well, and are susceptible to data leakage. Recently, I have noticed a trend of using self-report studies, such as various polls and questionnaires originally designed for humans, to analyze the properties of LMs. I think that this approach can easily lead to false results, which can be quite dangerous considering the current discussions on AI safety, governance, and regulation. To illustrate my point, I will delve deeper into several papers that employ self-report methodologies and I will try to highlight some of their weaknesses.
License
Copyright (c) 2024 Matúš Pikuliak
This work is licensed under a Creative Commons Attribution 4.0 International License.