Assessing Moral Judgment by Large Language Models – A Survey of Available Datasets

Hannah Clausen; Andrey Kutuzov; Anna Smajdor; Erik Velldal

doi:10.3384/nejlt.2000-1533.2026.6366

Authors

Hannah Clausen, Department of Informatics, University of Oslo, https://orcid.org/0009-0000-7876-8536
Andrey Kutuzov, Department of Informatics, University of Oslo, https://orcid.org/0000-0003-2540-5912
Anna Smajdor, Department of Philosophy, Classics, History of Art and Ideas, University of Oslo, https://orcid.org/0000-0002-9752-6302
Erik Velldal, Department of Informatics, University of Oslo, https://orcid.org/0009-0008-6479-4512

Abstract

Recent advances in language modeling have contributed to a growing emphasis on machine ethics, including research on and assessment of moral judgments made by large language models (LLMs). This paper provides a critical survey of the existing datasets for this assessment, with a special focus on their data sources. We address the current lack of theoretical grounding by providing an introduction to ethics and to different frameworks from moral philosophy and moral psychology. Moreover, we identify four main data sources: web-crawled corpora, scholars, laypeople, and synthetic data generation. By discussing the strengths and weaknesses of these sources, we analyze their implications for the assessment of moral judgment. Importantly, systematizing the available datasets reveals an over-reliance on previous work, which reinforces existing shortcomings. To address the current limitations, we recommend adopting a consistent terminology and creating independently curated datasets based on interdisciplinary work. To ensure a clear delineation of normative approaches, we propose focusing on the assessment of moral consistency and certainty of LLMs as effective and well-defined indicators of their performance on moral judgment.
