OpenToM: A Comprehensive Benchmark for Evaluating Theory-of-Mind Reasoning Capabilities of Large Language Models (2024)
Abstract
No abstract provided
Bibliographic Information
Digital Object Identifier: http://dx.doi.org/10.18653/v1/2024.acl-long.466
Publication URI: http://dx.doi.org/10.18653/v1/2024.acl-long.466
Type: Conference/Paper/Proceeding/Abstract