Treffer: BenchING: A Benchmark for Evaluating Large Language Models in Following Structured Output Format Instruction in Text-Based Narrative Game Tasks
Title:
BenchING: A Benchmark for Evaluating Large Language Models in Following Structured Output Format Instruction in Text-Based Narrative Game Tasks
Source:
IEEE Transactions on Games IEEE Trans. Games Games, IEEE Transactions on. 17(3):665-675 Sep, 2025
Database:
IEEE Xplore Digital Library