Evaluating Large Language Models Using Gameplay (ClemBench)

📋 Type MA thesis
Status running
📅 Duration May 1, 2025 – Nov 1, 2025
👤 Primary supervisors Vincent Christlein, Andreas Maier, Raffaella Bernardi (University of Bolzano)
🎓 Student Mohammed Jawwad (Artificial Intelligence)