Briefing: How Clued up are LLMs? Evaluating Multi-Step Deductive Reasoning in a Text-Based Game Environment
Strategic angle: A study of the multi-step deductive reasoning capabilities of LLM agents, evaluated in a text-based version of the game Clue.