paper_type.json (232B)
1 { 2 "paper_type": "benchmark-creation", 3 "reason": "Introduces a new episodic memory evaluation benchmark for LLMs with baseline experiments on state-of-the-art models; the primary contribution is the benchmark framework itself." 4 }