paper_type.json (263B)
{
  "paper_type": "benchmark-creation",
  "reason": "The primary contribution is HalluLens, a new hallucination benchmark with a taxonomy and three dynamically-generated evaluation tasks; the LLM experiments are baselines demonstrating the benchmark's utility."
}