paper_type.json (315B)
1 { 2 "paper_type": "benchmark-creation", 3 "reason": "The paper introduces FVEval, a first-of-its-kind comprehensive benchmark for evaluating LLMs on hardware formal verification with three sub-tasks and 571 test instances; while it includes baseline experiments, the primary contribution is the benchmark itself." 4 }