A community-driven registry for Claude, Cursor, Windsurf, Cline & more. Not affiliated with Anthropic.
Are you the author? Sign in to claim
A lightweight, agent-style framework for fact-checking atomic claims using iterative retrieval and verification. Reduces
FIRE is a novel agent-based framework for fact-checking atomic claims, designed to integrate evidence retrieval and claim verification in an iterative and cost-effective manner. Unlike traditional systems that fix the number of web queries before verifying, FIRE dynamically decides whether to stop or continue querying based on confidence.
Compared to previous systems like FACTCHECKGPT, FACTOOL, and SAFE, FIRE:
Input Claim
│
▼
[FIRE Decision Module]
├── confident → Output Label (True / False)
└── uncertain → Generate Search Query
│
▼
Web Search (via SerperAPI)
│
▼
Update Evidence Set
│
└── Loop until confident or max steps
FIRE is compared against state-of-the-art frameworks including FactCheckGPT, FACTOOL, and SAFE.
git clone https://github.com/mbzuai-nlp/fire.git
cd fire
pip install -r requirements.txt
# Run FIRE with GPT-4o-mini
python run_fire.py --model gpt-4o-mini --dataset factcheck_bench
@inproceedings{xie-etal-2025-fire,
address = {Albuquerque, New Mexico},
author = {Xie, Zhuohan and
Xing, Rui and
Wang, Yuxia and
Geng, Jiahui and
Iqbal, Hasan and
Sahnan, Dhruv and
Gurevych, Iryna and
Nakov, Preslav},
booktitle = {Findings of the Association for Computational Linguistics: NAACL 2025},
isbn = {979-8-89176-195-7},
pages = {2901--2914},
publisher = {Association for Computational Linguistics},
title = {{FIRE}: Fact-checking with Iterative Retrieval and Verification},
url = {https://aclanthology.org/2025.findings-naacl.158/},
year = {2025}
}
Developed by Zhuohan Xie, Rui Xing, Yuxia Wang, Jiahui Geng, Hasan Iqbal, Dhruv Sahnan, Iryna Gurevych, and Preslav Nakov
Affiliations: MBZUAI, The University of Melbourne
For questions or collaborations, contact:
📬 zhuohan.xie@mbzuai.ac.ae
“Fact-checking, now with FIREpower.”
Native macOS app to monitor Claude AI usage limits and watch your coding sessions live
Pocket Flow: Codebase to Tutorial
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.