https://www.nature.com/articles/d41586-025-02936-6
AI tool detects LLM-generated text in research papers and peer reviews
Authors and peer reviewers are failing to disclose the use of LLMs despite journal policies limiting their use.
Testing the AI-detection tool on manuscripts before ChatGPT was released in November 2022, it flagged only seven abstracts and no methods or peer-review reports as containing potentially AI-generated text. “From there on, the detections just increased linearly and at what we would think is a very high rate,” says Evanko.
The tool can also distinguish between different LLMs, including ChatGPT models, DeepSeek, LLaMa and Claude. “We’re only able to do this because we’ve generated our entire training set ourselves, so we know the exact provenance, we know what model the training data came from,” explains Spero.
The current model of Pangram cannot distinguish between passages that are fully generated by AI and those that are written by humans but edited using AI.
AI tool detects LLM-generated text in research papers and peer reviews
Authors and peer reviewers are failing to disclose the use of LLMs despite journal policies limiting their use.
Testing the AI-detection tool on manuscripts before ChatGPT was released in November 2022, it flagged only seven abstracts and no methods or peer-review reports as containing potentially AI-generated text. “From there on, the detections just increased linearly and at what we would think is a very high rate,” says Evanko.
The tool can also distinguish between different LLMs, including ChatGPT models, DeepSeek, LLaMa and Claude. “We’re only able to do this because we’ve generated our entire training set ourselves, so we know the exact provenance, we know what model the training data came from,” explains Spero.
The current model of Pangram cannot distinguish between passages that are fully generated by AI and those that are written by humans but edited using AI.
