arXiv introduces a one-year ban for scientists who hand their papers over to LLMs

arXiv, the largest open repository for scientific preprints, has finally drawn the line. From now on, if an author submits a paper that clearly shows the work was handed over entirely to a large language model, the result is a one-year ban. And that's just the start. After the ban ends, each of that author's subsequent papers must first be accepted by a peer-reviewed scientific journal before it can appear on arXiv.

The site was already under pressure. The number of low-quality, AI-generated papers has been rising month by month, and arXiv had previously introduced a rule that new users must receive an endorsement from an established author. After 20 years of being hosted by Cornell, the organization is now becoming an independent nonprofit, which will let it raise money specifically for the fight against AI "slop."

What counts as "incontrovertible evidence"? Thomas Dietterich, chair of arXiv's computer science section, lists two cases: hallucinated citations (references that don't exist) and chat comments to or from a large language model that accidentally ended up inside the paper. Either one is a stark sign that the author didn't even read their own submission.

It's important to understand what arXiv is not banning: the use of LLMs. Dietterich is precise: authors must take "full responsibility" for the content, "regardless of how it was generated." If a researcher copies inaccurate text, plagiarized content, errors, or invented references directly from ChatGPT, it's still their responsibility. The tool is not an alibi.

The system will run on a "one strike" principle. Moderators will flag suspicious papers, section chairs must confirm the evidence before a sanction is issued, and the author will have a right of appeal. The procedure is strict but not biased, and it imposes order in a space where none existed before.

The question for the reader: can a single institution hold the line on quality when technology makes cheating cheap? This approach is a small-scale version of what Europe is doing at the regulatory level: not a ban, but accountability. The question is whether it will be enough, or whether in a year or two AI-generated papers will keep appearing on arXiv, just better hidden.