
ArXiv’s new AI policy is clear: let an LLM write your paper without checking its work, and you are banned for a year. This isn’t a soft warning — it is the most consequential enforcement action the world’s largest open science repository has ever taken against AI misuse, and every researcher using tools like ChatGPT or Gemini in their workflow needs to understand exactly what it means.
What Is the arXiv AI Policy and Why Does It Matter?
ArXiv (pronounced “archive”) is the dominant preprint server for physics, mathematics, computer science, quantitative biology, and related fields. Before a paper gets through peer review — a process that can take months or years — researchers share their findings on arXiv to accelerate scientific discourse. On any given day, thousands of papers are posted, read, and cited from its pages.
That makes arXiv a uniquely high-stakes venue. When the arXiv AI policy shifts, it doesn’t just affect one journal or one discipline. It shapes norms across entire scientific communities.
The repository has been warning about AI slop — the flood of low-effort, LLM-generated papers that carry the surface appearance of legitimate research — for several years. First-time submitters were already required to obtain endorsements from established authors. But as of May 2026, arXiv has moved from speed bumps to hard penalties. The new arXiv AI policy introduces a one-year submission ban for any author whose paper contains incontrovertible evidence that they did not verify what their AI tools produced.
This is a watershed moment for academic publishing.
What the New arXiv Author Ban Actually Says
The updated arXiv AI policy was announced by Thomas Dietterich, chair of arXiv’s computer science section, on May 15, 2026. His statement cuts through ambiguity with unusual directness: if a submission shows that authors failed to check the output of their LLM, “we can’t trust anything in the paper.”
The penalty that follows is a two-part consequence:
- A one-year ban from submitting to arXiv.
- After the ban expires, all future arXiv submissions must first be accepted by a reputable peer-reviewed venue before they can be posted.
That second condition is especially significant. ArXiv’s entire value proposition is speed — the ability to share research before peer review. Stripping that privilege away permanently transforms how a scientist participates in their field.
What Counts as “Incontrovertible Evidence”?
The arXiv AI policy does not require moderators to guess or use probabilistic detection tools. It applies only when the evidence is unambiguous. Dietterich’s announcement cited two clear examples:
- Hallucinated references — citations to papers, authors, or journals that do not exist, a well-documented failure mode of large language models.
- LLM conversation artifacts — text that includes prompts to or responses from an AI assistant left accidentally in the manuscript.
These are not edge cases. Hallucinated citations have been found in published, peer-reviewed biomedical research with growing frequency. A 2026 study in The Lancet confirmed that fabricated citations are rising in the scientific literature, with LLM use identified as a likely driver. When such artifacts appear in an arXiv submission, they serve as a fingerprint: the authors copy-pasted AI output without reading it.
How the One-Strike Process Works
Despite being called a “one-strike” rule by Dietterich, the arXiv AI policy includes procedural safeguards to prevent arbitrary enforcement:
- A moderator must flag the submission as containing incontrovertible evidence of unchecked AI use.
- The section chair must independently confirm the evidence before any penalty is imposed.
- The author has the right to appeal the decision.
This means the ban is not automatic and is not triggered by AI-detection software alone. It requires human judgment at two levels. The policy targets carelessness, not tool use.
Permitted vs. Prohibited: AI Use in arXiv Papers
One of the most important things to understand about the arXiv AI policy is what it does not ban. Using LLMs to assist with writing, editing, coding, or literature review is explicitly allowed. What is not allowed is abdicating authorial responsibility for the content those tools produce.
The table below clarifies the distinction:
| Activity | Permitted Under arXiv AI Policy | Notes |
|---|---|---|
| Using LLMs for grammar and style editing | Yes | Author must review all changes |
| Using LLMs to generate a first draft | Yes | Author must verify every claim |
| Using AI coding assistants for analysis scripts | Yes | Results must be independently validated |
| Using LLMs to generate a literature review | Yes (with verification) | Every citation must be confirmed to exist |
| Copy-pasting LLM output with hallucinated citations | No | Incontrovertible evidence of policy violation |
| Submitting a paper with LLM prompts left in the text | No | Incontrovertible evidence of policy violation |
| Listing an AI as an author | No | Violates existing authorship norms |
| Using AI-generated content without disclosure | Context-dependent | Transparency is strongly encouraged |
The guiding principle of the arXiv AI policy, in Dietterich’s own framing, is that authors take “full responsibility” for their content, “irrespective of how the contents are generated.”
Why arXiv Is Taking This Stand Now
The Problem of AI Slop in Scientific Publishing
The volume of AI-generated content flooding academic repositories exploded between 2023 and 2025. Papers written almost entirely by LLMs began appearing with fabricated experiments, invented datasets, and — most damagingly — references to studies that had never been conducted. Because arXiv lacks full peer review at the point of submission, it became a particularly vulnerable target.
The damage is not merely reputational. Other researchers cite arXiv preprints in their own work. If a hallucinated reference propagates through the citation network, it can corrupt the evidentiary foundation of entire research threads. Graduate students and junior researchers, especially those working in fast-moving areas like machine learning, may not have the domain depth to catch fabricated citations before building on them.
The arXiv AI policy is, at its core, a quality signal. It tells readers: we are committed to ensuring that what is posted here was actually checked by a human who understood what they were submitting.
ArXiv’s Broader Institutional Shift
The new AI policy does not exist in isolation. ArXiv is simultaneously undergoing its most significant structural change in its history. After more than 20 years as a Cornell University project, arXiv is becoming an independent nonprofit. That transition is explicitly linked to fundraising capacity — the new organization will be able to raise more money to address problems like AI slop at scale, including investing in better moderation infrastructure.
These two developments together signal that arXiv is not treating AI governance as a temporary issue. It is building institutions and enforcement capacity for the long term.
What This Means for Researchers Using AI Tools
The “Full Responsibility” Principle
The most practically important phrase in the arXiv AI policy is “full responsibility.” It means that the legal and ethical accountability traditionally assigned to human authorship does not transfer to a tool. If an LLM produces a fabricated citation and you submit it, you — not the LLM, not the company that built it — are responsible for that fabrication.
This reframes the question researchers should be asking. The question is no longer “Did I use AI to write this?” It is “Can I personally vouch for every sentence, every citation, and every data point in this paper?”
Researchers who can answer yes to the second question have nothing to fear from the arXiv AI policy. Those who cannot should not be submitting.
A Checklist Before You Submit to arXiv
For researchers who use AI tools as part of their workflow, building a verification habit is now non-negotiable. Here is a practical pre-submission checklist aligned with the arXiv AI policy:
- Verify every reference manually. Search for each cited paper in Google Scholar, Semantic Scholar, or the relevant journal database. Confirm the title, authors, journal, and year are accurate.
- Read the full manuscript aloud or in review. LLM artifacts — awkward phrasing, generic transitions, leftover prompt text — are easier to catch when you slow down.
- Check every quantitative claim against its source. LLMs confidently fabricate statistics. If a number appears in your paper, trace it back to a primary source you have actually read.
- Remove any conversational AI residue. Search the document for phrases like “As an AI language model” or “Certainly! Here is” — common LLM response openers that sometimes survive copy-paste.
- Have a co-author or colleague review independently. A second set of human eyes remains the most effective hallucination detector available.
- Document your AI use. Even where disclosure is not yet mandatory, keeping a record of which tools you used and how protects you in any dispute.
The Bigger Picture: Academic Integrity in the Age of LLMs
The arXiv AI policy lands in the context of a broader reckoning across academia. The same week arXiv’s announcement circulated, it emerged that a lawyer representing Anthropic had been forced to apologize after the Claude AI hallucinated a legal citation in a court filing — a reminder that LLM hallucination is not a problem confined to research papers.
What makes the arXiv situation distinctive is the scale and the speed of propagation. Legal filings affect individual cases. A hallucinated citation in a high-profile arXiv preprint can be incorporated into dozens of papers within weeks of posting, especially in fast-moving fields where researchers rely on preprints as primary sources.
The arXiv AI policy is one response to this structural vulnerability. It is unlikely to be the last. Major peer-reviewed journals, conference organizers, and funding bodies are all watching to see how enforcement works in practice — and many are expected to adapt similar frameworks.
For the broader academic community, the policy raises questions that do not yet have settled answers:
- Where is the line between AI-assisted writing and AI-generated writing?
- Should AI tool use be disclosed in the methods section, the acknowledgments, or a dedicated statement?
- How will detection evolve as LLMs become better at mimicking human writing patterns?
- What obligations do institutions have to train researchers in responsible AI use before they submit?
The arXiv AI policy does not answer these questions. What it does is establish a floor: you must check what you submit. Everything above that floor remains under negotiation.
Key Takeaways
The arXiv AI policy introduced in May 2026 represents the sharpest enforcement action any major scientific repository has taken against careless AI use. Here is what matters most:
- ArXiv will impose a one-year submission ban on authors whose papers contain incontrovertible evidence — such as hallucinated references or leftover LLM prompts — that they did not verify AI-generated content.
- After the ban, affected authors must have future submissions accepted by a peer-reviewed venue before posting to arXiv, permanently altering their access to preprint publishing.
- The arXiv AI policy does not ban AI tools. It bans the failure to take responsibility for what those tools produce.
- The policy is enforced through human judgment — a moderator flags, a section chair confirms, and an appeal is available.
- Fabricated citations are rising across the scientific literature, making the arXiv AI policy part of a larger, unresolved challenge for academic publishing.
For researchers, the practical takeaway is simple: if you use LLMs in your work, treat their output as a first draft written by a capable but unreliable assistant — one that will confidently cite papers that do not exist. Your name is on the paper. Under the arXiv AI policy, that means your judgment, not the model’s, is what stands between your work and a one-year ban.