
Why Do Many AI & Computer Science Papers on arXiv Get High Citations but Are Not Published in Journals?
March 17, 2025arXiv has become one of the most influential preprint servers, especially in computer science, artificial intelligence (AI), and deep learning. However, while many papers on arXiv receive high citations, some are never published in peer-reviewed journals. Here’s why:
- arXiv Provides Early Access & Visibility 🔹 Rapid Dissemination: arXiv allows researchers to share findings immediately, whereas journal peer review can take months or even years. 🔹 High Accessibility: Papers on arXiv are open access, making them easier to cite than paywalled journal articles. 🔹 AI & CS Researchers Prefer Preprints: In fast-moving fields like deep learning, waiting for journal publication can make research outdated by the time it’s published.
📌 Example: Many groundbreaking AI papers, such as “Attention Is All You Need” (Transformer model), were first uploaded to arXiv before formal publication.
- Peer Review Is Not Always Necessary for Citations 🔹 Researchers Cite Useful Work Regardless of Peer Review: If a paper introduces a new model, dataset, or benchmark, it will be cited whether it’s peer-reviewed or not. 🔹 Self-Publishing Culture in AI/CS: Unlike medical and life sciences, computer science values technical contributions over traditional peer review. 🔹 Big Tech Companies Favor Preprints: Research labs at Google, OpenAI, Meta (Facebook AI), and Microsoft often publish directly on arXiv rather than waiting for journals.
📌 Example: OpenAI’s GPT and DALL·E papers were first released on arXiv, gaining thousands of citations before journal acceptance.
- Some Papers Are Too Technical for Journals 🔹 Mathematical/Algorithmic Papers: Journals often focus on applied research, while arXiv papers in theory, cryptography, or quantum computing might be too technical for mainstream journals. 🔹 Benchmark & Dataset Papers: New datasets (e.g., ImageNet, MNIST, COCO) are widely cited but do not always require journal publication.
📌 Example: The ImageNet paper (Deng et al., 2009) is highly cited but was first released as a preprint before its final conference version.
- Some Authors Skip Journals Due to Publishing Costs & Restrictions 🔹 Expensive Open-Access Fees: Journals like Nature Machine Intelligence or IEEE Transactions on AI charge thousands of dollars for open-access publication. 🔹 arXiv Has No Paywalls: Researchers can freely share their work without paying publishers.
📌 Example: Many AI research labs (Google DeepMind, Meta AI, OpenAI) skip journal publication entirely because arXiv already provides global visibility.
- Conferences Are More Important Than Journals in Computer Science 🔹 Top AI/CS Conferences Have Higher Impact Than Journals: In computer science, conferences are considered more prestigious than journals. 🔹 arXiv Papers Are Often Submitted to Conferences Instead: Many researchers submit their arXiv preprints to NeurIPS, CVPR, ICML, ICLR, or ACL rather than journals. 🔹 Double Submission Rules: Some conferences do not allow papers already published in journals, so authors keep them only on arXiv.
📌 Example: The Transformer paper (“Attention Is All You Need”) was first on arXiv, then accepted at NeurIPS (one of the top AI conferences).
Why Groundbreaking AI and Deep Learning Papers Are Published in Preprint on arXiv but Not in Journals
In the rapidly evolving fields of artificial intelligence (AI) and deep learning, many groundbreaking papers are disseminated as preprints on platforms like arXiv without subsequent publication in traditional peer-reviewed journals. This trend can be attributed to several factors:
- Rapid Dissemination of Research Findings
- The AI landscape advances swiftly, and researchers aim to share their findings promptly.
- arXiv allows for immediate distribution, enabling the community to access and build upon new work without delay.
- Traditional journals often involve lengthy peer-review processes, which can hinder the timely sharing of innovative ideas.
- Community Engagement and Feedback
- Publishing on arXiv facilitates early feedback from the global research community.
- This collaborative approach allows authors to refine their work based on diverse perspectives before considering formal publication.
- Shift Towards Conference Publications
- In AI and computer science, conferences often hold higher prestige than journals.
- Researchers may prioritize presenting their work at prominent conferences, using arXiv to establish the originality and timing of their research contributions.
Examples of Influential arXiv Preprints
Here are some notable AI and deep learning papers that were disseminated as arXiv preprints and have garnered significant citations:
- “Conditional Generative Adversarial Nets” by Mehdi Mirza and Simon Osindero (2014)
- This paper extends the Generative Adversarial Networks (GANs) framework to conditional models, introducing a mechanism to direct data generation processes.
- As of now, it has been cited over 5,700 times, reflecting its substantial impact on generative modeling research.
- “Swin Transformer: Hierarchical Vision Transformer using Shifted Windows” by Ze Liu et al. (2021)
- The authors propose the Swin Transformer, a novel hierarchical Transformer architecture that achieves linear computational complexity by utilizing shifted windows.
- This work has significantly influenced the development of vision transformers in computer vision.
- “How many preprints have actually been printed and why: a case study of computer science preprints on arXiv” by Qian Zhang et al. (2020)
- This study examines the publication fate of computer science preprints on arXiv, providing insights into the proportion that transition to peer-reviewed venues and the reasons behind their publication choices.
These examples illustrate that in the AI and deep learning communities, arXiv serves as a vital platform for the rapid dissemination and recognition of innovative research, even when such work does not undergo traditional journal publication.
Conclusion: Should You Submit to a Journal After arXiv?
Yes, if: ✅ Your research needs peer review for credibility (e.g., medical AI, regulatory approval). ✅ You want academic recognition in medical or life sciences. ✅ The research is applied AI and relevant to clinical or engineering journals.
No, if: ❌ Your work is highly technical (e.g., algorithms, new architectures). ❌ You aim to submit to a top CS conference instead (e.g., NeurIPS, ICML, ICLR). ❌ Your institution or employer values citations over peer review.
Final Thoughts
arXiv is a citation powerhouse because it provides fast, free, and unrestricted access to cutting-edge research. Many highly cited AI papers remain preprints because peer review is not always necessary for impact. However, publishing in journals adds credibility, particularly in healthcare, regulatory AI, and applied research