arXiv - what is it?
The purpose of this LibGuide is to inform librarians and other information specialists about the content, scope, and business model of arXiv.org. For more information, please contact any of the librarians listed on the guide or email email@example.com.
More information about the project itself can be found at arXiv.org's FAQ.
1. What is arXiv?
arXiv is an open access repository for e-Prints in Physics, Mathematics, Computer Science, Astrophysics, Nonlinear Sciences, Quantitative Biology and Statistics. It is heavily used in these disciplines. Developed by physicist Paul Ginsparg in 1991 as a means of circulating scientific papers prior to publication, it initially focused on e-Prints in High Energy Physics (HEP). In time, focus broadened to related disciplines. All content in arXiv is freely available to all users.
Cornell University Library has supported and hosted arXiv since 2001. In 2010, Cornell launched a pilot project to develop a sustainable funding model for arXiv, asking the top 200 institutions that used the resource to make voluntary contributions in order to cover its operational costs. After that pilot phase ended, Cornell University Library developed a longer-term strategy for both the funding and governance of arXiv to ensure it persists as a resource for the scientific community.
arXiv surpassed half a million e-Prints in 2008. As of August 2013, there are 870,000 e-Prints across all subjects in arXiv.
2. What kind of content can I expect to find in arXiv?
There are a few types of materials one can find in arXiv. These include:
- Self-archived articles: Generally authors archive their articles prior to publication in a peer-reviewed journal. This is not always the case, however, as there are e-Prints in arXiv that are not subsequently part of the published literature. (See: http://www.istl.org/10-winter/viewpoints.html)
- Ancillary materials, such as datasets, related to articles. (See: http://arxiv.org/help/datasets)
- Conference proceedings. One example is the Conference on Uncertainty in Artificial Intelligence. (See: https://listserv.nd.edu/cgi-bin/wa?A2=PAMNET;2c77ab2c.1210)
- Overlay journals: These are journals whose submissions are drawn from sources that are freely available elsewhere. One example is the Electronic Proceedings in Theoretical Computer Science. (See: http://en.wikipedia.org/wiki/Overlay_journal)
For more information on submission details for arXiv, it may be helpful to consult the Primer (http://arxiv.org/help/primer).
3. What is the coverage of arXiv?
The coverage of arXiv varies depending on discipline. Its coverage in the High Energy Physics community is nearly comprehensive. As far as we're aware, no comprehensive study has been done on the coverage of arXiv across all of the disciplines it represents, but there have been several projects that have tackled this question for subsets of arXiv.
- In 2009, Tim Ingoldsby presented the results of a study comparing the literature published in American Institute of Physics (AIP) journals and e-Prints in arXiv. He noted that, "only for a narrow range of ... physics, can it be said that arXiv provides comprehensive coverage." While 97% of the content in Physical Review D was also in the arXiv, the percentage of material in other AIP journals in arXiv was not as high. The coverage by sub-discipline in Physics varied as well: elementary particles and fields and gravitation and astrophysics were most represented in arXiv, while atomic, molecular, and optical physics and plasmas and beams were least represented in arXiv. (See: http://www.councilscienceeditors.org/files/presentations/2009/Ingoldsby.pdf)
- A 2009 e-Print surveyed the reading and citing practices of scientists in High Energy Physics. (See: http://arxiv.org/abs/0906.5418)
- In a 2011 survey of mathematicians' views on publishing issues, nearly a third of respondents reported regularly posting their papers to the arXiv. (See: http://www.istl.org/11-fall/refereed4.html)
- A 2013 paper analyzes arXiv submissions and the published literature, with comparisons across the various disciplines that use arXiv. (See: http://arxiv.org/abs/1306.3261)
4. How do publishers work with arXiv?
- Some publishers encourage authors to archive a preprint version of their articles in arXiv. (Example: http://www.imstat.org/publications/arxiv.html)
- Some publishers have expressed interest in helping to link arXiv e-Prints with the published version of record. This work is ongoing in collaboration with publishers.
- arXiv collaborates with publishers and other initiatives to automatically update arXiv metadata with the DOI and journal references of published versions. Interested publishers should contact arXiv directly to explore this feature.
5. Is it possible to access data about the material in arXiv?
arXiv maintains an API (application programming interface) so that developers can access arXiv data in a systematic way. More information about the API, including full documentation on how to use it, can be found on arXiv. (See: http://arxiv.org/help/api/index)
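As a rough illustration of what systematic access can look like, the Python sketch below builds a query URL and parses the Atom feed that the API returns. It assumes the query endpoint and the search_query, start and max_results parameters described in the arXiv API documentation. The function names are hypothetical, and the embedded sample feed is a made-up placeholder rather than a real arXiv record.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

# Query endpoint as described in the arXiv API documentation (assumed current).
API_ENDPOINT = "http://export.arxiv.org/api/query"
# The API returns results as an Atom feed, so elements live in the Atom namespace.
ATOM_NS = {"atom": "http://www.w3.org/2005/Atom"}


def build_query_url(search_query, start=0, max_results=5):
    """Return a query URL for the given search expression (hypothetical helper)."""
    params = urllib.parse.urlencode(
        {"search_query": search_query, "start": start, "max_results": max_results}
    )
    return f"{API_ENDPOINT}?{params}"


def parse_feed(atom_xml):
    """Extract id, title and author names from each entry of an Atom feed."""
    root = ET.fromstring(atom_xml)
    records = []
    for entry in root.findall("atom:entry", ATOM_NS):
        title = entry.findtext("atom:title", default="", namespaces=ATOM_NS)
        records.append(
            {
                "id": entry.findtext("atom:id", default="", namespaces=ATOM_NS).strip(),
                # Titles may be wrapped across lines; collapse the whitespace.
                "title": " ".join(title.split()),
                "authors": [
                    author.findtext("atom:name", default="", namespaces=ATOM_NS).strip()
                    for author in entry.findall("atom:author", ATOM_NS)
                ],
            }
        )
    return records


def search_arxiv(search_query, max_results=5):
    """Fetch and parse results over the network (not called in this sketch)."""
    url = build_query_url(search_query, max_results=max_results)
    with urllib.request.urlopen(url, timeout=30) as response:
        return parse_feed(response.read())


# Made-up placeholder feed so the parser can be tried offline.
SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom">
  <entry>
    <id>http://arxiv.org/abs/0000.00000v1</id>
    <title>A made-up
      title</title>
    <author><name>A. Author</name></author>
    <author><name>B. Author</name></author>
  </entry>
</feed>"""

for record in parse_feed(SAMPLE_FEED):
    print(record["id"], "|", record["title"], "|", ", ".join(record["authors"]))
```

To run a live query, one could call search_arxiv("all:electron") instead of parsing the sample feed. The API documentation linked above covers the full query syntax and guidance on request frequency.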
6. arXiv works with other initiatives. What's the nature of these collaborations?
arXiv works with other information resources in Astronomy and High Energy Physics, including the Astrophysics Data System (ADS) and INSPIRE. Members of the respective teams have worked together to improve metadata sharing between systems. Additionally, the groups meet for an annual summit to keep one another apprised of relevant work. (See: http://indico.cern.ch/conferenceDisplay.py?confId=262430)
7. Does arXiv support authority control, ORCID or ResearcherID when authors upload their submissions?
arXiv provides authority control by associating authorship with local author identities. (See: http://arxiv.org/help/authority) At this time, arXiv does not support ORCID or ResearcherID.
8. How is arXiv funded?
arXiv recently transitioned to a community-supported collaborative model in which institutional users are encouraged to pledge support for arXiv. Pledges range from $1,500 to $3,000 USD annually, depending on institutional usage. Contact support@arXiv.org for more information about pledging support to arXiv.
Members will be elected to a Membership Advisory Board (MAB), which will represent the interests of the participating institutions. Please see the documentation below for further details:
- Membership Program FAQ
- List of supporting institutions (2012)
- For the latest statistics on institutional usage (2012), please see this link.
9. arXiv has two advisory boards, a Membership Advisory Board and a Scientific Advisory Board. What's the difference between the boards?
Both boards serve an advisory function to arXiv.
- The Scientific Advisory Board is composed of scientists and researchers in disciplines covered by arXiv. The Board provides advice and guidance pertaining to the repository's intellectual oversight, with a particular focus on the policies and operation of arXiv's moderation system. See information about membership here.
- The Membership Advisory Board (MAB) represents participating institutions’ interests and advises Cornell University Library (CUL) on issues related to repository management and development, standards implementation, interoperability, development priorities, business planning, and outreach and advocacy. Representation on the MAB is reserved for libraries, research institutions, laboratories, and foundations that are members of arXiv and that contribute to the financial support of the service. See the bylaws.