Daniel Lemire's blog

, 1 min read

Though unrefereed, arXiv has a better h-index than most journals…

Google provides a ranking of research venue per domain. For databases and information systems, they provide the top 20 venues according to their h-index.

As part of their assessment, they chose to include arXiv: a repository of freely available research papers. Almost anyone can post a paper on arXiv. There is some filtering, but there is no scientific review of the papers. This means that if you download a paper from arXiv and it is complete junk, you have nobody to complain to (except the authors). Nevertheless, it appears that people are willing to post their great papers on arXiv. On a ranking per h-index, the databases section of arXiv ranks in 11th place, outranking prestigious journals like the VLDB Journal, Data & Knowledge Engineering, Information Systems and Knowledge and Information Systems… not to mention all the journals and conferences that do not appear in the top-20 list provided by Google. One could argue that the good ranking can be explained by the fact that arXiv includes everything. However, it is far from true. There are typically less than 30 new database papers every month on arXiv whereas big conferences often have more than 100 articles (150 at SIGMOD 2013 and and over 200 at VLDB 2013). Roughly speaking, the database section of arXiv is equivalent to two big conferences while there are dozens of conferences and journals.

You can subscribe to to arXiv on Twitter. All papers are freely downloadable. Credit: A post by Rasmus Pagh on Google+ pointed out the good ranking of arXiv in theoretical computer science.