On the Nature website, Richard Van Noorden reports that a French computer scientist, Cyril Labbé, has discovered over 120 computer-generated papers that have been published in conference proceedings between 2008 and 2013. Over 100 of these papers were published by the Institute of Electrical and Electronic Engineers (IEEE) and 16 others appeared in Springer publications.
The papers have been composed using SCIgen, which only requires the user to input author names, and automatically generates random papers that look like Computer Science, but which are actually meaningless. Cyril Labbé has written a program that is able to recognize papers that have been generated by SCIgen. (The program compares the vocabulary of a text to that of a reference corpus; in particular, it measures the inter-textual distance as the proportion of word-tokens shared by two texts. For details of the method, see Labbé's 2012 paper published in Scientometrics.)
The proceedings issues that appeared in Springer publications were (supposed to be) peer-reviewed; for the IEEE proceedings, it is less clear whether they underwent peer review. In any case, the former examples show that the peer review system is not always watertight, not just in the case of open-access journals (which was also discussed here at NewApps).
Most of the conferences took place in China and most of the authors have Chinese affiliations. Of course, it remains to be checked whether the author names correspond to real scholars and if so, whether they were aware of the submission in their name. Nature was able to contact one actual researcher: he does not know why his name appeared in the author list of such a computer-generated paper.
Below the fold, I offer a speculation on the motivation behind the submission of these fake papers.
Continue reading "Recently discovered: 100+ computer-generated papers. Relation to spamference?" »
Recent Comments