The skewness of scientific productivity Articles
Overview
published in
- Journal of Informetrics Journal
publication date
- October 2014
start page
- 917
end page
- 934
issue
- 4
volume
- 8
Digital Object Identifier (DOI)
International Standard Serial Number (ISSN)
- 1751-1577
Electronic International Standard Serial Number (EISSN)
- 1875-5879
abstract
- This paper exploits a unique 2003-2011 large dataset, indexed by Thomson Reuters, consisting of 17.2 million disambiguated authors classified into 30 broad scientific fields, as well as the 48.2 million articles resulting from a multiplying strategy in which any article co-authored by two or more persons is wholly assigned as many times as necessary to each of them. The dataset is characterized by a large proportion of authors who have their oeuvre in several fields. We measure individual productivity in two ways that are uncorrelated: as the number of articles per person and as the mean citation per article per person in the 2003-2011 period. We analyze the shape of the two types of individual productivity distributions in each field using size- and scale-independent indicators. To assess the skewness of productivity distributions we use a robust index of skewness, as well as the Characteristic Scores and Scales approach. For productivity inequality, we use the coefficient of variation. In each field, we study two samples: the entire population, and what we call "successful authors", namely, the subset of scientists whose productivity is above their field average. The main result is that, in spite of wide differences in production and citation practices across fields, the shape of field productivity distributions is very similar across fields. The parallelism of the results for the population as a whole and for the subset of successful authors, when productivity is measured as mean citation per article per person, reveals the fractal nature of the skewness of scientific productivity in this case. These results are essentially maintained when any article co-authored by two or more persons is fractionally assigned to each of them.
Classification
keywords
- co-authorship; disambiguation algorithm; individual scientist's productivity distributions; skewness of science; higher order statistics; statistical methods; coauthorship; coefficient of variation; individual productivity; large dataset; productivity distribution; scientific fields; skewness of science; two ways; productivity