The second installment in my #draftInProgress series on Open Data citation advantage. About one fourth of the methods section:
Methods: Which studies?
The primary analysis in this paper examines the citation count of a gene expression microarray experiment, relative to availability of the experiment’s data.
The sample of microarray experiments used in the current analysis was previously determined (Piwowar 2011 PLoS ONE, data from Piwowar 2011 Dryad). Briefly, a full-text query uncovered papers with keywords associated with relevant wet-lab methods. The full-text query had been characterized with high precision (90%, 95% confidence interval 86% to 93%) and a moderate recall (56%, 52% to 61%) for this task. Running the query in PubMed Central, HighWire Press, and Google Scholar revealed 11,603 distinct gene expression microarray papers. The papers were published between 2000 and 2009.
The current analysis retained papers published between 2001 and 2009.
Piwowar HA (2011). Who shares? Who doesn’t? Factors associated with openly archiving raw research data. PLoS ONE, 6 (7) : http://dx.doi.org/10.1371/journal.pone.0018657
Piwowar HA (2011). Data from: Who shares? Who doesn’t? Factors associated with openly archiving raw research data. Dryad Digital Repository : http://dx.doi.org/10.5061/dryad.mf1sd