<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Research Remix</title>
	<atom:link href="http://researchremix.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://researchremix.wordpress.com</link>
	<description>Blogging about the science, engineering, and human factors of biomedical research data reuse</description>
	<pubDate>Fri, 18 Apr 2008 00:40:33 +0000</pubDate>
	<generator>http://wordpress.org/?v=MU</generator>
	<language>en</language>
			<item>
		<title>Why study Data Sharing?  (+ why share your data)</title>
		<link>http://researchremix.wordpress.com/2008/04/10/why-study-data-sharing-why-share-your-data/</link>
		<comments>http://researchremix.wordpress.com/2008/04/10/why-study-data-sharing-why-share-your-data/#comments</comments>
		<pubDate>Thu, 10 Apr 2008 16:04:11 +0000</pubDate>
		<dc:creator>Heather Piwowar</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://researchremix.wordpress.com/?p=76</guid>
		<description><![CDATA[A few weeks ago I was asked to give a short presentation to my department about my research.  My slides are below, both as a slideshare and also as a PDF with commentary notes.
The presentation gives a brief overview of why data sharing is important, why we need to study it, some of what [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>A few weeks ago I was asked to give a short presentation to my department about my research.  My slides are below, both as a slideshare and also as a PDF with commentary notes.</p>
<p>The presentation gives a brief overview of wh<span class="black_small_text">y data sharing is important, why we need to study it, some of what we’ve learned, and what we still don’t know.  Spoiler: it concludes with an encouragement to share your own research data.<br />
</span></p>
<p><strong>Why study Data Sharing?  (+ why share your data)</strong><a href="http://researchremix.files.wordpress.com/2008/04/dbmicolloquium2008-piwowar-presentationwithnotes.pdf"></a></p>
<p><a href="http://researchremix.files.wordpress.com/2008/04/dbmicolloquium2008-piwowar-presentationwithnotes.pdf">click for Slides with Commentary Notes</a></p>
<p><object type='application/x-shockwave-flash' wmode='transparent' data='https://s3.amazonaws.com:443/slideshare/ssplayer.swf?id=346134&#038;doc=dbmicolloquium2008-piwowar-presentationwithnotes-1207842189664954-9' width='425' height='348'><param name='movie' value='https://s3.amazonaws.com:443/slideshare/ssplayer.swf?id=346134&#038;doc=dbmicolloquium2008-piwowar-presentationwithnotes-1207842189664954-9' /></object></p>
<p>Two process questions:</p>
<p>1.  Does anyone know of a better way to share slide spoken content, other than &#8220;PDF with notes&#8221;?  I don&#8217;t *want* my slides to include everything I say, because that makes for a boring presentation in real life.  But how else to communicate the information when archiving?  Slideshare does allow voice-over, but frankly that raises the barrier of entry for me (record a podcast, aack).  Any other ideas?</p>
<p>2.  Any open clip art sources, other than http://www.openclipart.org/ and http://www.wpclipart.com/ ?  These have lots of things, but unfort still don&#8217;t rival the variety that comes with PowerPoint.  I looked, but there just didn&#8217;t include the right hats.   So the presentation includes Microsoft clipart.  Guessing that means I can&#8217;t make those images CC.  Guessing that means I can&#8217;t post the presentation on Nature Precedings.  Oh well, small steps.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/researchremix.wordpress.com/76/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/researchremix.wordpress.com/76/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/researchremix.wordpress.com/76/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/researchremix.wordpress.com/76/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/researchremix.wordpress.com/76/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/researchremix.wordpress.com/76/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/researchremix.wordpress.com/76/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/researchremix.wordpress.com/76/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/researchremix.wordpress.com/76/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/researchremix.wordpress.com/76/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/researchremix.wordpress.com/76/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/researchremix.wordpress.com/76/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=researchremix.wordpress.com&blog=1015265&post=76&subd=researchremix&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://researchremix.wordpress.com/2008/04/10/why-study-data-sharing-why-share-your-data/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/researchremix-128.jpg" medium="image">
			<media:title type="html">Heather</media:title>
		</media:content>
	</item>
		<item>
		<title>Make All Research Results CC-BY</title>
		<link>http://researchremix.wordpress.com/2008/04/10/make-all-research-results-cc-by/</link>
		<comments>http://researchremix.wordpress.com/2008/04/10/make-all-research-results-cc-by/#comments</comments>
		<pubDate>Thu, 10 Apr 2008 15:01:31 +0000</pubDate>
		<dc:creator>Heather Piwowar</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<category><![CDATA[CC]]></category>

		<guid isPermaLink="false">http://researchremix.wordpress.com/?p=75</guid>
		<description><![CDATA[Thanks to Klaus Graf for putting it so succinctly (and PMR for echoing it):
There is a simple solution (I will repeat it because it is important like a mantra):
* MAKE ALL RESEARCH RESULTS CC-BY
* MAKE ALL RESEARCH RESULTS CC-BY
* MAKE ALL RESEARCH RESULTS CC-BY
Make all research results CC-by.
Really.  The publishers, yes.  But also [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Thanks to Klaus Graf for <a href="http://archiv.twoday.net/stories/4851871/">putting </a>it so succinctly (and PMR for <a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=1033">echoing </a>it):</p>
<blockquote><p>There is a simple solution (I will repeat it because it is important like a mantra):</p>
<p>* MAKE ALL RESEARCH RESULTS CC-BY<br />
* MAKE ALL RESEARCH RESULTS CC-BY<br />
* MAKE ALL RESEARCH RESULTS CC-BY</p></blockquote>
<p>Make all research results <a href="http://creativecommons.org/licenses/by/3.0/">CC-by</a>.</p>
<p>Really.  The publishers, yes.  But also You.  Me.  All of us.  Try it.  Start small, ok, but start.  Put your raw data up on a website and <a href="http://wiki.creativecommons.org/Frequently_Asked_Questions">slap a Creative Commons license</a> on it.</p>
<p>Be the change you want to see.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/researchremix.wordpress.com/75/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/researchremix.wordpress.com/75/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/researchremix.wordpress.com/75/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/researchremix.wordpress.com/75/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/researchremix.wordpress.com/75/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/researchremix.wordpress.com/75/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/researchremix.wordpress.com/75/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/researchremix.wordpress.com/75/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/researchremix.wordpress.com/75/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/researchremix.wordpress.com/75/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/researchremix.wordpress.com/75/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/researchremix.wordpress.com/75/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=researchremix.wordpress.com&blog=1015265&post=75&subd=researchremix&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://researchremix.wordpress.com/2008/04/10/make-all-research-results-cc-by/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/researchremix-128.jpg" medium="image">
			<media:title type="html">Heather</media:title>
		</media:content>
	</item>
		<item>
		<title>Non-OA Full-text for text mining</title>
		<link>http://researchremix.wordpress.com/2008/04/07/non-oa-full-text-for-text-mining/</link>
		<comments>http://researchremix.wordpress.com/2008/04/07/non-oa-full-text-for-text-mining/#comments</comments>
		<pubDate>Mon, 07 Apr 2008 14:28:22 +0000</pubDate>
		<dc:creator>Heather Piwowar</dc:creator>
		
		<category><![CDATA[openaccess]]></category>

		<category><![CDATA[full text]]></category>

		<category><![CDATA[text mining]]></category>

		<guid isPermaLink="false">http://researchremix.wordpress.com/?p=70</guid>
		<description><![CDATA[Interesting discussion on Peter Murray-Rust&#8217;s blog about whether PubMed Central articles can be crawled and used for text mining. The answer is no, not now, not unless they are open access (as opposed to traditional closed access but deposited in PMC).  Really unfortunate.  Incremental progress, we&#8217;ll get there.
Anticipating my thesis work, I&#8217;ve been wondering about [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Interesting discussion on <a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=1026">P</a><a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=1026">eter Murray-Rust&#8217;s blog</a> about whether PubMed Central articles can be crawled and used for text mining. The answer is no, not now, not unless they are open access (as opposed to traditional closed access but deposited in PMC).  Really unfortunate.  Incremental progress, we&#8217;ll get there.</p>
<p>Anticipating my thesis work, I&#8217;ve been wondering about similar text mining questions. I think my needs are a bit different than those of PMR: I&#8217;m interested in papers that meet a targeted search, rather than all articles or all articles in relevant journal (what I gather he&#8217;s be interested in?). I&#8217;m willing to limit myself to the articles that I have access to through my University&#8217;s subscriptions. I don&#8217;t need figures. I think once I have the papers I&#8217;m allowed to text mine them as fair use, since I have them under permission. So the question is what can I automatically download?</p>
<p>I learned I can&#8217;t spider PMC, but what about normal PubMed? Try as I might, I couldn&#8217;t find verbage on the PubMed website allowing/disallowing spidering through to full-text links on publisher websites (the links that are populated and visible when I&#8217;m logged in through the University&#8217;s connection). Is this allowed? Still seems like it might not be. And then you end up at the publisher sites anyway, with all of their differing rules. Unfort, the publisher&#8217;s rules are often hard to find, confusing, and vague (as often noted by PMR and others). Aaaaah.</p>
<p>So last month I asked our librarians&#8230;.</p>
<blockquote><p>As you know, PMC has OA and non-OA full-text.  They make their OA text available via FTP etc, and they stipulate that those mechanisms are the only way that people are allowed to access the full text &#8220;because of copyright restrictions&#8221; [<a href="http://www.pubmedcentral.nih.gov/about/copyright.html" target="_blank">http://www.pubmedcentral.nih.gov/about/copyright.html</a>].  I&#8217;d also like to access non-OA text for which Pitt has subscriptions, but it sounds like I can&#8217;t do this by &#8220;crawling&#8221; PMC based on their rules [explicitly stated in the link above].  I guess I&#8217;m wondering if I can do it by &#8220;crawling&#8221; the normal, full PubMed.  Basically write a script to find the &#8220;HSLS&#8221; links on the article citation pages, follow them (usually into the publisher&#8217;s websites), and automatically save the html or pdf articles that are returned from a PubMed query.</p>
<p>There is no difference in the end result from me manually clicking through and saving the papers&#8230; but there is sure a difference in the manual time requirement!  I wouldn&#8217;t have thought this sort of automated downloading would be a problem&#8230; but the Restrictions on Systematic Downloading of articles in the PMC copyright notice referenced above makes me want to double-check.  I can&#8217;t find any reference to &#8220;crawling&#8221; or &#8220;systematic downloads&#8221; for PubMed itself.</p>
<p>I do understand there are user requirements when using the Entrez programming utilities (run automated queries during off hours, 3 seconds between queries, etc) and I would be sure to honor those both with the elements of my scripts which use the E-tools and those which are crawling the web pages directly.</p>
<p>Does that make sense?  Are you aware of any restrictions for crawling PubMed to automatically access and save content for which I do indeed have access through Pitt?  I guess since I&#8217;m going into the publisher&#8217;s websites, they might also have restrictions?  Is there another way to consolidate a large set electronic full-text articles (ideally a few thousand)?</p>
<p>Thanks very much for any pointers you may have.</p></blockquote>
<p>The librarian responsed that automatically following PubMed links should be fine, and that there shouldn&#8217;t be problems from publisher sites because we have subscriptions and my text mining falls under fair use. I&#8217;ll add that I think it helps that I&#8217;m not aiming to download full editions, because I do know that some publisher websites disallow that.</p>
<p>Maybe I shouldn&#8217;t be bringing it up again here, since it feels like I&#8217;ve been given an institutional &#8220;All Clear.&#8221; But no sense burying my head in the sand in case there really are issues: I want to know. Web downloading policies and full-text reuse policies are so complicated. I&#8217;ve spent time looked into them, but it sure seems like unless it is your full-time job it is impossible to understand and keep on top of how it works. I don&#8217;t think our librarians deal with these issues every day. Who else would I go to for clarification?</p>
<p>Does anyone have differing interpretations, warnings, reassurances, alternatives, and general paths through this crazy mess? How do other people do this???</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/researchremix.wordpress.com/70/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/researchremix.wordpress.com/70/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/researchremix.wordpress.com/70/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/researchremix.wordpress.com/70/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/researchremix.wordpress.com/70/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/researchremix.wordpress.com/70/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/researchremix.wordpress.com/70/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/researchremix.wordpress.com/70/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/researchremix.wordpress.com/70/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/researchremix.wordpress.com/70/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/researchremix.wordpress.com/70/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/researchremix.wordpress.com/70/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=researchremix.wordpress.com&blog=1015265&post=70&subd=researchremix&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://researchremix.wordpress.com/2008/04/07/non-oa-full-text-for-text-mining/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/researchremix-128.jpg" medium="image">
			<media:title type="html">Heather</media:title>
		</media:content>
	</item>
		<item>
		<title>FriendFeed</title>
		<link>http://researchremix.wordpress.com/2008/04/07/friendfeed/</link>
		<comments>http://researchremix.wordpress.com/2008/04/07/friendfeed/#comments</comments>
		<pubDate>Mon, 07 Apr 2008 13:45:07 +0000</pubDate>
		<dc:creator>Heather Piwowar</dc:creator>
		
		<category><![CDATA[blogging]]></category>

		<guid isPermaLink="false">http://researchremix.wordpress.com/?p=69</guid>
		<description><![CDATA[Thanks to Cameron Neylon for describing what friendfeed is all about.  It looks neat, so I&#8217;m giving it a go. There isn&#8217;t an explicit way to add a &#8220;citeulike&#8221; source to monitor the papers I&#8217;m archiving, but I&#8217;ve added my citeulike username page as a &#8220;blog&#8221; and that seems to work.  For now I&#8217;m keeping [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Thanks to <a href="http://friendfeed.com/cameronneylon">Cameron Neylon</a> for <a href="http://blog.openwetware.org/scienceintheopen/2008/04/06/friendfeed-lifestreaming-and-workstreaming/">describing </a>what friendfeed is all about.  It looks neat, so I&#8217;m <a href="http://friendfeed.com/researchremix">giving it a go</a>. There isn&#8217;t an explicit way to add a &#8220;citeulike&#8221; source to monitor the papers I&#8217;m archiving, but I&#8217;ve added my citeulike username page as a &#8220;blog&#8221; and that seems to work.  For now I&#8217;m keeping my sources worky (ie not linking to the Flickr baby photostream) to keep the work-signal high.  Ok, except I did include my starred YouTube videos&#8230; low traffic, usually nerdy, and some of that stuff is way too funny not to pass on :)</p>
<p>Friendfeed addresses something I&#8217;ve been struggling with.  Ideally, I want to blog about the things I read that I find particularly interesting and relevant, to make sure that everyone else has seen them too (ok, they probably have, but just in case&#8230;).  Needless to say, I&#8217;m not getting to it.  I need to let it go.  So in case you&#8217;re interested in the things I&#8217;m interested in, have a look at my friendfeed.  And introduce yourself or leave a link to your blog/library/paper stream/something so I can follow your work too!</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/researchremix.wordpress.com/69/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/researchremix.wordpress.com/69/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/researchremix.wordpress.com/69/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/researchremix.wordpress.com/69/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/researchremix.wordpress.com/69/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/researchremix.wordpress.com/69/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/researchremix.wordpress.com/69/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/researchremix.wordpress.com/69/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/researchremix.wordpress.com/69/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/researchremix.wordpress.com/69/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/researchremix.wordpress.com/69/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/researchremix.wordpress.com/69/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=researchremix.wordpress.com&blog=1015265&post=69&subd=researchremix&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://researchremix.wordpress.com/2008/04/07/friendfeed/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/researchremix-128.jpg" medium="image">
			<media:title type="html">Heather</media:title>
		</media:content>
	</item>
		<item>
		<title>A Centralized Proposal Repository</title>
		<link>http://researchremix.wordpress.com/2008/04/02/a-centralized-proposal-repository/</link>
		<comments>http://researchremix.wordpress.com/2008/04/02/a-centralized-proposal-repository/#comments</comments>
		<pubDate>Wed, 02 Apr 2008 13:17:10 +0000</pubDate>
		<dc:creator>Heather Piwowar</dc:creator>
		
		<category><![CDATA[openscience]]></category>

		<category><![CDATA[centralized proposal repository]]></category>

		<category><![CDATA[Open Science]]></category>

		<guid isPermaLink="false">http://researchremix.wordpress.com/?p=68</guid>
		<description><![CDATA[I actively support Nature Precedings as a place to archive my early research findings for visibility, feedback, and attribution.  I recently submitted a research proposal.   Precedings does a spot-check of all submissions to verify appropriateness.  It usually takes a day or two, and results in an automated response stating that your [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>I actively support <a href="http://precedings.nature.com/">Nature Precedings</a> as a place to archive my early research findings for visibility, feedback, and attribution.  I recently submitted a <a href="http://researchremix.wordpress.com/2008/03/24/envisioning-a-biomedical-data-reuse-registry/">research proposal.</a>   Precedings does a spot-check of all submissions to verify appropriateness.  It usually takes a day or two, and results in an automated response stating that your submission has been posted.  In this case, I received a personal, thoughtful email explaining that although Nature Precedings had published proposals in the past, they are moving away from this practice to concentrate on their core goal of &#8220;a repository for manuscripts, posters, and presentations describing completed research.&#8221;</p>
<p>I see the issue.  On one hand, if the goal of a preprint is to get feedback and attribution for research ideas, what better time to do it than at the proposal stage.  On the other hand, ideas are a dime a dozen and so it might not be scalable for Nature Precedings.</p>
<p>Sounds like we need another solution.  There was a <a href="http://www.nature.com/nature/journal/v452/n7186/full/452409b.html">letter </a>to Nature just recently (highlighted by Maxine Clarke in <a href="http://blogs.nature.com/nautilus/2008/03/proposal_for_a_centralized_gra.html">Nautilus</a>), calling for a Centralized Proposal Repository.  This idea is grander than simply a wiki for feedback and attribution.  Dr Harel is suggesting it could also be searched by funders, to identify projects which match their interests.</p>
<p>Dr Harel expands on this idea on his <a href="http://noamyharel.googlepages.com/universalproposalrepository">website</a> and links to the beginning of a Centralized Proposal Repository <a href="http://sharescienceideas.wikispaces.com/">wiki.</a></p>
<p>Great idea, and I think right up the alley of open science.   The wiki seems to be Protected (to limit spammers?)  I&#8217;ll go ask to join, and keep you updated on what I learn.</p>
<p>ETA:  Looks like <a href="http://usefulchem.blogspot.com/">Jean-Claude Bradley</a> was involved in the set-up of the wiki.  Great stuff!</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/researchremix.wordpress.com/68/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/researchremix.wordpress.com/68/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/researchremix.wordpress.com/68/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/researchremix.wordpress.com/68/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/researchremix.wordpress.com/68/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/researchremix.wordpress.com/68/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/researchremix.wordpress.com/68/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/researchremix.wordpress.com/68/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/researchremix.wordpress.com/68/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/researchremix.wordpress.com/68/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/researchremix.wordpress.com/68/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/researchremix.wordpress.com/68/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=researchremix.wordpress.com&blog=1015265&post=68&subd=researchremix&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://researchremix.wordpress.com/2008/04/02/a-centralized-proposal-repository/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/researchremix-128.jpg" medium="image">
			<media:title type="html">Heather</media:title>
		</media:content>
	</item>
		<item>
		<title>Citing monkeys?</title>
		<link>http://researchremix.wordpress.com/2008/04/02/citing-monkeys/</link>
		<comments>http://researchremix.wordpress.com/2008/04/02/citing-monkeys/#comments</comments>
		<pubDate>Wed, 02 Apr 2008 12:42:59 +0000</pubDate>
		<dc:creator>Heather Piwowar</dc:creator>
		
		<category><![CDATA[LitNotes]]></category>

		<category><![CDATA[citations]]></category>

		<category><![CDATA[literature]]></category>

		<guid isPermaLink="false">http://researchremix.wordpress.com/?p=67</guid>
		<description><![CDATA[Does anyone remember reading a paper about the citation history of ape studies?  If so, I&#8217;d love a pointer or any keyword-details you might remember.  I haven&#8217;t been able to find it.
For those who are curious:  from what I remember, the authors were investigating the what proportion of ape (or was it [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Does anyone remember reading a paper about the citation history of ape studies?  If so, I&#8217;d love a pointer or any keyword-details you might remember.  I haven&#8217;t been able to find it.</p>
<p>For those who are curious:  from what I remember, the authors were investigating the what proportion of ape (or was it monkey) studies are truly necessary.  They collected citation histories and discovered that a large proportion of studies had received no citations at all.  This suggests that although such studies have to clear high hurdles to be approved, our mechanisms are insufficient and as a result we&#8217;re putting our cousins through unnecessary hardship.</p>
<p>Interesting to me on a number of levels.  1.  It would be a helpful reference for the usage of citation histories.  2.  We&#8217;ve been to the zoo recently with a gorilla-enthralled two year old.  The apes (and monkeys) act just like some people I know.  Any invasive research (or zookeeping&#8230;) had better be worth it.</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/researchremix.wordpress.com/67/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/researchremix.wordpress.com/67/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/researchremix.wordpress.com/67/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/researchremix.wordpress.com/67/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/researchremix.wordpress.com/67/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/researchremix.wordpress.com/67/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/researchremix.wordpress.com/67/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/researchremix.wordpress.com/67/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/researchremix.wordpress.com/67/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/researchremix.wordpress.com/67/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/researchremix.wordpress.com/67/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/researchremix.wordpress.com/67/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=researchremix.wordpress.com&blog=1015265&post=67&subd=researchremix&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://researchremix.wordpress.com/2008/04/02/citing-monkeys/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/researchremix-128.jpg" medium="image">
			<media:title type="html">Heather</media:title>
		</media:content>
	</item>
		<item>
		<title>PSB Open Science workshop:  call for participation</title>
		<link>http://researchremix.wordpress.com/2008/03/25/psb-open-science-workshop-call-for-participation/</link>
		<comments>http://researchremix.wordpress.com/2008/03/25/psb-open-science-workshop-call-for-participation/#comments</comments>
		<pubDate>Tue, 25 Mar 2008 17:20:40 +0000</pubDate>
		<dc:creator>Heather Piwowar</dc:creator>
		
		<category><![CDATA[Open Notebook Science]]></category>

		<category><![CDATA[PSB2009]]></category>

		<category><![CDATA[openscience]]></category>

		<category><![CDATA[Open Science]]></category>

		<category><![CDATA[PSB]]></category>

		<guid isPermaLink="false">http://researchremix.wordpress.com/?p=66</guid>
		<description><![CDATA[As reported by the organizers at One Big Lab and Science in the Open, PSB 2009 is going to have a 3 hour workshop devoted to Open Science.  Neat, eh?  What a great chance to meet and learn from others who are working this way and/or thinking about this topic!  And discuss [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>As reported by the organizers at <a href="http://onebiglab.blogspot.com/2008/03/psb-open-science-workshop-call-for.html">One Big Lab</a> and <a href="http://blog.openwetware.org/scienceintheopen/2008/03/24/open-science-at-psb-call-for-submissions/">Science in the Open</a>, <a href="http://psb.stanford.edu/">PSB 2009</a> is going to have a 3 hour <a href="http://psb.stanford.edu/cfp-opensci.html">workshop devoted to </a><b><a href="http://psb.stanford.edu/cfp-opensci.html">Open Science</a>.  </b>Neat, eh?  What a great chance to meet and learn from others who are working this way and/or thinking about this topic!  And discuss it with others who just happen to be in the PSB neighbourhood!  And go to Hawaii in January!   :)</p>
<p>I&#8217;ll definitely be submitting a talk proposal.  Still brainstorming the topic.  The winner in my head over the last 24 hours is &#8220;Open Science:  Measuring the Costs and Benefits.&#8221;  What do you think?  Other ideas?  Off to email Shirley and Cameron&#8230;..</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/researchremix.wordpress.com/66/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/researchremix.wordpress.com/66/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/researchremix.wordpress.com/66/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/researchremix.wordpress.com/66/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/researchremix.wordpress.com/66/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/researchremix.wordpress.com/66/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/researchremix.wordpress.com/66/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/researchremix.wordpress.com/66/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/researchremix.wordpress.com/66/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/researchremix.wordpress.com/66/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/researchremix.wordpress.com/66/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/researchremix.wordpress.com/66/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=researchremix.wordpress.com&blog=1015265&post=66&subd=researchremix&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://researchremix.wordpress.com/2008/03/25/psb-open-science-workshop-call-for-participation/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/researchremix-128.jpg" medium="image">
			<media:title type="html">Heather</media:title>
		</media:content>
	</item>
		<item>
		<title>Identifying Data Sharing in Biomedical Literature</title>
		<link>http://researchremix.wordpress.com/2008/03/25/identifying-data-sharing-in-biomedical-literature/</link>
		<comments>http://researchremix.wordpress.com/2008/03/25/identifying-data-sharing-in-biomedical-literature/#comments</comments>
		<pubDate>Tue, 25 Mar 2008 17:04:18 +0000</pubDate>
		<dc:creator>Heather Piwowar</dc:creator>
		
		<category><![CDATA[MyResearch]]></category>

		<category><![CDATA[data reuse]]></category>

		<category><![CDATA[data sharing]]></category>

		<category><![CDATA[nlp]]></category>

		<guid isPermaLink="false">http://researchremix.wordpress.com/?p=65</guid>
		<description><![CDATA[I emailed AMIA again to ask for clarification on their preprint policy, and quickly received this encouraging response: “Preposting is fine so long as the other sites  don’t formally publish the work.”   Great news, thanks AMIA.  
Note:  this brings my blog up-to-date on the research I&#8217;ve been doing, with the [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><i>I emailed AMIA again to ask for <a href="http://researchremix.wordpress.com/2008/03/24/preprints-of-conference-submissions/">clarification on their preprint policy</a>, and quickly received this encouraging response: <font color="#000000">“<span><font face="Arial" size="2">Preposting is fine so long as the other sites  don’t formally publish the work.”   </font></span></font>Great news, thanks AMIA.  </i></p>
<p><i>Note:  this brings my blog up-to-date on the research I&#8217;ve been doing, with the exception of one paper under review at PLoS Medicine.  That one is a complex collaboration.   Despite some attempts there isn&#8217;t consensus about making it open at this point.<br />
</i><i></i></p>
<p>Here is the paper we submitted to the <a href="http://www.amia.org/meetings/f08/">AMIA 2008 Annual Symposium</a>.   AMIA=American Medical Informatics Association.  Nature Precedings link to appear once it has been posted.</p>
<p><font size="2"><b>Identifying Data Sharing in Biomedical Literature<br />
</b>Heather A. Piwowar and Wendy W. Chapman</font></p>
<p><font size="2">Many policies and projects now 	encourage investigators to share their raw research data with other 	scientists. Unfortunately, it is difficult to measure the 	effectiveness of these initiatives because data can be shared in 	such a variety of mechanisms and locations. We propose a novel 	approach to finding shared datasets:  using natural language processing (NLP) techniques to 	identify declarations of dataset sharing within the full text of 	primary research articles. Using regular expression patterns and 	machine learning algorithms on open access biomedical literature, 	our system was able to identify 61% of articles with shared datasets 	with 80% precision.  A simpler version of our classifier achieved 	higher recall (86%), though lower precision (49%). We believe our 	results demonstrate the feasibility of this approach and hope to 	inspire further study of dataset retrieval techniques and policy 	evaluation.</font></p>
<p><a href="http://docs.google.com/Doc?id=dgqz7h9q_142hrx7ssgz">Full text</a></p>
<p>***</p>
<p>My inspiration for this work was the idea of a <a href="http://researchremix.wordpress.com/2008/03/24/envisioning-a-biomedical-data-reuse-registry/">Data Reuse Registry</a> and associated research.  As discussed, a DRR would benefit from automatic identification of data reuse in the biomedical literature.  Unfortunately, automatic identification of data <i>reuse</i> is a tough place to start my NLP (natural language processing) journey because I haven&#8217;t found any large, pre-existing gold standards of data reuse to use for evaluating such a system (<a href="http://www.ncbi.nlm.nih.gov/projects/geo/info/citations.html">this list</a> of GEO &#8220;third party&#8221; data reuse papers is a start).</p>
<p>Identifying data <i>sharing</i> is easier: there are available gold standards via database links, and authors tend to use more uniform language in describing sharing than reuse.  Automatically detecting data sharing could be useful to my research in other ways as well, down the road, as I look towards further sharing policy evalutation.</p>
<p>This data sharing identification system used very simple NLP techniques.  Hope to (and will probably need to) dig into some more complex approaches as I tackle data reuse identification.</p>
<p>If anyone knows of other resources that list specific instances of data reuse, I&#8217;d love to hear about them!</p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/researchremix.wordpress.com/65/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/researchremix.wordpress.com/65/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/researchremix.wordpress.com/65/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/researchremix.wordpress.com/65/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/researchremix.wordpress.com/65/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/researchremix.wordpress.com/65/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/researchremix.wordpress.com/65/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/researchremix.wordpress.com/65/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/researchremix.wordpress.com/65/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/researchremix.wordpress.com/65/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/researchremix.wordpress.com/65/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/researchremix.wordpress.com/65/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=researchremix.wordpress.com&blog=1015265&post=65&subd=researchremix&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://researchremix.wordpress.com/2008/03/25/identifying-data-sharing-in-biomedical-literature/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/researchremix-128.jpg" medium="image">
			<media:title type="html">Heather</media:title>
		</media:content>
	</item>
		<item>
		<title>Preprints of conference submissions?</title>
		<link>http://researchremix.wordpress.com/2008/03/24/preprints-of-conference-submissions/</link>
		<comments>http://researchremix.wordpress.com/2008/03/24/preprints-of-conference-submissions/#comments</comments>
		<pubDate>Mon, 24 Mar 2008 17:45:19 +0000</pubDate>
		<dc:creator>Heather Piwowar</dc:creator>
		
		<category><![CDATA[Open Notebook Science]]></category>

		<category><![CDATA[Policies]]></category>

		<category><![CDATA[amia]]></category>

		<category><![CDATA[conferences]]></category>

		<category><![CDATA[ons]]></category>

		<category><![CDATA[preprints]]></category>

		<guid isPermaLink="false">http://researchremix.wordpress.com/?p=64</guid>
		<description><![CDATA[It continues to surprise me how hard [1, 2] it is to figure out the rules for publishing in most journals, particularly with respect to preprints.   I&#8217;m finding the policies for publishing in conferences are just as ill-defined.  This is a shame because it surely decreases sharing.
I&#8217;ve been putting my work up [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>It continues to surprise me how hard [<a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=914">1</a>,<a href="http://www.sennoma.net/main/archives/2008/01/does_the_aappsp_really_represe.php"> 2</a>] it is to figure out the rules for publishing in most journals, particularly with respect to preprints.   I&#8217;m finding the policies for publishing in conferences are just as ill-defined.  This is a shame because it surely decreases sharing.</p>
<p>I&#8217;ve been putting my work up on Nature Precedings.  I submitted a poster a few days before the ISMB conference. Nature Precedings asked me if I knew whether ISMB would permit this.  I emailed to ask:  they quickly responded that it was no problem.  I&#8217;ve since asked PSB and ELPUB:  again, no problem.  I think that ELPUB did make state this during in the submission process, but in all three cases it would be great if they wrote this explictly on their websites so that people could feel free to post and discuss draft submissions&#8230; I&#8217;ll send a few emails to suggest it.</p>
<p>I recently asked AMIA.  I was pointed to the contributor license agreement [below], which left me guessing.  What does &#8220;publish&#8221; mean?  For most publishers <a href="http://researchremix.wordpress.com/2007/07/24/conversation-with-publishers-on-open-notebook-science/">I&#8217;ve talked to</a> it means &#8220;published in an official capacity&#8221; and they don&#8217;t count blogs or pre-print servers.  Yet clearly the journals who disallow preprints disagree with this definition [<a href="http://clinmed.netprints.org/misc/policies.shtml">quick list for Clinical Medicine</a>, or <a href="http://www.sherpa.ac.uk/romeo.php?all=yes">Sherpa/RoMEO</a> for an extensive list].  JAMIA, the official journal of AMIA, does not allow preprints. So I guess I should email AMIA again and ask for further clarification.</p>
<p><b>ETA:  I emailed again and quickly got a very clear response: </b><b>&#8220;<span><font color="#0000ff" face="Arial" size="2">Preposting is fine so long as the other sites  don&#8217;t formally publish the work.&#8221;   </font></span>Great news, thanks AMIA.<br />
</b></p>
<p>I&#8217;d like to &#8220;ask for forgiveness rather than permission&#8221;  but the potential down-side is too great in this case.  Student publication in AMIA is highly-valued in my department.</p>
<p>I wonder if <a href="http://www.earlham.edu/~peters/fos/2008/03/publisher-groups-criticize-attempts-to.html">author copyright addenda</a> can apply to conference submissions???  I don&#8217;t see why not.  Will have to keep that in mind next time.</p>
<p>AMIA Annual Symposium 2008 license agreement:</p>
<p><span id="more-64"></span></p>
<div class="ArwC7c ckChnd">
<blockquote><p><span><b>Contributor License Agreement</b> </span><span>This is a License Agreement between the contributor and the American Medical Informatics Association (<span class="nfakPe">AMIA</span>). This License Agreement is binding and enforceable beginning on <b>August 1, 2008</b> until terminated by the mutual agreement of both parties. Contributor grants to <span class="nfakPe">AMIA</span> the right to publish and republish the written work (including any subsequent changes in the title or authors) attached to this Agreement (hereinafter the &#8216;Contribution&#8217;), in a publication entitled, <i>Proceedings: <span class="nfakPe">AMIA</span> 2008 Annual Meeting</i>, which shall be available on a CD-ROM and/or in any other format or medium at the discretion of <span class="nfakPe">AMIA</span>. Contributor also grants certain other publication rights described below. Authors may reproduce without permission copies of their papers for use in their own classes. In consideration of the mutual covenants and agreements contained herein, the parties agree to the following:<b>Contributor&#8217;s Warranty</b></span></p>
<p>The Contributor represents and warrants to <span class="nfakPe">AMIA</span> that the Contributor has full power and authority to enter into this License Agreement, on behalf of himself/herself and any other author, and that the Contribution is an original work except for material in the public domain and such excerpts from other works as may be included with the written permission of the copyright owners, which has been obtained and shall be submitted with the Contribution. The Contributor also represents and warrants that the work has not been published previously, is not subject to copyright or other rights of a person other than the author(s), and has not been offered for publication elsewhere.</p>
<p><b>Publication License</b></p>
<p>In the event that this work is published by <span class="nfakPe">AMIA</span>, and in consideration for the editing and publication of this work, the contributor(s) hereby grant(s) to <span class="nfakPe">AMIA</span> an exclusive royalty free license to publish and republish the Contribution, including any and all rights incidental thereto. <span class="nfakPe">AMIA</span> shall have the right to: (1) permit others to republish the article in whole or in part, with or without fee; (2) produce preprints or reprints; (3) translate the Contribution into any language other than English for sale or free distribution; and (4) publish the work in a collection of articles in any form or medium.</p></blockquote>
<p><span> </span></div>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/researchremix.wordpress.com/64/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/researchremix.wordpress.com/64/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/researchremix.wordpress.com/64/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/researchremix.wordpress.com/64/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/researchremix.wordpress.com/64/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/researchremix.wordpress.com/64/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/researchremix.wordpress.com/64/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/researchremix.wordpress.com/64/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/researchremix.wordpress.com/64/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/researchremix.wordpress.com/64/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/researchremix.wordpress.com/64/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/researchremix.wordpress.com/64/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=researchremix.wordpress.com&blog=1015265&post=64&subd=researchremix&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://researchremix.wordpress.com/2008/03/24/preprints-of-conference-submissions/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/researchremix-128.jpg" medium="image">
			<media:title type="html">Heather</media:title>
		</media:content>
	</item>
		<item>
		<title>Envisioning a Biomedical Data Reuse Registry</title>
		<link>http://researchremix.wordpress.com/2008/03/24/envisioning-a-biomedical-data-reuse-registry/</link>
		<comments>http://researchremix.wordpress.com/2008/03/24/envisioning-a-biomedical-data-reuse-registry/#comments</comments>
		<pubDate>Mon, 24 Mar 2008 14:48:42 +0000</pubDate>
		<dc:creator>Heather Piwowar</dc:creator>
		
		<category><![CDATA[Data Reuse Registry]]></category>

		<category><![CDATA[MyResearch]]></category>

		<category><![CDATA[data reuse]]></category>

		<category><![CDATA[drr]]></category>

		<guid isPermaLink="false">http://researchremix.wordpress.com/?p=63</guid>
		<description><![CDATA[An idea I&#8217;ve been thinking about recently:






Envisioning a Biomedical Data Reuse Registry
Heather A. Piwowar and Wendy W. Chapman




Abstract
Repurposing research data holds many benefits for the advancement of biomedicine, yet is very difficult to measure and evaluate.  We propose a data reuse registry to maintain links between primary research datasets and studies that reuse this [...]]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><div class="Section1">An idea I&#8217;ve been thinking about recently:</div>
<div class="Section1"></div>
<div class="Section1"></div>
<div class="Section1"></div>
<div class="Section1"></div>
<div class="Section1"></div>
<div class="Section1">
<h3><b>Envisioning a Biomedical Data Reuse Registry</b><i></i></h3>
<p><i>Heather A. Piwowar and Wendy W. Chapman</i></div>
<div class="Section1"></div>
<div class="Section1"></div>
<div class="Section1"></div>
<div class="Section1"></div>
<div class="Section1"><b>Abstract<br />
</b>Repurposing research data holds many benefits for the advancement of biomedicine, yet is very difficult to measure and evaluate.  We propose a data reuse registry to maintain links between primary research datasets and studies that reuse this data.  Such a resource could help recognize investigators whose work is reused, illuminate aspects of reusability, and evaluate policies designed to encourage data sharing and reuse.</div>
<div class="Section1"></div>
<div class="Section1"></div>
<div class="Section1"><b>Motivation<br />
</b>The full benefits of data sharing will only be realized when we can incent investigators to share their data[1] and quantify the value created by data reuse.[2]  Current practices for recognizing the provenance of reused data include an acknowledgment, a listing of accession numbers, a database search strategy, and sometimes a citation within the article.  These mechanisms make it very difficult to identify and tabulate reuse, and thus to reward and encourage data sharing.  We propose a solution:  a Data Reuse Registry.<br />
<b></b></div>
<div class="Section1"></div>
<div class="Section1"><b>What is a data reuse registry?</b><br />
We define a Data Reuse Registry (DRR) as a database with links between biomedical research studies and the datasets used within the studies. The reuse articles may be represented as PubMed IDs, and the datasets as accession numbers within established databases or the PubMed IDs of the studies that originated the data.</div>
<div class="Section1"></div>
<div class="Section1"><b>How would the DRR be populated?<br />
</b>We anticipate several mechanisms for populating the DRR:<br />
* Voluntary submissions<br />
* Automatic detection from the literature[3]<br />
* Prospective submission of reuse plans, followed by automatic tracking</div>
<div class="Section1"> We envision collecting prospective citations in two steps.  First, prior to publication, investigators visit a web page and list datasets and accession numbers reused in their research, thereby creating a DRR entry record in the DRR database.  In return, the reusing investigators will be given some best-practices free-text language that they can insert into their acknowledgments section, a list of references to the papers that originated the data, some value-add information such as links to other studies that previously reused this data, and a reference to a new DRR entry record.  When authors cite this DRR within their reuse study as part of their data use acknowledgement, the second step of DRR data input can be done automatically:  citations in the published literature will be mined periodically to discover citations to DRR entries.  These citations will be combined with the information provided when the entry was created to explicitly link published papers with the datasets they reused.  The result will be searchable by anyone wishing to understand the reuse impact made by an investigator, institution, or database.<b></b></div>
<div class="Section1"></div>
<div class="Section1"></div>
<div class="Section1"><b>How would the DRR be used?<br />
</b>Information from the DRR could be used to recognize investigators whose work is reused, illuminate aspects of reusability, examine the variety of purposes for which a given dataset is reused, and evaluate policies designed to encourage data sharing and reuse.</div>
<div class="Section1"></div>
<div class="Section1"><b>Conclusion<br />
</b>While the DRR may not be a comprehensive solution, we believe it represents a starting place for finding solutions to the important problem of evaluating, encouraging, and rewarding data sharing and reuse.<b></b></div>
<div class="Section1"></div>
<div class="Section1"><b>Acknowledgments<br />
</b>HP is supported by NLM training grant 5T15-LM007059-19 and WC is funded through NLM grant 1 R01LM009427-01.<b></b></div>
<div class="Section1"></div>
<div class="Section1"><b>References </b><br />
1.        Compete, collaborate, compel. Nat Genet. 2007;39(8).<br />
2.        Ball CA, Sherlock G, Brazma A. Funding high-throughput data sharing. Nat Biotechnol. 2004 Sep;22(9):1179-83.<br />
3.        Piwowar HA, Chapman WW. Identifying data sharing in the biomedical literature.  Submitted to the AMIA Annual Symposium 2008.</div>
<div class="Section1"></div>
<div class="Section1"><i>[This DRR summary has been submitted as a poster description to AMIA 2008]</i></div>
<div class="Section1"></div>
<div class="Section1"></div>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/researchremix.wordpress.com/63/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/researchremix.wordpress.com/63/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/researchremix.wordpress.com/63/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/researchremix.wordpress.com/63/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/researchremix.wordpress.com/63/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/researchremix.wordpress.com/63/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/researchremix.wordpress.com/63/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/researchremix.wordpress.com/63/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/researchremix.wordpress.com/63/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/researchremix.wordpress.com/63/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/researchremix.wordpress.com/63/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/researchremix.wordpress.com/63/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=researchremix.wordpress.com&blog=1015265&post=63&subd=researchremix&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://researchremix.wordpress.com/2008/03/24/envisioning-a-biomedical-data-reuse-registry/feed/</wfw:commentRss>
	
		<media:content url="http://a.wordpress.com/avatar/researchremix-128.jpg" medium="image">
			<media:title type="html">Heather</media:title>
		</media:content>
	</item>
	</channel>
</rss>