Hi!ย I’m Jonathan D. Carlson, and I am the DataONE intern working on the Tracking the Reuse of 1,000 Datasets project.
ABOUT ME.ย I have a B.A. from Gustavus Adolphus College in Geology and Environmental Studies with a minor in Geography, and a M.S. from Michigan Technological University in Forest Ecology and Management.ย I am currently pursuing an M.A. in Library and Information Studies from the University of Wisconsin–Madison and aspire to be an academic science librarian.
ABOUT THE PROJECT.ย You can find a proposal for this project here and an update here.
ABOUT THE BLOG.ย This blog will act as an Open Science Notebook, and it will therefore contain descriptions of what I am doing and links to the data being collected and analyzed.
I welcome you to leave comments and suggestions if you see a way to improve anything along the way.
Good to see another intern using these. I’m the Workflows one – I applied to do yours, as well, but you look much more qualified at it. Best of luck!
I look forward to seeing how it goes – might be nice to get a broad overview of what’s going on, as well as a step-by-step of what you do each day.
Good luck!
I hope the datasets get re-used a lot. Particularly the TreeBASE ones. Although having had a look at the list… can’t say I’m that confident.
I was wondering if you’ve thought of adding in Impact Factor of the journal of the article that the deposited data is associated with, as a factor (as part of a multi-factor, multiple regression analysis, perhaps).
The basic expectation being (to be tested) that say a Nature dataset is more likely to get re-used that a European J. of Entomology dataset. Null hypothesis: Impact Factor doesn’t matter.
Also, for the Biological datasets (which form the majority of your sample after all…) what about taxon coverage of the dataset. Does it affect re-use rate if the data is predominantly (broadly) ‘animal’, ‘plant’, ‘bacteria’, or ‘fungi’. A glorified test of ‘kingdom’ popularity?
Anyway, do keep us updated with the progress ๐