{"id":177,"date":"2011-06-21T18:07:26","date_gmt":"2011-06-21T23:07:26","guid":{"rendered":"http:\/\/notebooks.dataone.org\/tracking1000datasets\/?p=177"},"modified":"2013-05-09T01:17:13","modified_gmt":"2013-05-09T01:17:13","slug":"june-21-2011-tracking-protein-data-bank","status":"publish","type":"post","link":"https:\/\/notebooks.dataone.org\/data-reuse\/june-21-2011-tracking-protein-data-bank\/","title":{"rendered":"June 21, 2011 – Tracking Protein Data Bank"},"content":{"rendered":"
Today was spent tracking the Protein Data Bank (PDB) datasets.\u00a0 As Heather had predicted, there were quite a few more hits than the previous data repositories (670 citations).\u00a0 The search terms used and search results can be seen in this Google Spreadsheet<\/a>.\u00a0 As Heather predicted such a large amount of results, she also suggested starting by only importing citations from datasets that had 3 or less citations.\u00a0 I imported these 56 citations to the PDB Mendeley group<\/a>, found the full-text, and analyzed them for reuse.<\/p>\n Determining dataset reuse was trickier for this data repository as several of the articles just listed the PDB ID number in a table with several other PDB IDs.\u00a0 There are therefore more medium and low confidence levels applied than there have been with other data repositories.<\/p>\n Of the 56 citations, I was unable to access full-text for 3 of them and another 3 had import errors.\u00a0 Out of the 50 articles examined so far, 36 have potential dataset reuse, 3 have ambiguous dataset reuse, and 4 cite the dataset but do not appear to reuse it.\u00a0 (My numbers don’t seem to be adding up so I will be reexamining the citations to make sure everything was tagged in the morning.)<\/p>\n","protected":false},"excerpt":{"rendered":" Today was spent tracking the Protein Data Bank (PDB) datasets.\u00a0 As Heather had predicted, there were quite a few more hits than the previous data repositories (670 citations).\u00a0 The search terms used and search results can be seen in this Google Spreadsheet.\u00a0 As Heather predicted such a large amount of Continue reading June 21, 2011 – Tracking Protein Data Bank<\/span>