Open Bibliography and Open Bibliographic Data » institutionalBenefits http://openbiblio.net Open Bibliographic Data Working Group of the Open Knowledge Foundation Tue, 08 May 2018 15:46:25 +0000 en-US hourly 1 http://wordpress.org/?v=4.3.1 Medline dataset http://openbiblio.net/2011/05/23/medline-dataset/ http://openbiblio.net/2011/05/23/medline-dataset/#comments Mon, 23 May 2011 09:56:55 +0000 http://openbiblio.net/?p=1120 Continue reading ]]> Announcing the CC0 Medline dataset

We are happy to report that we now have a full, clean public domain (CC0) version of the Medline dataset available for use by the community.

What is the Medline dataset?

The Medline dataset is a subset of bibliographic metadata covering approximately 98% of all PubMed publications. The dataset comes as a package of approximately 653 XML files, chronologically listing records in terms of the date the record was created. There are approximately 19 million publication records.

Medline is a maintained dataset, and updates chronologically append to the current dataset.

Read our explanation of the different PubMed datasets for further information.

Where to get it

The raw dataset can be downloaded from CKAN : http://ckan.net/package/medline

What is in a record

Most records contain useful non-copyrightable bibliographic metadata such as author, title, journal, PubMed record ID. Many also have DOIs. We have stripped out any potentially copyrightable material such as abstracts.

Read our technical description of a record for further information.

Sample usage

We have made an online visualisation of a sample of the Medline dataset – however the visualisation relies on WebGL which is not yet widely supported by all browsers. It should work in Chrome and probably FireFox4.

This is just one example, but shows what great things we can build and learn from when we have open access to the necessary data to do so.

]]>
http://openbiblio.net/2011/05/23/medline-dataset/feed/ 3
JISC OpenBibliography: Wider Benefits to Sector & Achievements for Host Institution http://openbiblio.net/2010/07/15/jisc-openbibliography-wider-benefits-to-sector-achievements-for-host-institution/ http://openbiblio.net/2010/07/15/jisc-openbibliography-wider-benefits-to-sector-achievements-for-host-institution/#comments Thu, 15 Jul 2010 09:15:48 +0000 http://openbiblio.net/?p=111 Continue reading ]]>
  • Bibliographic data is useful; A number of organisations such as CERN and Library of Congress have recognised that providing open access to bibliographic records and controlled vocabularies is a natural and necessary step to begin to identify errors and to avoid erroneous or divergent duplication, thereby improving the metadata accuracy. A key point from Karen Coyle is “The change that libraries will need to make in response [to user demand] must include the transformation of the library’s public catalog from a stand-alone database of bibliographic records to a highly hyperlinked data set that can interact with information resources on the World Wide Web.”
  • Bibliographic data is, in general, not open or linked: this limits its usefulness to the academic community. This project will deliver bibliographic material that is truly open (as in http://opendefinition.org where the team has particular expertise). Many attempts to create LOD suffer because there are no useful resources to link to. OpenBibliography will expose Author names, Institutions and Geographical Locations with semantic targets in the LOD ecosystem (e.g. Geonames, Wikipedia); the project will put significant effort into disambiguation so that OpenBibliography can become an important node in the LOD graph.
  • Processes to make it open or linked are not familiar to libraries and publishers: Much modern bibliographic data is created implicitly or explicitly by the scholarly publication process but exposed poorly or not at all. Working with cooperating publishers can rapidly transform their output to complete open semantic bibliography. By providing a clear working model for bibliographic metadata as semantic, referenceable links with a reusable workflow to gather, add provenance, refine and disambiguate existing metadata information, members of the JISC community can apply the same model and techiniques with the open-source code and services we will provide to use data from and contribute to the aforementioned ‘highly hyperlinked data set’.
  • ]]>
    http://openbiblio.net/2010/07/15/jisc-openbibliography-wider-benefits-to-sector-achievements-for-host-institution/feed/ 0