Author Archives: benosteen

“Full-text” search for openbiblio, using Apache Solr

Overview: This provides a simple search interface for openbiblio, using a network-addressable Apache Solr instance to provide FTS over the content. The indexer currently relies on the Entry Model (in /model/entry.py) to provide an acceptable dictionary of terms to be … Continue reading

Posted in JISC OpenBib, Uncategorized | Leave a comment

Academic Bibliography data available from Acta Cryst E

The bibliographic data from Acta Cryst E, a publication by the International Union of Crystallography (IUCr), has been extracted and made available with their consent. You can find a SPARQL endpoint for the data here and the full dataset here. … Continue reading

Posted in JISC OpenBib | Tagged , , , | 3 Comments

Name matching strategy using bibliographic data

One of the aims of an RDF representation of bibliographic data should be to have authors represented by unique, reference-able points within the data (as URIs), rather than as free-text fields. What steps can we do to match up the … Continue reading

Posted in JISC OpenBib, Semantic Web | 1 Comment

Characterising the British Library Bibliographic dataset

Having RDF data is good. Having Linkable data is better but having some idea of what sorts of properties you can expect to find within a triplestore or block of data can be crucial. That sort of broad-stroke information can … Continue reading

Posted in JISC OpenBib, OKFN Openbiblio | Tagged , , , , | 2 Comments

"Bundling" instances of author names together without using owl:sameas

Bundling? It’s a verb I’ve taken from ”Glaser, H., Millard, I., Jaffri, A., Lewy, T. and Dowling, B. (2008) On Coreference and The Semantic Web http://eprints.ecs.soton.ac.uk/15765/” where the core idea is that you have a number of URIs that mean or … Continue reading

Posted in JISC OpenBib, ORDF, Semantic Web | Tagged , , , , , , | 7 Comments

Augmenting the British Library's RDF data to allow for disambiguation

The British Library have released what they term the ‘British National Bibliography’ (BNB) under a permissive licence. This constitutes just under 3 million records, and is derived from the ‘most polished set of bibliographic data’ as some of it dates … Continue reading

Posted in JISC OpenBib, Semantic Web | Tagged , , , , | 5 Comments

Some obvious URI patterns for a service?

Whilst the technical issues and backends may vary, there are one or two URI patterns that may be adopted I think. It’s not REST, but it is a sensible structure I hope. (This is not to replace voID, but to … Continue reading

Posted in JISC OpenBib | Tagged , , , , , | 2 Comments

Data Triage Notes

I’ve begun to write up my experiences and notes on the triage of the datasets I am processing for the JISC Open Bibliography and Citation projects, in a way that others might make sense of them. You can find the … Continue reading

Posted in JISC OpenBib | Tagged , , , , , | 1 Comment

Disambiguation, deduplication and 'ideals'

(NB Republished from a mailing list conversation at http://lists.okfn.org/pipermail/open-bibliography/2010-August/000397.html – follow this link to see the comments and replies) In my work on meshing bibliographic datasets together, I’ve been using a conceptual tool that I would like to hear views … Continue reading

Posted in JISC OpenBib | Tagged , , , , , | Leave a comment

Bibliographic models in RDF

Put it in RDF to solve all your problems! As with most things in life, the reality is often a little more complex. If you are old enough, you may well remember when this very same cry was often uttered, … Continue reading

Posted in JISC OpenBib | Tagged , , , , , , , | 7 Comments