Monthly Archives: November 2007

Theses: why do we force graduate students to corrupt crystallographic information?

In our SPECTRa-T project we are exploring how we can extract data and metadata from chemistry theses. Almost all these documents are now born-digital, i.e. written in a wordprocessor such as Word or TeX rather than being typed on carbon … Continue reading

Posted in data, theses | Leave a comment

CrystalEye repository: technical aspects

There has been some confusion recently (post+comments , post+comments) about copying and redistributing CrystalEye. While some of this relates to the legal, moral and ethical issues, there are major technical aspects that need to be understood. Here are some, without … Continue reading

Posted in crystaleye | 1 Comment

Derivative use of Open Access works

A number of people have commented on my concern about the re-use of Open Data and suggested that I have put unreasonable restrictions on it. I show two comments and then refer to Klaus Graf who has, I think, put … Continue reading

Posted in open issues | 6 Comments

Open NMR: what metadata do we want?

One of the reasons that CrystalEye works is that the metadata contributed by the authors (and required by the publishers, through IUCr) is superb. Is there general agreement about what metadata should be captured for NMR spectra or shifts? The … Continue reading

Posted in nmr, open issues | 5 Comments

A novel idea: publishers adding value to OA publications

Here’s a post from a little while ago. TA publishers thinking about how to build on OA repositories 19:19 08/10/2007, Peter Suber, Open Access News: Stephane Goldstein has blogged some notes on the ALPSP conference, Repositories – for better or … Continue reading

Posted in open issues | Leave a comment

COST and CrystalEye: What's the longest B-C bond?

In my talk to COST I demonstrated CrystalEye – Nick Day’s collection of > 100,000 crystal structures. I like giving live demos – at least it keeps me on my toes – and so I loaded the CrystalEye Home page. … Continue reading

Posted in crystaleye | Leave a comment

Open NMR: update and requests for input

The NMR project that Nick Day has been working on for the last month has run its course. We said that it would finish at the end of October so as not to prolong Nick’s writing up. Like all research … Continue reading

Posted in nmr, open issues | 1 Comment

Feature Extraction and Feature authoring

An interesting review: Deepak Singh: The value of feature extraction: Let’s start with a quote from a talk on Ambient Findability For every search on cancer.gov, there are over 100 cancer-related searches on public search engines. Of these searches, 70% … Continue reading

Posted in semanticWeb | Leave a comment

Gordon calls for Open Something

UK Prime Minister endorses public access to public info (Peter Suber, Open Access News) reports: Michael Cross, PM embraces the notion of easier access to government data, The Guardian, November 1, 2007.  Excerpt: The case for allowing free access to … Continue reading

Posted in open issues | Leave a comment

Open Source and Open Data

In the last post I commented on some of the limitations of licences to ensure Open Data. This post  now compares it with Open Source. I am a campaigner for Open Data (see Wikipedia) and on the advisory board for … Continue reading

Posted in open issues | Leave a comment