#jiscxyz #okfn #quixotechem
I’m off to #JISCMRD (Managing Research Data) to hear about the new round of projects including our own JISCXYZ. Ours concentrates on the publication of data and we are working with publishers to save and validate data at early stages in the publication process.
Meanwhile here’s an indication of how to destroy data (supplemental data):
That’s the commonest method. And here’s another (http://www.rsc.org/suppdata/OB/b2/b209981k/geometry.pdf ). This file could have released useful data to the world. In fact it destroyed it by putting it into PDF. The file should have looked like:
#P B3LYP/AUG-CC-PVDZ OPT=TIGHT GEOM=CHECK GUESS=READINT=ULTRAFINE\\Be
D001 with INT=ULTRAFINE\,2\C,0.1063168353,0.3005635652,-0.5502851935
Notice the precise formatting. This is REQUIRED to read the file in. Instead the author or the publisher (neither of whom apparently care) tipped it into PDF which introduced spurious line ends. It’s UNREADABLE by a machine. Follow the link and Read the file and see what I mean .
It’s beautiful and garbage. A sickly hamburger.
That’s because almost all publishers don’t care about data. Which means that many of their publications are second-rate. Many are suspect scientifically because the data aren’t published.