Author Archives: pm286

TheContentMine is Ready for Business and will make scientific and medical facts available to everyone on a massive scale.

  It’s a year since I started TheContentMine (contentmine.org) – a project funded by the Shuttleworth Foundation. In ContentMine we intend to extract all the world’s scientific and medical facts from the scholarly literature and make them available to everyone … Continue reading

Posted in Uncategorized | 4 Comments

32-year old Elsevier paper could have averted Ebola but Liberians would have had to pay to read it

I am very angry with the publishing industry. Last week the NY Times reported that the Ministry of Health in Liberia had discovered a 30-year old paper that, if they had known about it, might have alerted Liberians to the … Continue reading

Posted in Uncategorized | 3 Comments

Content Mining Hackday in Cambridge this Friday 20150123 all welcome

We are having a ContentMine hackday – open to all – this Friday in Cambridge https://www.eventbrite.co.uk/e/contentmining-hackday-in-cambridge-facilitated-by-contentmine-tickets-716287435 . We are VERY grateful to Laura James, from our Advisory Board who also set up the Cambridge Makespace where the event will be … Continue reading

Posted in Uncategorized | 1 Comment

This month's typographical horror: Researchers PAY typesetters to corrupt information

One of the “benefits” we get from paying publishers to publish our work is that they “typeset” it. Actually they don’t. They pay typesetters to mutilate it. I don’t know how much they pay but it’s probably > 10 USD … Continue reading

Posted in Uncategorized | 11 Comments

FORCE2015 ContentMine Workshop/hack – we are going to index the scientific literature and clinical trials…

TL;DR We had a great session at FORCE2015 yesterday in Oxford – people liked it, understood it, and are wanting to join us. We ran a pre-conference workshop for 3 hours followed by extra hack. This was open to all … Continue reading

Posted in Uncategorized | 3 Comments

FORCE2015 Workshop: How ContentMine works for you and what you can bring

TL;DR. WE outline the tools and pipeline which ContentMine will show on Sunday at Force2015. They are very general and accessible to everyone…. ContentMine technology and community is maturing quickly. We’ve just had a wonderful three days in Berlin with … Continue reading

Posted in Uncategorized | 1 Comment

ContentMine Update and FORCE2015; we read and index the daily scholarly literature

We’ve been very busy and I haven’t blogged as much as I’d liked. Here’s an update and news about immediate events. Firstly to welcome Graham Steel (McDawg) who is joining us as community manager. Graham is a massive figure in the … Continue reading

Posted in Uncategorized | 1 Comment

Wiley's "Free to read" actually means "pay 35 USD"

I got the above unwanted Twitter from Wiley (I have checked as far as possible that it’s genuine). It seems to be Wiley advertising a free to read article. I have pasted the message so you can try this at … Continue reading

Posted in Uncategorized | 3 Comments

How publishers destroy science: Elsevier's XML, API and the disappearing chemical bond. DO NOT BUY XML

TL;DR Elsevier typsetting turns double bonds into garbage. Those of you who follow this blog will know that I contend that publishers corrupt manuscripts and thereby destroy science. Those of you who follow this blog will know that Elsevier publicly … Continue reading

Posted in Uncategorized | 6 Comments

Publishers' typesetting destroys science: They are all as bad as each other. Can you spot the error?

I’ve just been trying to mine publicly visible scientific publications from scholarly publishers. (That’s right – “publicly visible” – Hargreaves comes later). AND THE TECHNICAL QUALITY IS AWFUL. PUBLISHERS DESTROY SCIENCE THROUGH THEIR TECHNICAL INCOMPETENCE AND INDIFFERENCE. They destroy the … Continue reading

Posted in Uncategorized | 10 Comments