Author Archives: pm286

TheContentMine is Ready for Business and will make scientific and medical facts available to everyone on a massive scale.

Posted on April 16, 2015 by pm286

It’s a year since I started TheContentMine (contentmine.org) – a project funded by the Shuttleworth Foundation. In ContentMine we intend to extract all the world’s scientific and medical facts from the scholarly literature and make them available to everyone … Continue reading →

Posted in Uncategorized | 4 Comments

32-year old Elsevier paper could have averted Ebola but Liberians would have had to pay to read it

Posted on April 15, 2015 by pm286

I am very angry with the publishing industry. Last week the NY Times reported that the Ministry of Health in Liberia had discovered a 30-year old paper that, if they had known about it, might have alerted Liberians to the … Continue reading →

Posted in Uncategorized | 3 Comments

Content Mining Hackday in Cambridge this Friday 20150123 all welcome

Posted on January 19, 2015 by pm286

We are having a ContentMine hackday – open to all – this Friday in Cambridge https://www.eventbrite.co.uk/e/contentmining-hackday-in-cambridge-facilitated-by-contentmine-tickets-716287435 . We are VERY grateful to Laura James, from our Advisory Board, who also set up the Cambridge Makespace where the event will be … Continue reading →

Posted in Uncategorized | 1 Comment

This month's typographical horror: Researchers PAY typesetters to corrupt information

Posted on January 19, 2015 by pm286

One of the “benefits” we get from paying publishers to publish our work is that they “typeset” it. Actually they don’t. They pay typesetters to mutilate it. I don’t know how much they pay but it’s probably > 10 USD … Continue reading →

Posted in Uncategorized | 11 Comments

FORCE2015 ContentMine Workshop/hack – we are going to index the scientific literature and clinical trials…

Posted on January 12, 2015 by pm286

TL;DR We had a great session at FORCE2015 yesterday in Oxford – people liked it, understood it, and want to join us. We ran a pre-conference workshop for 3 hours followed by an extra hack. This was open to all … Continue reading →

Posted in Uncategorized | 3 Comments

FORCE2015 Workshop: How ContentMine works for you and what you can bring

Posted on January 8, 2015 by pm286

TL;DR. We outline the tools and pipeline which ContentMine will show on Sunday at FORCE2015. They are very general and accessible to everyone…. ContentMine technology and community are maturing quickly. We’ve just had a wonderful three days in Berlin with … Continue reading →

Posted in Uncategorized | 1 Comment

ContentMine Update and FORCE2015; we read and index the daily scholarly literature

Posted on January 7, 2015 by pm286

We’ve been very busy and I haven’t blogged as much as I’d have liked. Here’s an update and news about immediate events. Firstly, a welcome to Graham Steel (McDawg), who is joining us as community manager. Graham is a massive figure in the … Continue reading →

Posted in Uncategorized | 1 Comment

Wiley's "Free to read" actually means "pay 35 USD"

Posted on December 16, 2014 by pm286

I got the above unwanted tweet from Wiley (I have checked as far as possible that it’s genuine). It seems to be Wiley advertising a free to read article. I have pasted the message so you can try this at … Continue reading →

Posted in Uncategorized | 3 Comments

How publishers destroy science: Elsevier's XML, API and the disappearing chemical bond. DO NOT BUY XML

Posted on December 15, 2014 by pm286

TL;DR Elsevier typesetting turns double bonds into garbage. Those of you who follow this blog will know that I contend that publishers corrupt manuscripts and thereby destroy science. Those of you who follow this blog will know that Elsevier publicly … Continue reading →

Posted in Uncategorized | 6 Comments

Publishers' typesetting destroys science: They are all as bad as each other. Can you spot the error?

Posted on December 13, 2014 by pm286

I’ve just been trying to mine publicly visible scientific publications from scholarly publishers. (That’s right – “publicly visible” – Hargreaves comes later). AND THE TECHNICAL QUALITY IS AWFUL. PUBLISHERS DESTROY SCIENCE THROUGH THEIR TECHNICAL INCOMPETENCE AND INDIFFERENCE. They destroy the … Continue reading →

Posted in Uncategorized | 10 Comments
