Monthly Archives: August 2010

#solo10 An introduction to textmining and data extraction

Scraped/typed into Arcturus But now we'll show what we can get out of patents. Even if you aren't a chemist you should be able to follow this. It'll show you what text-mining is about and how we are looking for … Continue reading

Posted in Uncategorized | Leave a comment

#solo10 GreenChemicalReaction: Anatomy of a Patent

Scraped/typed into Arcturus As we are making progress I thought we'd let you have a look at what we are doing. Because we believe in Openness we're wearing our heart in the Open and telling you as it is created. … Continue reading

Posted in Uncategorized | Leave a comment

#solo10 GreenChemicalReaction; we’ll have something exciting to show at the meeting

Scraped/typed into Arcturus Typical hard slog day, gearing up to do text-mining on the chemical reactions extracted from the patent. Slower than I had thought but steady progress. The routines are all written – it's the glueware that is the … Continue reading

Posted in Uncategorized | Leave a comment

#solo10 GreenChainReaction: We need more volunteers – and yes, it will be fun!

Scraped/typed into Arcturus Continued incremental progress... I know it's the summer vacation but we need your help even more. Now is your chance to Download documents robotically Run semantifiers over them Do natural language processing Extract data from raw text … Continue reading

Posted in Uncategorized | 1 Comment

#solo10 GreenChainReaction: A greenness calculator

Scraped/typed into Arcturus Jean-Claude Bradley has made a useful contribution to the Green Chain Reaction (http://usefulchem.blogspot.com/2010/08/green-solvent-metric-on-solvent.html )... Even if you aren't a chemist, you should be able to follow the logic In the spirit of contributing to Peter Murray-Rust's initiative … Continue reading

Posted in Uncategorized | 1 Comment

#solo10 GreenChainReaction: Can you spot how green the reaction is?

Scraped/typed into Arcturus Mat Todd has made some great suggestions about what we can measure in the Green Chain reaction. Here's his comments on the Etherpad (http://okfnpad.org/solo10 ) and my additions. Please comment and add extra ideas. I'll show some … Continue reading

Posted in Uncategorized | Leave a comment

#solo10 GreenChainReaction: Update and continued request for help

Typed into Arcturus We are making excellent progress. Some things go faster, some slower as always. We now need a second round of volunteers. I'll detail what we have done and what needs to be done this week. Most of … Continue reading

Posted in Uncategorized | Leave a comment

#solo10 GreenChainReaction: Some chemical information is appalling quality, but does anyone care?

Typed into Arcturus Earlier I asked for the compound a patent image represented (EP_2050749A1/0026imgb0032.tif) "Could someone please tell me what the InChI or SMILES or CML is for this compound?" This was a slightly trick question as you have to … Continue reading

Posted in Uncategorized | Leave a comment

#solo10 GreenChainReaction: update and What is that Chemical Compound?

Typed into Arcturus The first pass of the automatic extraction of chemical information from patents is going well on a mechanical level.   One weekly index has 30-200 appropriate patents. Each has between 0 and 1500 images of chemical relevance … Continue reading

Posted in Uncategorized | 5 Comments

Supplementary Data must be published somewhere to validate the science

Dictated into Arcturus There has been quite a lot of discussion over the last days and about the decision by the Journal Of Neuroscience to stop posting supplemental data. This has been reviewed by Heather Piwowar (http://researchremix.wordpress.com/2010/08/13/supplementary-materials-is-a-stopgap-for-data-archiving/ ) with a … Continue reading

Posted in Uncategorized | 2 Comments