-
Recent Posts
-
Recent Comments
- pm286 on ContentMine at IFLA2017: The future of Libraries and Scholarly Communications
- Hiperterminal on ContentMine at IFLA2017: The future of Libraries and Scholarly Communications
- Next steps for Text & Data Mining | Unlocking Research on Text and Data Mining: Overview
- Publishers prioritize “self-plagiarism” detection over allowing new discoveries | Alex Holcombe's blog on Text and Data Mining: Overview
- Kytriya on Let’s get rid of CC-NC and CC-ND NOW! It really matters
-
Archives
- June 2018
- April 2018
- September 2017
- August 2017
- July 2017
- November 2016
- July 2016
- May 2016
- April 2016
- December 2015
- November 2015
- September 2015
- May 2015
- April 2015
- January 2015
- December 2014
- November 2014
- September 2014
- August 2014
- July 2014
- June 2014
- May 2014
- April 2014
- March 2014
- February 2014
- January 2014
- December 2013
- November 2013
- October 2013
- September 2013
- August 2013
- July 2013
- May 2013
- April 2013
- March 2013
- February 2013
- January 2013
- December 2012
- November 2012
- October 2012
- September 2012
- August 2012
- July 2012
- June 2012
- May 2012
- April 2012
- March 2012
- February 2012
- January 2012
- December 2011
- November 2011
- October 2011
- September 2011
- August 2011
- July 2011
- May 2011
- April 2011
- March 2011
- February 2011
- January 2011
- December 2010
- November 2010
- October 2010
- September 2010
- August 2010
- July 2010
- June 2010
- May 2010
- April 2010
- August 2009
- July 2009
- June 2009
- May 2009
- April 2009
- March 2009
- August 2008
- July 2008
- June 2008
- May 2008
- April 2008
- March 2008
- February 2008
- January 2008
- December 2007
- November 2007
- October 2007
- September 2007
- August 2007
- July 2007
- June 2007
- May 2007
- April 2007
- December 2006
- November 2006
- October 2006
- September 2006
-
Categories
- "virtual communities"
- ahm2007
- berlin5
- blueobelisk
- chemistry
- crystaleye
- cyberscience
- data
- etd2007
- fun
- general
- idcc3
- jisc-theorem
- mkm2007
- nmr
- open issues
- open notebook science
- oscar
- programming for scientists
- publishing
- puzzles
- repositories
- scifoo
- semanticWeb
- theses
- Uncategorized
- www2007
- XML
- xtech2007
-
Meta
Category Archives: Uncategorized
Data-driven scholarship
(I’m afraid I’m going to bang on again about access to data). I’ve been at the CNI and the NSF/JISC meeting for which I wrote a position paper. The meeting was on Digital Repositories and Data-driven Scholarship (Science). My paper … Continue reading
Posted in Uncategorized
2 Comments
my terabytes
I’ve been offline for ca. 2 days – staying in a hotel which dates from the days of Mae West (she stayed there) but where the internet only works in one place if you hide under the bed. The closing … Continue reading
Posted in Uncategorized
Leave a comment
PDFBox and OCR
Ben Litchfield is the(?) current guru of PDFBox and has updated me on PDFBox. (I copy it here as although I think Jim has fixed the Blog (thanks, Jim) I won’t take chances.) Name: Ben Litchfield URI: http://www.pdfbox.org/ | IP: 170.37.224.2 … Continue reading
Posted in Uncategorized
5 Comments
PDFBox and Hamburgers – the story continues
I blogged recently about how I used PDFBox to turn chemical theses (in hamburger PDF) into text. I have now found some interesting (and I think exciting) developments – but I’d like a reality check. I downloaded a thesis from … Continue reading
Posted in Uncategorized
2 Comments
Rise of the Chemical Blogosphere
Another snippet from ChemBark some months ago but highly relevant News Story of 2006: The Rise of the Chemical Blogosphere I have no doubt that the chemical blogosphere is here to stay and adds important new directions. I said this … Continue reading
Posted in Uncategorized
Leave a comment
Chemical Citizen of 2006 (Wikipedia, Blue obelisk, etc.)
I am trying to get my past blog-stuff sorted out – some of my unpublishined snippets may appear in random order. I had selected Chem-Bark’s post: Chemical Citizen of 2006: Wikipedia User “V8rik” where CB lauds the contributions from “V8rik”. … Continue reading
Posted in Uncategorized
Leave a comment
IDID – Idea-Design-Implementation-Dissemination
Software development is hard. Tedious. Frustrating. It usually takes much longer than anyone, including the author, thinks. So what tools and philosophy are useful to the solo – or near-solo – Open Source programmer? Here are some thoughts which you’re … Continue reading
Posted in Uncategorized
Leave a comment
Data Aggregators or the Gift Economy?
The C20th saw the rise and value of scientific data aggregators – organisation who extracted data from the literature, cleaned it, packaged it and offered it for re-use. In some cases they got grants to support this, but most moved … Continue reading
Posted in Uncategorized
Leave a comment
WWMM – The World Wide Molecular Matrix
We have been working on a general, fluid, concept which we labelled “World Wide Molecular Matrix” – starting about 2001. (We actually put in a grant application under that name to the then new UK eScience programme – it didn’t … Continue reading
Posted in Uncategorized
2 Comments
Hamburgers – theses in PDF
Having blogged about the excitement of automatic reading and semantic enhancement of chemical theses I come to the startk reality of PDF. “Turning PDF into XML is like turning a hamburger back into a cow” (anon). So I searched for … Continue reading
Posted in Uncategorized
1 Comment