Scraped/typed into Arcturus
Continued incremental progress…
I know it’s the summer vacation but we need your help
even more. Now is your chance to
- Download documents robotically
- Run semantifiers over them
- Do natural language processing
- Extract data from raw text
- Create RDF and search it
These are reall transferable information skills that will be highly valued. Volunteers will be guided through this by experts (well we’ve been going a week so that makes us experts!)
All you have to do is volunteer. It will be fun. After all it is the holidays.
So where are we at?
- The patent downloader has been compiled, distributed and works. Start trying it out now
- We’ve downloaded and parsed patents back to 2000 so if they all work we’ll have gazillions of experiments
- We’re finalising the chemistry extraction over the next few hours.
Lezan Hawizy who wrote the chemicalTagger is now back in action. She and David Jessop are presenting their work (code, results) at the American Chemical Soc. In Boston. Be there!
What’s more to do?
- Build an aggregator for the green data. Probably solvents at first.
- Attach greenness to each solvent
- Develop data presentation. Sam Adams (who works with us and won the Dev8D memento challenge and was runner up in OR10’s DevSci) has shown me an idea that looks like being sensational. But you will have to come to the session at #solo10 to see its premiere! (Unless you are actually helping with this project, which of course you now will be)
- Legitimise other sources of data. Heather Piwowar (@researchremix) and I are setting out our principles for accessing and producing Open Data – we’ll be asking for responses through IsItOpen.
Hi,
sounds exiting. How can I become a volunteer?
Cheers,
Diego