In the last post I explained why I became interested in contentmining to do scientific research and started to explain how it it is still a major political and legal challenge. I am excited that I have been asked to run a workshop at CopyCamp, and here is the information I am giving to participants. (You may also find my slides useful https://www.slideshare.net/petermurrayrust/contentmining-and-copyright-at-copycamp2017 ).
Workshops on TDM/contentmining cover many areas and the precise format of this one will depend on the participants. On the program notes I suggested:
- hackers (who can make tools such as R, Python, etc.) do exciting things
- scientists (including citizens) which want to explore questions in bioscience
- librarians who want to explore C21st ways of creating knowledge
- open activists who want to change policy both by political means and using tools
- young people. we have had wonderful contributions from a 15-year old
So if everyone wants to talk about European and UK copyright politics, that's fine. But we also have tools and tutorial showing how mining is done and we suggest people get some hands-on. It's probably going to be a good idea to work in small groups where there are complementary skills:
This is a VM and should be independent of the operating system of the host machine. It has been tested in several installations but there may be problems with non-US/UK keyboards and encodings. By default the tutorial is in English (all the resources, EuropePMC, dictionaries are also in English and generally use only ASCII 32-127.