This week we focus on a whole new type of data: text. Text mining methods let us analyze text data obtained from, for example, Twitter, doctors’ notes, or websites. We will touch upon the basics of this fascinating world: getting your data into the correct format, frequency analysis (what the number of times a word appears in a document can tell you about that document), sentiment analysis (how to analyze the emotion expressed in a text), and visualization.
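To make the idea of frequency analysis concrete, here is a minimal sketch in Python. It is not the lab's own code: the function name `word_frequencies`, the example sentence, and the simple tokenization rule (lowercase runs of letters and apostrophes) are all assumptions chosen for illustration.

```python
import re
from collections import Counter


def word_frequencies(text):
    """Count how often each word occurs in a piece of text.

    Hypothetical helper for illustration: it lowercases the text and treats
    every run of letters or apostrophes as one word.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(tokens)


# Made-up example text standing in for a short doctor's note.
note = "Patient reports mild pain. Pain improves with rest."
counts = word_frequencies(note)

# Show the three most frequent words with their counts.
print(counts.most_common(3))
```

Even in this tiny example the most frequent word ("pain") hints at what the note is about, which is the intuition behind frequency analysis.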
Complete questions 1-5 of the lab, thereby finishing the section “Part 1: to be completed at home before the lab”, and hand them in at least 2 hours before the start of the lab.