Chapter 7 Advanced manipulations of corpora
In this chapter, we will continue using R for the qualitative analyses of textual corpora.
After importing textual datasets, we will look at how to manipulate them. We will introduce several new notions: keywords in context (KWIC), compound words, n-grams, dictionaries, part-of-speech tagging, and feature co-occurrence matrices (FCM). We will also look at how to visualise these datasets.