1.
Introduction
Install packages
R commands
2.
Data Import
Pre-formatted files
Multiple text files
Different encodings
3.
Basic Operations
Workflow
Corpus
Construct a corpus
Subset corpus
Change units of texts
Extract tags from texts
Tokens
Construct a tokens object
Keyword-in-contexts
Select tokens
Compound tokens
Look up dictionary
Genarate n-grams
Document-feature matrix
Construct a DFM
Select features
Look up dictionary
Group documents
Feature co-occurence matrix
Construct a FCM
4.
Statistical Analysis
Simple frequency analysis
Lexical diversity
Document/feature similarity
Relative frequency analysis (keyness)
Collocation analysis
5.
Advanced Operations
Similarity between Twitter users
Compound multi-word expressions
Targeted dictionary analysis
Target-word collocations
6.
Scaling and Classification
Naive Bayes classifier
Wordscores
Wordfish
Correspondence analysis
Topic models
Newsmap
7.
Different Languages
Japanese
More
Github repo
Website
Clear History
Edit this page
quanteda tutorials
>
Basic Operations
> Document-feature matrix
Document-feature matrix
Learn how to construct and modify a document-feature matrix