nsyllable::data_syllables_en
Syllable counts of English wordsquanteda::data_char_sampletext
A paragraph of text for testing various text-based functionsquanteda::data_char_ukimmig2010
Immigration-related sections of 2010 UK party manifestosquanteda::data_corpus_inaugural
US presidential inaugural address textsquanteda::data_dfm_lbgexample
dfm from data in Table 1 of Laver, Benoit, and Garry (2003)quanteda::data_dictionary_LSD2015
Lexicoder Sentiment Dictionary (2015)quanteda.textmodels::data_corpus_EPcoaldebate
Crowd-labelled sentence corpus from a 2010 EP debate on coal subsidiesquanteda.textmodels::data_corpus_dailnoconf1991
Confidence debate from 1991 Irish Parliamentquanteda.textmodels::data_corpus_irishbudget2010
Irish budget speeches from 2010quanteda.textmodels::data_corpus_moviereviews
Movie reviews with polarity from Pang and Lee (2004)quanteda.textstats::data_char_wordlists
Word lists for readability statisticsreadtext::data_char_encodedtexts
encoded texts for testingspacyr::data_char_paragraph
A short paragraph of text for testingspacyr::data_char_sentences
Sample short documents for testingstopwords::data_stopwords_ancient
stopword lists for ancient languagesstopwords::data_stopwords_marimo
stopword lists including parts-of-speechstopwords::data_stopwords_misc
miscellaneous stopword listsstopwords::data_stopwords_nltk
stopword lists from the Python NLTK librarystopwords::data_stopwords_perseus
stopword lists for ancient languages - Perseus Digital Librarystopwords::data_stopwords_smart
stopword lists from the SMART systemstopwords::data_stopwords_snowball
snowball stopword liststopwords::data_stopwords_stopwordsiso
multilingual stopwords from https://github.com/stopwords-iso/stopwords-iso