Package: tosca Type: Package Title: Tools for Statistical Content Analysis Version: 0.3-4 Authors@R: c(person("Lars", "Koppers", email="koppers@statistik.tu-dortmund.de", role=c("aut", "cre"), comment = c(ORCID = "0000-0002-1642-9616")), person("Jonas", "Rieger", email="jonas.rieger@tu-dortmund.de", role=c("aut"), comment = c(ORCID = "0000-0002-0007-4478")), person("Karin", "Boczek", email="karin.boczek@tu-dortmund.de", role=c("ctb"), comment = c(ORCID = "0000-0003-1516-4094")), person("Gerret", "von Nordheim", email="gerret.vonnordheim@tu-dortmund.de", role=c("ctb"), comment = c(ORCID = "0000-0001-7553-3838"))) Description: A framework for statistical analysis in content analysis. In addition to a pipeline for preprocessing text corpora and linking to the latent Dirichlet allocation from the 'lda' package, plots are offered for the descriptive analysis of text corpora and topic models. In addition, an implementation of Chang's intruder words and intruder topics is provided. Sample data for the vignette is included in the toscaData package, which is available on gitHub: . URL: https://github.com/Docma-TU/tosca, https://doi.org/10.5281/zenodo.3591068 License: GPL (>= 2) Encoding: UTF-8 Depends: R (>= 3.5.0) Imports: tm (>= 0.7-5), lda (>= 1.4.2), quanteda (>= 1.4.0), lubridate (>= 1.7.3), htmltools (>= 0.3.6), RColorBrewer (>= 1.1-2), stringr (>= 1.3.1), WikipediR (>= 1.5.0), data.table (>= 1.11.4) Suggests: toscaData, testthat (>= 2.0.0), knitr (>= 1.20), devtools (>= 1.13), rmarkdown (>= 1.9) RoxygenNote: 7.3.2 VignetteBuilder: knitr Config/pak/sysreqs: libicu-dev libxml2-dev libssl-dev Repository: https://docma-tu.r-universe.dev Date/Publication: 2025-04-22 05:30:18 UTC RemoteUrl: https://github.com/docma-tu/tosca RemoteRef: HEAD RemoteSha: 1b7601ae4b9552926ca4c4234d2efffc924c8966 NeedsCompilation: no Packaged: 2026-06-11 09:34:28 UTC; root Author: Lars Koppers [aut, cre] (ORCID: ), Jonas Rieger [aut] (ORCID: ), Karin Boczek [ctb] (ORCID: ), Gerret von Nordheim [ctb] (ORCID: ) Maintainer: Lars Koppers