This is an old revision of the document!
Table of Contents
A series of textbooks on Undegraduate Topics in Computer Science
Data scientists grasp it as text mining, computer scientists call it natural language processing, in traditional academy it echoes the methods of corpus linguistics, while digital humanists desribe it as distant reading.
TEI by example: Tutorial into TEI.
AntConc: A freeware corpus analysis toolkit for concordancing and text analysis.
SIGHUM: Association of Computational lignustics Special Interest Group on Language Technologies for Socio-Economic Sciences and Humanities
DigiLing:Free online courses on computational linguistics
Voyant Tools: a web tool for analysis and visualization of digital texts
Hermeneutica is a collaborative project by Stéfan Sinclair & Geoffrey Rockwell to think through computer-assisted text analysis for humanists. This site is a compantion to the book Hermeneutica book (MIT, 2016) and Voyant Tools. This site includes online versions of the practical interlude chapters from the book with dynamic panels from Voyant.
Gensim: topic modeling for humans
Text network analysis
- GitHub with codes: https://github.com/SeNeReKo
- in python 2.7
- based on Natural language toolkit (http://www.nltk.org/): contains corpuses and vocabularies.
Morphodita for Python: lemmatizer of Czech, created by Milan Straka and Jana Straková from the Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, CUNI, Czech Republic.
Python wrapper for Majka: Majka is a linguistics tool for morphology analysis bulit by NLP center at MUNI.
bla bla bla - so many things to put here.
Python Web Applications - a tutorial on how to build them