DCA - Dynamic Corpus Analyzer
The Dynamic Corpus Analyzer (DCA) is a web application for the analysis and visualization of files in Text Corpus Format (TCF). TCF is the processing format of WebLicht, which can be used to linguistically annotate texts. After a text corpus is uploaded to DCA, a range of analyses and visualizations can be applied to it. These cover the token level (token, POS, lemma), parse trees (constituent and dependency), and statistical analysis and text laws for the corpus as a whole.
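To make the token level more concrete, here is a minimal Python sketch that reads tokens, lemmas and POS tags from a small TCF-like document and derives a rank-frequency list, the kind of table a text law such as Zipf's law is usually checked against. The sample XML is a simplified, hypothetical stand-in (real TCF files carry namespaces, metadata and further layers), and the helper names `read_layer`, `token_table` and `rank_frequency` are illustrative assumptions, not part of DCA.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Simplified, hypothetical TCF-like sample (assumption: real files are richer).
SAMPLE = """<TextCorpus lang="en">
  <tokens>
    <token ID="t1">The</token>
    <token ID="t2">cats</token>
    <token ID="t3">sleep</token>
    <token ID="t4">near</token>
    <token ID="t5">the</token>
    <token ID="t6">door</token>
  </tokens>
  <lemmas>
    <lemma tokenIDs="t1">the</lemma>
    <lemma tokenIDs="t2">cat</lemma>
    <lemma tokenIDs="t3">sleep</lemma>
    <lemma tokenIDs="t4">near</lemma>
    <lemma tokenIDs="t5">the</lemma>
    <lemma tokenIDs="t6">door</lemma>
  </lemmas>
  <POStags tagset="penn">
    <tag tokenIDs="t1">DT</tag>
    <tag tokenIDs="t2">NNS</tag>
    <tag tokenIDs="t3">VBP</tag>
    <tag tokenIDs="t4">IN</tag>
    <tag tokenIDs="t5">DT</tag>
    <tag tokenIDs="t6">NN</tag>
  </POStags>
</TextCorpus>"""


def local(tag):
    """Strip an XML namespace prefix such as '{uri}token' down to 'token'."""
    return tag.rsplit("}", 1)[-1]


def read_layer(root, container_name):
    """Map token ID -> annotation text for the children of one annotation layer."""
    layer = {}
    for container in root.iter():
        if local(container.tag) != container_name:
            continue
        for child in container:
            for token_id in child.get("tokenIDs", "").split():
                layer[token_id] = child.text
    return layer


def token_table(xml_text):
    """Return a list of (token, lemma, POS) tuples in document order."""
    root = ET.fromstring(xml_text)
    tokens = [
        (child.get("ID"), child.text)
        for container in root.iter()
        if local(container.tag) == "tokens"
        for child in container
    ]
    lemmas = read_layer(root, "lemmas")
    pos = read_layer(root, "POStags")
    return [(text, lemmas.get(tid), pos.get(tid)) for tid, text in tokens]


def rank_frequency(rows):
    """Return (rank, word form, frequency) sorted by descending frequency."""
    counts = Counter(text.lower() for text, _, _ in rows)
    return [
        (rank, word, freq)
        for rank, (word, freq) in enumerate(counts.most_common(), start=1)
    ]


rows = token_table(SAMPLE)
for rank, word, freq in rank_frequency(rows):
    print(rank, word, freq)
```

Matching elements by local name keeps the sketch independent of the exact namespace URI, which may differ between TCF versions.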
DCA uses two levels of authentication and authorization (to get an account, please contact Thomas Zastrow):
- With a guest account, you can use all of the integrated corpora
- To upload your own corpora, you need a manager account
The following documentation explains the individual functions of DCA. In general, you choose a function from the menu at the top. On the left, you see a set of options, some of which are mandatory, and a button for executing the function on the chosen corpus.
After logging in to DCA, the following welcome screen appears:
All functionality in DCA is available via the menus at the top. Important: before you execute any function, you must first choose a corpus from the drop-down list on the left. Once you have chosen a corpus, it stays active until you close the browser:
The "Corpus Übersicht" (corpus overview) shows you which corpora are in the system and which linguistic annotations they contain.
If you have the necessary rights, you can upload new corpora in TCF 0.3 format to DCA here.
The editor is a simple tool for editing the token-level information of a corpus. You can step through the tokens of a corpus, edit token, lemma and POS information, and save the edited information back to the system.
POS of a word
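As a rough illustration of what a token-level edit amounts to, the following Python sketch changes the POS tag of a single token in a simplified TCF-like document and serializes the result. It is not how DCA's editor is implemented: the sample XML omits the namespaces and metadata of real TCF files, and `set_pos` is a hypothetical helper introduced only for this example.

```python
import xml.etree.ElementTree as ET

# Simplified, hypothetical TCF-like sample without namespaces (assumption).
SAMPLE = """<TextCorpus>
  <tokens>
    <token ID="t1">Time</token>
    <token ID="t2">flies</token>
  </tokens>
  <POStags tagset="penn">
    <tag tokenIDs="t1">NN</tag>
    <tag tokenIDs="t2">NNS</tag>
  </POStags>
</TextCorpus>"""


def set_pos(root, token_id, new_tag):
    """Replace the POS tag attached to token_id; return True if one was updated."""
    for tag in root.iter("tag"):
        if token_id in tag.get("tokenIDs", "").split():
            tag.text = new_tag
            return True
    return False


root = ET.fromstring(SAMPLE)
changed = set_pos(root, "t2", "VBZ")  # correct "flies" from plural noun to verb
updated_xml = ET.tostring(root, encoding="unicode")
print(updated_xml)
```

The token itself is left untouched; only the annotation that refers to it through `tokenIDs` is rewritten, which mirrors the layered structure in which TCF stores annotations separately from the tokens.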
The Dynamic Corpus Analyzer was developed by Thomas Zastrow.