DCA - Dynamic Corpus Analyzer
From WebLichtWiki
Revision as of 10:59, 20 March 2012 by Thomas Zastrow (Talk | contribs)
Contents |
Introduction
The Dynamic Corpus Analyzer (DCA) is a web application for analysis and visualization of files in Text Corpus Format (TCF). This is the processing format of WebLicht, which can be used to lingusitically annotate texts. After a text corpus is uploaded to DCA, a lot of different analysis and visualization can be applied to the corpus. This includes the token level (token, POS, lemma), pares trees (constituent and dependency) as well as statistical analysis and text laws for the corpus as a whole.
DCA uses two different levels of authentication and authorization (for getting an account, please contact Thomas Zastrow):
- With a guest account, it is possible to make use of all the integrated corpora
- For uploading your own corpora, you will need a manager account
File:Sc dca 1.png The Welcome Screen
Start
Corpus Übersicht
Management
Editor
Impressum
Anzeige
Text anzeigen
Sätze anzeigen
Konkordanzen
Frequenzen visualisieren
Wordline
Wortkreise
Statistik
Allgemeine Kennzahlen
Übergangswahrscheinlichkeiten
POS Statistik
Lemma Statistik
Suche
Wort
Wortpaar
POS eines Wortes
Phrase suchen
Syntax
Grammatik erstellen
Konstrukt suchen
Textgesetze
Länge-Frequenz
Type-Token Relationen
Parsetrees
XPath anwenden
Semantik
Semantik Übersicht
Impressum
The Dynamic Corpus Analyzer was developed by Thomas Zastrow.