DCA - Dynamic Corpus Analyzer

From WebLichtWiki


Revision as of 10:58, 20 March 2012

Introduction

The Dynamic Corpus Analyzer (DCA) is a web application for analyzing and visualizing files in Text Corpus Format (TCF), the processing format of WebLicht, which can be used to linguistically annotate texts. After a text corpus is uploaded to DCA, a range of analyses and visualizations can be applied to it. These cover the token level (token, POS, lemma), parse trees (constituent and dependency), and statistical analyses and text laws for the corpus as a whole.
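To make the token-level and statistical views more concrete, here is a minimal Python sketch of the kind of computation involved: it reads the tokens from a small TCF-like document and derives token count, type count and type-token ratio. This is not DCA's own code. The sample document is a simplified, hypothetical fragment, the element names and namespace in it are assumptions about the TCF layout, and the helper names `read_tokens` and `token_stats` are invented for illustration.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Simplified, hypothetical TCF-like fragment. Element names and the namespace
# are assumptions; real WebLicht output carries more layers and metadata.
SAMPLE = """<TextCorpus xmlns="http://www.dspin.de/data/textcorpus" lang="en">
  <tokens>
    <token ID="t1">the</token>
    <token ID="t2">cat</token>
    <token ID="t3">saw</token>
    <token ID="t4">the</token>
    <token ID="t5">dog</token>
  </tokens>
</TextCorpus>"""


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def read_tokens(xml_text):
    """Return the text of every <token> element, in document order."""
    root = ET.fromstring(xml_text)
    return [el.text or "" for el in root.iter() if local_name(el.tag) == "token"]


def token_stats(tokens):
    """Compute token count, type count and type-token ratio (case-insensitive)."""
    freq = Counter(t.lower() for t in tokens)
    n = len(tokens)
    return {
        "tokens": n,
        "types": len(freq),
        "ttr": len(freq) / n if n else 0.0,
        "frequencies": freq,
    }


if __name__ == "__main__":
    stats = token_stats(read_tokens(SAMPLE))
    print(stats["tokens"], stats["types"], stats["ttr"])
```

Matching on the local element name rather than a fixed namespace keeps the sketch usable even if the namespace in a real file differs from the one assumed here.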

DCA has two levels of authentication and authorization (to get an account, please contact Thomas Zastrow):

  • With a guest account, it is possible to make use of all the integrated corpora
  • For uploading your own corpora, you will need a manager account


The Welcome Screen

Start

Corpus Overview (Corpus Übersicht)

Sc dca 2.png

Management

Sc dca 3.png

Editor

Sc dca 4.png

Imprint (Impressum)

Display (Anzeige)

Show Text (Text anzeigen)

Sc dca 5.png

Show Sentences (Sätze anzeigen)

Sc dca 6.png

Concordances (Konkordanzen)

Sc dca 7.png

Visualize Frequencies (Frequenzen visualisieren)

Sc dca 8.png

Wordline

Sc dca 9.png

Word Circles (Wortkreise)

Sc dca 10.png

Statistics (Statistik)

General Key Figures (Allgemeine Kennzahlen)

Sc dca 11.png

Transition Probabilities (Übergangswahrscheinlichkeiten)

Sc dca 12.png

POS Statistics (POS Statistik)

Sc dca 13.png

Lemma Statistics (Lemma Statistik)

Sc dca 14.png

Search (Suche)

Word (Wort)

Sc dca 15.png

Word Pair (Wortpaar)

Sc dca 16.png

POS of a Word (POS eines Wortes)

Sc dca 17.png

Search Phrase (Phrase suchen)

Sc dca 18.png

Syntax

Create Grammar (Grammatik erstellen)

Sc dca 19.png

Search Construct (Konstrukt suchen)

Sc dca 20.png

Text Laws (Textgesetze)

Length-Frequency (Länge-Frequenz)

Sc dca 21.png

Type-Token Relations (Type-Token Relationen)

Sc dca 22.png

Parse Trees (Parsetrees)

Apply XPath (XPath anwenden)

Sc dca 23.png

Semantics (Semantik)

Semantics Overview (Semantik Übersicht)

Sc dca 24.png

Imprint (Impressum)

The Dynamic Corpus Analyzer was developed by Thomas Zastrow.