Tundra

From WebLichtWiki

(Difference between revisions)
Jump to: navigation, search
(Update TüNDRA treebank list.)
(Using TüNDRA)
Line 1: Line 1:
 
TüNDRA - the ''Tübingen aNnotated Data Retrieval Application'' - is a treebank search application based in part on the popular but no longer supported ''TIGERsearch'' program.
 
TüNDRA - the ''Tübingen aNnotated Data Retrieval Application'' - is a treebank search application based in part on the popular but no longer supported ''TIGERsearch'' program.
 +
 +
== What is TüNDRA ==
 +
 +
TüNDRA (Tübingen Annotated Data Retrieval Application) is a web-based application that allows users to search, visualise and generate statistics from treebanks. A number of treebanks are provided, but users may also use the tool to explore resources developed via Weblicht. It uses a lightweight query language inspired by the widely used TIGERSearch application, offering. offering corpus linguists an interface for using corpora with complex annotation and syntactic links.
  
 
== Using TüNDRA ==
 
== Using TüNDRA ==
Line 5: Line 9:
 
TüNDRA is available via the WebLicht website at [https://weblicht.sfs.uni-tuebingen.de/Tundra https://weblicht.sfs.uni-tuebingen.de/Tundra].  Login is with any valid WebLicht access credentials.
 
TüNDRA is available via the WebLicht website at [https://weblicht.sfs.uni-tuebingen.de/Tundra https://weblicht.sfs.uni-tuebingen.de/Tundra].  Login is with any valid WebLicht access credentials.
  
TüNDRA can be used to search the following treebanks:
+
== Available data ==
 +
 
 +
TüNDRA can be used to explore data created in WebLicht. The application also provides access to the following treebanks:
  
 +
* [http://universaldependencies.org/ Universal Dependencies treebanks for 47 languages]
 
* [http://www.sfs.uni-tuebingen.de/en/ascl/resources/corpora/tueba-dz.html TüBa-D/Z treebank of German]
 
* [http://www.sfs.uni-tuebingen.de/en/ascl/resources/corpora/tueba-dz.html TüBa-D/Z treebank of German]
 
* [http://www.sfs.uni-tuebingen.de/de/ascl/ressourcen/corpora/tueba-ds.html TüBa-D/S treebank of spoken German]
 
* [http://www.sfs.uni-tuebingen.de/de/ascl/ressourcen/corpora/tueba-ds.html TüBa-D/S treebank of spoken German]
Line 13: Line 20:
 
* [http://www.bultreebank.org/ HPSG-based Syntactic Treebank of Bulgarian]
 
* [http://www.bultreebank.org/ HPSG-based Syntactic Treebank of Bulgarian]
 
* [http://itreebank.marginalia.it/ Index Thomisticus Treebank]
 
* [http://itreebank.marginalia.it/ Index Thomisticus Treebank]
 +
* [http://proiel.github.io/ Ancient Greek and Latin parts of the PROIEL treebank]
 +
* [https://www.linguistik.hu-berlin.de/de/institut/professuren/korpuslinguistik/forschung/nosta-d NoSta-D corpus of nonstandard and normalized German]

Revision as of 08:32, 9 January 2017

TüNDRA - the Tübingen aNnotated Data Retrieval Application - is a treebank search application based in part on the popular but no longer supported TIGERsearch program.

What is TüNDRA

TüNDRA (Tübingen Annotated Data Retrieval Application) is a web-based application that allows users to search, visualise and generate statistics from treebanks. A number of treebanks are provided, but users may also use the tool to explore resources developed via Weblicht. It uses a lightweight query language inspired by the widely used TIGERSearch application, offering. offering corpus linguists an interface for using corpora with complex annotation and syntactic links.

Using TüNDRA

TüNDRA is available via the WebLicht website at https://weblicht.sfs.uni-tuebingen.de/Tundra. Login is with any valid WebLicht access credentials.

Available data

TüNDRA can be used to explore data created in WebLicht. The application also provides access to the following treebanks: