Text+Berg Corpora

Access Modalities

(1) Access via website of the Swiss Alpine Club

Please use the search engine of the SAC website to get access to the yearbooks and the current issues of the journal “Die Alpen”: http://alpen.sac-cas.ch/de/zeitschrift/suche/suche-in-die-alpen/

(2) Access via Deutsches Textarchiv DTA

The text collection platform “Deutsches Textarchiv DTA” offers access to the data in various formats: http://www.deutschestextarchiv.de/news/67

(3) Access to the corpus for linguistic research

The access to the online corpus is intended for research purposes. The online corpus is no book archive. It is not possible to display entire articles but only excerpts from texts. You have however all common corpus linguistics tools at your disposal to investigate the corpus: full text search, Part-of-Speech tags, keyword search, collocations, etc.

  • Access to the corpora (password protected)
  • Please fill in this form to request access to the corpus.
  • Please note: Students of Computation Linguistics at the University of Zurich and participants of the MOOC “Sprachtechnologie in den Digital Humanities” should register at the following address: https://pub.cl.uzh.ch/service/cqpweb/

The current versions of the corpora can be investigated online.

The corpus must be used for scientific purposes only. Commercial use is prohibited. The origin of the data always has to be cited (www.textberg.ch/site). We propose the following citation:

Bubenhofer, Noah / Volk, Martin / Leuenberger, Fabienne / Wüest, Daniel (Hrsg.): Text+Berg-Korpus (Release 151v01). Digitale Edition des Jahrbuch des SAC 1864-1923, Echo des Alpes 1872-1924, Die Alpen, Les Alpes, Le Alpi 1925-2014, The Alpine Journal 1969-2008: Institut für Computerlinguistik, Universität Zürich, 2015.

Release 151 v01:

@MISC{TextBerg_Release_151_v01_2015,
 editor = {Noah Bubenhofer and Martin Volk and Fabienne Leuenberger and Daniel Wüest},
 year = 2015,
 title = {{Text+Berg}-Korpus (Release 151_v01)},
 note = {Digitale Edition des Jahrbuch des SAC 1864-1923, Echo des Alpes 1872-1924, Die Alpen, Les Alpes, Le Alpi 1925-2014, The Alpine Journal 1969-2008},
 howpublished = {XML-Format},
 school = {Institut für Computerlinguistik, Universität Zürich}
}

Release 147 v03:

@MISC{TextBerg_Release_147_v03_2013,
  editor = {Noah Bubenhofer and Martin Volk and David Klaper and Manuela Weibel and Daniel Wüest},
  year = 2013,
  title = {{Text+Berg}-Korpus (Release 147_v03)},
  note = {Digitale Edition des Jahrbuch des SAC 1864-1923, Echo des Alpes 1872-1924 und Die Alpen 1925-2011},
  howpublished = {XML-Format},
  school = {Institut für Computerlinguistik, Universität Zürich}
}

Published Releases of the Corpora

  • Text+Berg-Korpus, Release 151, v01, 11. April 2015: SAC (Jahrbuch des SAC, Alpen) – 1864-2014, Echo des Alpes – 1872-1924, Alpine Journal – 1969-2008; Release NotesChanges