german
german,
a dataset directory which
contains some short German texts, for use in automatic
language tests.
Licensing:
The computer code and data files described and made available on this web page
are distributed under
the GNU LGPL license.
Related Data and Programs:
TEXT,
a dataset directory which
contains some short texts in English;
WORDS,
a dataset directory which
contains lists of words;
Datasets:
-
die_bremer_stadtmusikanten.txt,
the text of "Die Bremer Stadtmusikanten",
by the Brothers Grimm;
-
die_kuechenuhr.txt,
the text of "Die Kuechenuhr",
by Wolfgang Borchert;
-
die_verwandlung.txt,
the text of "Die Verwandlung",
by Franz Kafka. The text contains umlauts, the double-s (ß), and guillemets;
-
die_verwandlung_normalized.txt,
the text of "Die Verwandlung",
by Franz Kafka. The special German characters have been replaced,
using "umlaut_remover.py";
-
genesis.txt,
the first five chapters of Genesis;
-
massnahmen_gegen_die_gewalt.txt,
the text of "Massnahmen gegen die Gewalt",
by Bertolt Brecht;
-
vor_dem_gesetz.txt,
the text of "Vor dem Gesetz",
by Franz Kafka.
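Example:
The replacement of special German characters mentioned for the normalized copy of "Die Verwandlung" can be sketched in Python as follows. This is only an illustration. The contents of "umlaut_remover.py" are not shown on this page, so the function name normalize_german and the replacement table (ae, oe, ue, ss, and a plain double quote for guillemets) are assumptions, not the script's actual behavior.

```python
import unicodedata

# Hypothetical replacement table; the mapping actually used by
# "umlaut_remover.py" is not documented here.
REPLACEMENTS = {
    "ä": "ae", "ö": "oe", "ü": "ue",
    "Ä": "Ae", "Ö": "Oe", "Ü": "Ue",
    "ß": "ss",
    "»": '"', "«": '"',
}

# Build the translation table once; keys must be single characters.
_TABLE = str.maketrans(REPLACEMENTS)


def normalize_german(text: str) -> str:
    """Return text with umlauts, double-s, and guillemets replaced by ASCII."""
    # Compose characters first, so that "a" followed by a combining
    # diaeresis is treated the same as the single character "ä".
    composed = unicodedata.normalize("NFC", text)
    return composed.translate(_TABLE)
```

For instance, normalize_german("»Größe«") returns "Groesse" enclosed in plain double quotes. Applying the function to a whole file would also require opening it with the right encoding, which this page does not specify.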
Last revised on 29 August 2021.