Markup of KorpusDK

Document Actions

The texts in KorpusDK have information about all words and texts

Word markup

All texts in KorpusDK have been automatically tagged at word level. This means that every word in the texts is supplied with information about part of speech and inflected form. This information is not necessarily visible in the corpus examples displayed in the concordance, but they are used extensively for querying. You can also choose to see the markup for yourself by selecting one or more of the attributes. This is done from the search result page for a concordance: Click the + button at the left side of the settings panel to unfold the panel for more options and select the attributes you want to display for the concordance.

The tagging was carried out using a Constraint Grammar based tagger, DanPars, developed by Eckhard Bick at the University of Southern Denmark. The tagger is available online at the home page of the VISL project, and here you can also find a query interface for syntactically parsed versions of Korpus 90 and Korpus 2000 as well as documentational pages for the categories and tags used in the markup.

Textual information

Each text is supplied with information about the title, year of publication and text genre and with information about the name, gender and year of birth of the author, provided it is available. If you click on a concordance line you will get a view where you can see a larger linguistic context and the details of the textual information.

Sections

Markup of KorpusDK

Document Actions

Word markup

Textual information