Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Snippets of Folk Legends: Adapting a Text Mining Tool to a Collection of Folk Legends
Institute for Language and Folklore.
Institute for Language and Folklore, Språkrådet.
Institute for Language and Folklore, Avdelningen för arkiv och forskning i Göteborg.
2021 (English)In: Post-Proceedings of the 5th Conference Digital Humanities in the Nordic Countries (DHN 2020), 2021Conference paper, Published paper (Refereed)
Abstract [en]

A topic modelling tool was adapted to requirements for a collection of Swedish folk legends. To offer an overview of a list of folk legend texts, which had been automatically extracted by the topic modelling tool, snippet text versions of the folk legends were displayed. The snippets were constructed from the full-text versions of the legends using the sentences most relevant to the topics extracted by the topic modelling algorithm. In addition, collection-adapted data was constructed for performing a pre-processing of the folk legend texts, before they were submitted to the topic modelling algorithm. This data consisted of a collection-adapted stop word list and word lists for improving the quality of clusters of semantically similar words.

Place, publisher, year, edition, pages
2021.
National Category
Language Technology (Computational Linguistics)
Research subject
Language Technology
Identifiers
URN: urn:nbn:se:sprakochfolkminnen:diva-2074OAI: oai:DiVA.org:sprakochfolkminnen-2074DiVA, id: diva2:1604766
Conference
5th Conference Digital Humanities in the Nordic Countries (DHN 2020)
Projects
Nationella språkbanken
Funder
Swedish Research Council, 2017-00626Available from: 2021-10-21 Created: 2021-10-21 Last updated: 2021-12-29Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

http://ceur-ws.org/Vol-2865/poster5.pdf

Authority records

Skeppstedt, MariaDomeij, RickardSkott, Fredrik

Search in DiVA

By author/editor
Skeppstedt, MariaDomeij, RickardSkott, Fredrik
By organisation
Institute for Language and FolkloreSpråkrådetAvdelningen för arkiv och forskning i Göteborg
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 187 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf