Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Handwritten Text Recognition and Linguistic Research
Institute for Language and Folklore, Department of Dialectology, Onomastics and Folklore Research, Gothenburg.
2020 (English)In: Proceedings of the Digital Humanities in the Nordic Countries 5th Conference (DHN 2020), Riga, Latvia, October 21-23, 2020. / [ed] Sanita Reinsone, Inguna Skadiņa, Anda Baklāne, Jānis Daugavietis, Riga, 2020, Vol. 2612, p. 302-309Conference paper, Published paper (Refereed)
Abstract [en]

This paper presents ongoing work with automatic transcription of handwritten, phonetically precise dialect texts from the south-west of Sweden (collected in the 1890s). Using a SAMPA-based transcription key (where SAMPA stands for Speech Assessment Methods Alphabet), I have enabled the training of an HTR engine (where HTR stands for Handwritten text recognition), by feeding it manual transcriptions, to automatically transcribe two separate (but similar) phonetic hands. The phonetically detailed output reveals structural properties of the dialect that are hard (at best) or impossible (at worst) to retrieve from other sources. In this paper, I show how my research on enclitic pronouns in North Germanic has benefitted from the possibility to search for prosodic dependencies that the digital versions of the dialect texts provide.

Place, publisher, year, edition, pages
Riga, 2020. Vol. 2612, p. 302-309
Series
CEUR Workshop proceedings, ISSN 1613-0073 ; 2612
Keywords [en]
Handwritten Text Recognition, Dialect Texts, Swedish, International Phonetic Alphabet, Speech Assessment Methods Alphabet, Digitization, Enclitic Pronouns
National Category
General Language Studies and Linguistics
Research subject
Language History; Language Technology; Dialectology
Identifiers
URN: urn:nbn:se:sprakochfolkminnen:diva-1788OAI: oai:DiVA.org:sprakochfolkminnen-1788DiVA, id: diva2:1434067
Conference
Digital Humanities in the Nordic Countries (DHN) 5
Projects
TilltalNationella språkbankenAvailable from: 2020-06-02 Created: 2020-06-02 Last updated: 2020-12-01Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

http://ceur-ws.org/Vol-2612/

Authority records

Magnusson Petzell, Erik

Search in DiVA

By author/editor
Magnusson Petzell, Erik
By organisation
Department of Dialectology, Onomastics and Folklore Research, Gothenburg
General Language Studies and Linguistics

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 256 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf