Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Annotating risk factor mentions in the COVID-19 Open Research Dataset
Institute for Language and Folklore.
Institute for Language and Folklore, Språkrådet.ORCID iD: 0000-0001-6573-4636
Institute for Language and Folklore, Språkrådet.ORCID iD: 0000-0001-6949-6380
Institute for Language and Folklore, Språkrådet.
2020 (English)In: Proceedings of CLARIN Annual Conference 2020 / [ed] Costanza Navarretta and Maria Eskevich, 2020, p. 52-55Conference paper, Oral presentation with published abstract (Refereed)
Abstract [en]

We here describe the creation of manually annotated training data for the Kaggle task “What do we know about COVID-19 risk factors?”. We applied our text mining tool on the “COVID-19 Open Research Dataset” to i) select data for manual annotation, ii) classify the data into initially established classification categories, and iii) analyse our data set in search for potential refinements of the annotation categories. The process resulted in a corpus consisting of 50,000 tokens, for which each token is annotated as to whether it is part of an expression that functions as a “risk factor trigger”. Two types of risk factor triggers were annotated, those indicating that the text describes a risk factor, and those indicating that something could not be shown to be a risk factor.

Place, publisher, year, edition, pages
2020. p. 52-55
National Category
Languages and Literature
Research subject
Language Technology
Identifiers
URN: urn:nbn:se:sprakochfolkminnen:diva-1817OAI: oai:DiVA.org:sprakochfolkminnen-1817DiVA, id: diva2:1511146
Conference
CLARIN Annual Conference
Available from: 2020-12-17 Created: 2020-12-17 Last updated: 2023-12-01Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

CLARIN Annual Conference 2020 PROCEEDINGS

Authority records

Skeppstedt, MariaAhltorp, MagnusEriksson, GunnarDomeij, Rickard

Search in DiVA

By author/editor
Skeppstedt, MariaAhltorp, MagnusEriksson, GunnarDomeij, Rickard
By organisation
Institute for Language and FolkloreSpråkrådet
Languages and Literature

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 118 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf