Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
From word clouds to Word Rain: Revisiting the classic word cloud to visualize climate change texts
Centre for Digital Humanities and Social Sciences Uppsala, Department of ALM, Uppsala University, Uppsala, Sweden.ORCID iD: 0000-0001-6164-7762
Institute for Language and Folklore, Språkrådet. Language Council of Sweden, Institute for Language and Folklore, Stockholm, Sweden.ORCID iD: 0000-0001-6573-4636
Department of Science and Technology, Linköping University, Norrköping, Sweden.ORCID iD: 0000-0002-1907-7820
Centre for Digital Humanities and Social Sciences Uppsala, Department of ALM, Uppsala University, Uppsala, Sweden.
2024 (English)In: Information Visualization, ISSN 1473-8716, E-ISSN 1473-8724, Vol. 23, no 3, p. 217-238Article in journal (Refereed) Epub ahead of print
Abstract [en]

Word Rain is a development of the classic word cloud. It addresses some of the limitations of word clouds, in particular the lack of a semantically motivated positioning of the words, and the use of font size as a sole indicator of word prominence. Word Rain uses the semantic information encoded in a distributional semantics-based language model – reduced into one dimension – to position the words along the x-axis. Thereby, the horizontal positioning of the words reflects semantic similarity. Font size is still used to signal word prominence, but this signal is supplemented with a bar chart, as well as with the position of the words on the y-axis. We exemplify the use of Word Rain by three concrete visualization tasks, applied on different real-world texts and document collections on climate change. In these case studies, word2vec models, reduced to one dimension with t-SNE, are used to encode semantic similarity, and TF-IDF is used for measuring word prominence. We evaluate the technique further by carrying out domain expert reviews.

Place, publisher, year, edition, pages
Sage Publications, 2024. Vol. 23, no 3, p. 217-238
Keywords [en]
word cloud, tag cloud, text visualization, digital humanities, climage change data, text and document data
Keywords [sv]
ordmoln, taggmoln, textvisualisering, digital humaniora, klimatförändringsdata, text- och dokumentdata
National Category
Natural Language Processing
Research subject
Language Technology
Identifiers
URN: urn:nbn:se:sprakochfolkminnen:diva-2738DOI: 10.1177/14738716241236188OAI: oai:DiVA.org:sprakochfolkminnen-2738DiVA, id: diva2:1847885
Available from: 2024-03-30 Created: 2024-03-30 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

skeppstedt-et-al-2024-from-word-clouds-to-word-rain-revisiting-the-classic-word-cloud-to-visualize-climate-change-texts.pdf(5784 kB)10 downloads
File information
File name FULLTEXT01.pdfFile size 5784 kBChecksum SHA-512
6dfcecfce024d7517a10a9cc1adb6a66a7a1bace9e1ab0b0633f2862222bdf928fd6f14550b145ab87d0f41f38af0c2e90c2856a233da96b14bbab72ee3730dc
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Authority records

Ahltorp, Magnus

Search in DiVA

By author/editor
Skeppstedt, MariaAhltorp, MagnusKucher, Kostiantyn
By organisation
Språkrådet
In the same journal
Information Visualization
Natural Language Processing

Search outside of DiVA

GoogleGoogle Scholar
Total: 10 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 470 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf