From word clouds to Word Rain: Revisiting the classic word cloud to visualize climate change texts
2024 (English)In: Information Visualization, ISSN 1473-8716, E-ISSN 1473-8724, Vol. 23, no 3, p. 217-238Article in journal (Refereed) Epub ahead of print
Abstract [en]
Word Rain is a development of the classic word cloud. It addresses some of the limitations of word clouds, in particular the lack of a semantically motivated positioning of the words, and the use of font size as a sole indicator of word prominence. Word Rain uses the semantic information encoded in a distributional semantics-based language model – reduced into one dimension – to position the words along the x-axis. Thereby, the horizontal positioning of the words reflects semantic similarity. Font size is still used to signal word prominence, but this signal is supplemented with a bar chart, as well as with the position of the words on the y-axis. We exemplify the use of Word Rain by three concrete visualization tasks, applied on different real-world texts and document collections on climate change. In these case studies, word2vec models, reduced to one dimension with t-SNE, are used to encode semantic similarity, and TF-IDF is used for measuring word prominence. We evaluate the technique further by carrying out domain expert reviews.
Place, publisher, year, edition, pages
Sage Publications, 2024. Vol. 23, no 3, p. 217-238
Keywords [en]
word cloud, tag cloud, text visualization, digital humanities, climage change data, text and document data
Keywords [sv]
ordmoln, taggmoln, textvisualisering, digital humaniora, klimatförändringsdata, text- och dokumentdata
National Category
Natural Language Processing
Research subject
Language Technology
Identifiers
URN: urn:nbn:se:sprakochfolkminnen:diva-2738DOI: 10.1177/14738716241236188OAI: oai:DiVA.org:sprakochfolkminnen-2738DiVA, id: diva2:1847885
2024-03-302024-03-302025-02-07Bibliographically approved