{"id":1236,"date":"2021-07-09T15:42:29","date_gmt":"2021-07-09T15:42:29","guid":{"rendered":"http:\/\/thinking.is.ed.ac.uk\/wir\/?p=1236"},"modified":"2021-07-12T13:58:18","modified_gmt":"2021-07-12T13:58:18","slug":"wikimedia-and-the-diversity-of-languages-online","status":"publish","type":"post","link":"https:\/\/thinking.is.ed.ac.uk\/wir\/wikimedia-and-the-diversity-of-languages-online\/","title":{"rendered":"Wikimedia and the Diversity of Languages online &#8211; Guest post by Clea Strathmann"},"content":{"rendered":"<p><span style=\"font-weight: 400\">Globally,<\/span><a href=\"https:\/\/www.washingtonpost.com\/news\/worldviews\/wp\/2015\/04\/23\/the-worlds-languages-in-7-maps-and-charts\/\"><span style=\"font-weight: 400\"> over 7,000 languages are spoken<\/span><\/a><span style=\"font-weight: 400\"> &#8211; only around 4% of people are native English speakers. Despite this, English holds the title of the \u201c<a href=\"https:\/\/www.statista.com\/statistics\/262946\/share-of-the-most-common-languages-on-the-internet\/\">Language of the internet<\/a>\u201d.&nbsp; It dominates with Chinese<a href=\"https:\/\/medium.com\/digital-diplomacy\/the-internet-is-multilingual-but-you-need-to-learn-mandarin-558c15ab5158\"> almost 50% of global web traffic<\/a> with the top ten languages accounting for <a href=\"https:\/\/www.statista.com\/statistics\/262946\/share-of-the-most-common-languages-on-the-internet\/\">76.9 percent of global internet users<\/a>. The majority of <\/span><a href=\"http:\/\/labs.theguardian.com\/digital-language-divide\/\"><span style=\"font-weight: 400\">African and Indigenous languages <\/span><\/a><span style=\"font-weight: 400\">are not recognised by Google&#8217;s search engine.&nbsp;<\/span><\/p>\n<p><span style=\"font-weight: 400\">When an English speaker searches for something on Google, a Wikipedia article typically appears as a top hit, often as a convenient infobox at the side of the browser. This is because English Wikipedia<\/span><a href=\"https:\/\/en.wikipedia.org\/wiki\/List_of_Wikipedias\"><span style=\"font-weight: 400\"> has over 6 million articles<\/span><\/a><span style=\"font-weight: 400\">. Wikipedias in other languages are more limited &#8211; only two other Wikipedias (Cebuano and Swedish) have over 3 million articles, and the 20 largest Wikipedias have around 1 million entries each. Many of these articles are comparatively shorter than those in English Wikipedia.&nbsp;<\/span><\/p>\n<div id=\"attachment_1256\" style=\"width: 1290px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/thinking.is.ed.ac.uk\/wir\/wp-content\/uploads\/sites\/14\/2021\/07\/langs-2.png\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-1256\" class=\"wp-image-1256 size-full\" src=\"http:\/\/thinking.is.ed.ac.uk\/wir\/wp-content\/uploads\/sites\/14\/2021\/07\/langs-2.png\" alt=\"\" width=\"1280\" height=\"720\" srcset=\"https:\/\/thinking.is.ed.ac.uk\/wir\/wp-content\/uploads\/sites\/14\/2021\/07\/langs-2.png 1280w, https:\/\/thinking.is.ed.ac.uk\/wir\/wp-content\/uploads\/sites\/14\/2021\/07\/langs-2-300x169.png 300w, https:\/\/thinking.is.ed.ac.uk\/wir\/wp-content\/uploads\/sites\/14\/2021\/07\/langs-2-1024x576.png 1024w, https:\/\/thinking.is.ed.ac.uk\/wir\/wp-content\/uploads\/sites\/14\/2021\/07\/langs-2-768x432.png 768w, https:\/\/thinking.is.ed.ac.uk\/wir\/wp-content\/uploads\/sites\/14\/2021\/07\/langs-2-900x506.png 900w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/a><p id=\"caption-attachment-1256\" class=\"wp-caption-text\"><a href=\"https:\/\/meta.wikimedia.org\/wiki\/List_of_Wikipedias_by_language_group\"> Percentage of Wikipedia articles in each language group<\/a> &#8211; Western European language groups dominate Wikipedia.<\/p><\/div>\n<p><span style=\"font-weight: 400\">This lack of diversity restricts a significant portion of the world from access to knowledge that is readily-available to English speakers, and <\/span><a href=\"https:\/\/hbr.org\/2013\/11\/countries-with-better-english-have-better-economies\"><span style=\"font-weight: 400\">disproportionately affects those who live in less-developed regions<\/span><\/a><span style=\"font-weight: 400\"> who may not speak any of the internet\u2019s other dominant languages. Access to knowledge is vital for bridging the understanding between languages and cultures.&nbsp;<\/span><\/p>\n<blockquote>\n<p><span style=\"font-weight: 400\">Knowledge creates understanding \u2013 understanding is sorely lacking in today\u2019s world. \u2013 Katherine Maher, <\/span><span style=\"font-weight: 400\">Executive Director Wikimedia Foundation.&nbsp;<\/span><\/p>\n<\/blockquote>\n<p><span style=\"font-weight: 400\">The United Nations has, as part of their <a href=\"https:\/\/sdgs.un.org\/goals\">sustainable development goals<\/a>, emphasised a need for equitable education and lifelong learning. To enable this, resources of knowledge must be available in all languages. But alongside access to knowledge, the lack of linguistic diversity <\/span><span style=\"font-weight: 400\">is a pressing issue for smaller languages, including indigenous languages <\/span><a href=\"https:\/\/www.un.org\/development\/desa\/en\/news\/social\/indigenous-languages.html\"><span style=\"font-weight: 400\">which are dying out at a rate of two languages per month<\/span><\/a><span style=\"font-weight: 400\">. For speakers of these languages, their extinction may also reflect the extinction of their culture and identity.&nbsp;<\/span><\/p>\n<p><iframe loading=\"lazy\" title=\"Arctic Knot, day 2\" width=\"629\" height=\"354\" src=\"https:\/\/www.youtube.com\/embed\/xVoTUaf8HPc?start=3025&#038;feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n<p><em>Watch Dr. Sara Thomas speak about Scots Wikipedia at the Arctic Knot.<\/em><\/p>\n<p><b>The role of Wikimedia in improving linguistic diversity&nbsp;<\/b><\/p>\n<p><span style=\"font-weight: 400\">Wikipedia is attempting to increase global access to knowledge, and it is one of <\/span><a href=\"https:\/\/wikimediafoundation.org\/news\/2018\/08\/30\/diversity-inclusion-numbers\/#:~:text=As%20we%20look%20ahead%2C%20we,for%20newcomers%20from%20diverse%20backgrounds.\"><span style=\"font-weight: 400\">the aims of The Wikimedia Foundation<\/span><\/a><span style=\"font-weight: 400\"> to ensure that knowledge is diverse, inclusive, and accessible to all. When considering linguistic diversity, the aim is for the number of Wikipedia articles to be evenly distributed across languages. Theoretically, this could be done by simply translating articles from one language Wikipedia into another.&nbsp;<\/span><\/p>\n<p><span style=\"font-weight: 400\">However, translating Wikipedia would not be enough to create linguistic diversity. Take the Game of Thrones article on Welsh \u201cWicipedia\u201d, for instance, which highlights <\/span><a href=\"https:\/\/slate.com\/technology\/2019\/08\/welsh-wikipedia-google-translate.html\"><span style=\"font-weight: 400\">the similarity of the fictional languages in the series<\/span><\/a><span style=\"font-weight: 400\"> to Welsh and emphasises its Welsh actors. This demonstrates the impact of culture on what is important, or not, to the readers of Wikipedia. The relationship between the language and culture is heavily-entangled, and makes it even more important that these are represented and preserved online.&nbsp;<\/span><\/p>\n<p><iframe loading=\"lazy\" title=\"Arctic Knot, day 1\" width=\"629\" height=\"354\" src=\"https:\/\/www.youtube.com\/embed\/f13c3lCghtE?start=3188&#038;feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n<p style=\"text-align: center\"><em>Watch the o<span style=\"font-size: inherit\">pening speeches by&nbsp;<\/span><a class=\"extiw\" style=\"font-size: inherit\" title=\"w:en:Aili Keskitalo\" href=\"https:\/\/en.wikipedia.org\/wiki\/en:Aili_Keskitalo\">Aili Keskitalo<\/a><span style=\"font-size: inherit\">, President of the S\u00e1mi Parliament of Norway and <\/span><a class=\"extiw\" style=\"font-size: inherit\" title=\"en:Guri Melby\" href=\"https:\/\/en.wikipedia.org\/wiki\/Guri_Melby\">Guri Melby<\/a><span style=\"font-size: inherit\">, Minister of Education Norway at the Arctic Knot 2021<\/span><\/em><\/p>\n<p><span style=\"font-weight: 400\">One of the best ways that we can support linguistic diversity is through collaborative efforts with Wikipedia projects. In 2017, The University of Edinburgh and Wikimedia UK started the <\/span><a href=\"https:\/\/wikimedia.org.uk\/wiki\/Celtic_Knot_Conference_2017\"><span style=\"font-weight: 400\">\u2018Celtic Knot\u2019 Wikipedia Language conference<\/span><\/a><span style=\"font-weight: 400\">, which aims to bring together smaller language communities to collaborate on ideas for how to improve the Wikipedia content in these languages and to increase their linguistic presence across other language Wikipedias. The Celtic Knot also developed into the <\/span><a href=\"https:\/\/meta.wikimedia.org\/wiki\/Arctic_Knot_Conference_2021\"><span style=\"font-weight: 400\">Arctic Knot conference,<\/span><\/a><span style=\"font-weight: 400\"> hosted by Wikimedia Norway this year, which aims to improve the visibility of indigenous arctic languages. These conferences allow speakers to address the importance of engaging with their language, and provide practical resources for encouraging contributions to Wikipedia. The <\/span><a href=\"https:\/\/rising.globalvoices.org\/languagetoolkit\/resources\/\"><span style=\"font-weight: 400\">Toolkit for language activism,<\/span><\/a><span style=\"font-weight: 400\"> for instance, supports the creation of digital skills and written language skills which can help people who speak minority languages to contribute to Wikipedia. Through such projects, people are encouraged to contribute to Wikipedia to improve both representation and usability of languages.&nbsp;<\/span><\/p>\n<p><b>Using Wikidata to build linguistic diversity online&nbsp;<\/b><\/p>\n<p><span style=\"font-weight: 400\">From the collaborative efforts of dedicated Wikimedians, communities are already seeing successes in increasing the presence of their languages. But for smaller languages, including many indigenous languages, writing entire Wikipedia articles is challenging and time-consuming. This is where Wikipedia\u2019s sister project &#8211; <\/span><a href=\"https:\/\/www.wikidata.org\/wiki\/Wikidata:Main_Page\"><span style=\"font-weight: 400\">Wikidata <\/span><\/a><span style=\"font-weight: 400\">&#8211; has proven to be an important contributor to improving language diversity online.<\/span><\/p>\n<p><a href=\"http:\/\/thinking.is.ed.ac.uk\/wir\/wp-content\/uploads\/sites\/14\/2021\/07\/query.png\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-1239\" src=\"http:\/\/thinking.is.ed.ac.uk\/wir\/wp-content\/uploads\/sites\/14\/2021\/07\/query.png\" alt=\"\" width=\"1280\" height=\"720\" srcset=\"https:\/\/thinking.is.ed.ac.uk\/wir\/wp-content\/uploads\/sites\/14\/2021\/07\/query.png 1280w, https:\/\/thinking.is.ed.ac.uk\/wir\/wp-content\/uploads\/sites\/14\/2021\/07\/query-300x169.png 300w, https:\/\/thinking.is.ed.ac.uk\/wir\/wp-content\/uploads\/sites\/14\/2021\/07\/query-1024x576.png 1024w, https:\/\/thinking.is.ed.ac.uk\/wir\/wp-content\/uploads\/sites\/14\/2021\/07\/query-768x432.png 768w, https:\/\/thinking.is.ed.ac.uk\/wir\/wp-content\/uploads\/sites\/14\/2021\/07\/query-900x506.png 900w\" sizes=\"auto, (max-width: 1280px) 100vw, 1280px\" \/><\/a><\/p>\n<p><i><span style=\"font-weight: 400\">This chart, <\/span><\/i><a href=\"https:\/\/w.wiki\/3cVK\"><i><span style=\"font-weight: 400\">made using Wikidata<\/span><\/i><\/a><i><span style=\"font-weight: 400\">, shows the amount of Wikipedia articles about Greek citizens that are available on English Wikipedia but not on Greek Wikipedia. The majority are sports players, but it also includes a number of artists and academics.<\/span><\/i><\/p>\n<p><\/p>\n<p><span style=\"font-weight: 400\"><a href=\"https:\/\/www.wikidata.org\/wiki\/Wikidata:Main_Page\">Wikidata<\/a> is a free and open knowledge base of machine-readable facts. Each data item has&nbsp; a unique identifier (a \u2018Q\u2019 number). The label, description and all of the statements within each data item can be labelled in <\/span><b><i>any<\/i><\/b><span style=\"font-weight: 400\"> language and, because of this, the data can be instantly transformed into any language. This means that any search can make this knowledge both discoverable and understandable in any language. Items from Wikidata are important for modern technologies such as <\/span><a href=\"https:\/\/www.wired.com\/story\/inside-the-alexa-friendly-world-of-wikidata\/\"><span style=\"font-weight: 400\">Amazon\u2019s Alexa<\/span><\/a><span style=\"font-weight: 400\"> and Siri, which use Wikidata\u2019s machine-readable entries to answer questions &#8211; but, importantly, these can only provide responses in the languages it is labelled in, and the number of Wikidata language labels, beyond European languages, is scarce.<\/span><\/p>\n<p><span style=\"font-weight: 400\">As an example, take disease and health data, which constitute vital information that needs to be easily-accessible. A search of <\/span><a href=\"https:\/\/w.wiki\/3YJn\"><span style=\"font-weight: 400\">diseases uploaded to Wikidata<\/span><\/a><span style=\"font-weight: 400\"> reveals over 13,000 diseases have been uploaded to the database, but around 5,000 of these entries are only labelled in 1 language. So whilst Wikidata is a useful tool to<\/span> <span style=\"font-weight: 400\">aid knowledge discovery,<\/span> <span style=\"font-weight: 400\">it will take the work of native language speakers from around the world to develop it into the linguistically diverse database that it has the potential to become. In growing both the number of items in Wikidata, and its language labels, technologies can become more accessible for different languages. Ultimately, this is crucial in enabling smaller languages to thrive, rather than just to survive.&nbsp;<\/span><\/p>\n<p><b>What can we do to promote linguistic diversity?<\/b><\/p>\n<p><span style=\"font-weight: 400\">Governments have highlighted the importance of actively increasing linguistic diversity. UNESCO has produced a 10-year plan for the preservation of indigenous languages, referred to as the <\/span><a href=\"https:\/\/en.unesco.org\/news\/upcoming-decade-indigenous-languages-2022-2032-focus-indigenous-language-users-human-rights\"><span style=\"font-weight: 400\">Decade of Indigenous Languages<\/span><\/a><span style=\"font-weight: 400\">, which calls into action the human rights of Indigenous Peoples. A key part of the plan surrounds the use of technology to support access to Indigenous languages &#8211; this can involve the use of Wikipedia and Wikidata as impactful open platforms for building global understanding about different languages and, alongside this, different cultures. Encouraging people to contribute to Wikipedia may seem difficult, but events including the Celtic and Arctic Knot conferences, and outreach projects such as<\/span><a href=\"https:\/\/en.wikipedia.org\/wiki\/Wikipedia:Meetup\/IndigenizeWikipedia#Run_Your_Own_Indigenize_Wikipedia_Event\"><span style=\"font-weight: 400\"> Indigenizing Wikipedia<\/span><\/a><span style=\"font-weight: 400\">, have demonstrated how successfully Wikipedia can be used as a platform for language activism.&nbsp;<\/span><\/p>\n<p><span style=\"font-weight: 400\">By contributing to both Wikipedia and Wikidata, we can increase the use and representation of smaller languages, contributing to the preservation of the important cultures that are intertwined with them.&nbsp;<\/span><\/p>\n<p><strong>Clea Strathmann, Open Data and Knowledge Equity intern<\/strong><\/p>\n<p><a href=\"https:\/\/meta.wikimedia.org\/wiki\/Arctic_Knot_Conference_2021\/Program\">Watch the whole Arctic Knot conference on YouTube here.<\/a><\/p>\n\n\n<p><a href=\"https:\/\/media.ed.ac.uk\/playlist\/dedicated\/51020161\/1_rneev7bo\/1_k7ikooca\" data-type=\"URL\" data-id=\"https:\/\/media.ed.ac.uk\/playlist\/dedicated\/51020161\/1_rneev7bo\/1_k7ikooca\">Welsh Wikipedia Thinking Big &#8211; Keynote address by Jason Evans at the Celtic Knot<\/a><\/p>\n\n\n\n<p><a href=\"https:\/\/media.ed.ac.uk\/playlist\/dedicated\/51020161\/1_rneev7bo\/1_k1k6ir4i\">A state of the question &#8211; the Catalan language project &#8211; \u00c0lex Hinojo, Executive Director, Amical Wikimedia<\/a><\/p>\n\n\n\n<p><a href=\"https:\/\/media.ed.ac.uk\/playlist\/dedicated\/51020161\/1_rneev7bo\/1_8g4uce40\">The Scottish Gaelic Uicipeid project &#8211; Susan Ross at the Celtic Knot<\/a><\/p>\n\n\n\n<p><a href=\"https:\/\/media.ed.ac.uk\/playlist\/dedicated\/51020161\/1_rneev7bo\/1_0kn68p62\">Celtic Knot &#8211; Panel discussion &amp; closing plenary: The Politics of Language Online<\/a><\/p>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Globally, over 7,000 languages are spoken &#8211; only around 4% of people are native English speakers. Despite this, English holds the title of the \u201cLanguage of the internet\u201d.&nbsp; It dominates with Chinese almost 50% of global web traffic with the top ten languages accounting for 76.9 percent of global internet users. The majority of African [&hellip;]<\/p>\n","protected":false},"author":41,"featured_media":1258,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[1],"tags":[962,472,458,963,820,818,478,957,964,965,966,817,88,10],"class_list":["post-1236","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized","tag-arctic-knot","tag-celtic-knot","tag-celtic-languages","tag-clea-strathmann","tag-dlam-feed","tag-dlamfeed","tag-indigenous-languages","tag-isintern","tag-language-diversity","tag-linguistic-diversity","tag-minority-languages","tag-openedfeed","tag-wikidata","tag-wikipedia"],"_links":{"self":[{"href":"https:\/\/thinking.is.ed.ac.uk\/wir\/wp-json\/wp\/v2\/posts\/1236","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/thinking.is.ed.ac.uk\/wir\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/thinking.is.ed.ac.uk\/wir\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/thinking.is.ed.ac.uk\/wir\/wp-json\/wp\/v2\/users\/41"}],"replies":[{"embeddable":true,"href":"https:\/\/thinking.is.ed.ac.uk\/wir\/wp-json\/wp\/v2\/comments?post=1236"}],"version-history":[{"count":11,"href":"https:\/\/thinking.is.ed.ac.uk\/wir\/wp-json\/wp\/v2\/posts\/1236\/revisions"}],"predecessor-version":[{"id":1259,"href":"https:\/\/thinking.is.ed.ac.uk\/wir\/wp-json\/wp\/v2\/posts\/1236\/revisions\/1259"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/thinking.is.ed.ac.uk\/wir\/wp-json\/wp\/v2\/media\/1258"}],"wp:attachment":[{"href":"https:\/\/thinking.is.ed.ac.uk\/wir\/wp-json\/wp\/v2\/media?parent=1236"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/thinking.is.ed.ac.uk\/wir\/wp-json\/wp\/v2\/categories?post=1236"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/thinking.is.ed.ac.uk\/wir\/wp-json\/wp\/v2\/tags?post=1236"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}