News

Content translation tool helps create over half a million Wikipedia articles

In science fiction, the Encyclopedia Galactica is a compendium of a galaxy’s worth of knowledge.

Wikipedia isn’t quite there yet—for one, we’ve barely left Earth. However, that doesn’t mean we aren’t trying to put together a planet’s worth of knowledge and ensure that all of its inhabitants can learn from it in their own languages.

That’s where our content translation tool comes in. The tool simplifies translating Wikipedia articles into different languages by automating many of the tedious steps inherent in manually translating Wikipedia’s articles. As of last month, more than half a million articles have been created with the help of the content translation tool since its introduction.

Why did we build this tool? It’s simple: translating Wikipedia’s knowledge into new languages can help reduce our knowledge gap. For example, English-speaking users can access more than five million articles. Bengali speakers, a language of 260 million people, have access to a mere 75,000.

The content translation tool has not been easy to construct, maintain, and grow. We’ve spent four years constructing and fine-tuning it to best fit Wikipedia’s decentralized model, ensuring that it fits into all of Wikimedia’s many individual wikis which have grown and developed their own custom infrastructure to suit their local contexts.

More recently, the Wikimedia Foundation has worked to modernize the content translation tool with a rich-text interface and improve its output by utilizing artificial intelligence. The tool now provides users with an initial machine translation for them to improve prior to publishing, and incorporates safeguards which help ensure that all untouched machine translations are reviewed. We’ve partnered with Google to ensure that these automatically translated items are as high-quality as possible, and we’ve ensured that none of our users’ data is being passed to Google in the process.

We’re liking what we’re seeing from the newly improved content translation tool. In the last year, it has been used to translate nearly 150,000 articles, an over twenty percent increase in the year-over-year number. Moreover, our data shows that translated articles are less likely to be deleted than articles created from scratch.

What’s next for the tool? As part of the Wikimedia Foundation’s medium-term plan, released earlier this year, the Foundation’s language team will be focusing on two key areas: ensuring that the full suite of translation tools can be used by as many different language Wikimedia wikis as possible, and in expanding the kinds of contributions that the content translation tool supports. They’ll take on the former before tackling the latter, starting with the Malayalam, Bengali, Tagalog, Javanese, and Mongolian languages.

Pau Giner, Lead UX Designer, Product Design
Wikimedia Foundation

For more information about our content translation tool’s history and future, please see an expanded blog post on Wikimedia Space, the new site built for news, questions, and conversations within the Wikimedia movement.

Related

Read further in the pursuit of knowledge

Google and Wikimedia Foundation partner to increase knowledge equity online

From our beginning, Wikipedia has aspired to create a world where knowledge is free, accessible and useful to everyone, in every language, in every country, and across any device. Although the volunteers who make Wikipedia possible have made great strides towards achieving this vision, we’re still a long way from getting there. They say Rome….

Read more

You can now use Google Translate to translate articles on Wikipedia

Studies estimate that there are more than 7,000 languages spoken around the world. Wikipedia exists in about 300 of them. That’s about 4 percent of some of the world’s languages documenting some of the world’s knowledge. Consider the Arabic language. With more than 420 million speakers, it’s one of the most widely spoken languages in….

Read more

Help us unlock the world’s knowledge.

As a nonprofit, Wikipedia and our related free knowledge projects are powered primarily through donations.

Donate now

Contact us

Questions about the Wikimedia Foundation or our projects? Get in touch with our team.
Contact

Photo credits