{"id":3504,"date":"2020-01-09T00:00:00","date_gmt":"2020-01-09T00:00:00","guid":{"rendered":"http:\/\/pl-asapbio.local\/panlingua\/"},"modified":"2025-03-28T21:37:26","modified_gmt":"2025-03-28T21:37:26","slug":"panlingua","status":"publish","type":"post","link":"https:\/\/asapbio.org\/panlingua\/","title":{"rendered":"Search for preprints in your native language with PanLingua"},"content":{"rendered":"<p>Humberto Debat<sup>1<\/sup> &amp; Richard Abdill<sup>2<\/sup><\/p>\n<p class=\"has-small-font-size\"><sup>1 <\/sup>National Institute of Agricultural Technology (IPAVE-CIAP-INTA), C\u00f3rdoba, Argentina; 0000-0003-3056-3739; @humbertodebat<sup>&nbsp;<\/sup><br \/><sup>2 <\/sup>University of Minnesota, Minneapolis, Minnesota, United States; 0000-0001-9565-5832; @richabdill&nbsp;<\/p>\n<p>The majority of scholarly work in biology is published in English, a language <a href=\"https:\/\/en.wikipedia.org\/wiki\/List_of_languages_by_number_of_native_speakers\" target=\"_blank\" rel=\"noopener\">most of the world<\/a> does not speak. To help remediate this key issue hindering <a href=\"https:\/\/journals.plos.org\/plosbiology\/article?id=10.1371\/journal.pbio.2000933\" target=\"_blank\" rel=\"noopener\">inclusive scientific dialogue<\/a>, we built <a href=\"https:\/\/panlingua.rxivist.org\" target=\"_blank\" rel=\"noopener\">PanLingua<\/a>, a multilingual preprint search tool intended to enable search and global access to machine translations of all preprints hosted by <a href=\"https:\/\/www.biorxiv.org\/\" target=\"_blank\" rel=\"noopener\">bioRxiv.org<\/a>: users can enter search terms in their native language and view search results linking to the full text of all available manuscripts, translated into more than 100 languages. The tool is <a href=\"https:\/\/github.com\/blekhmanlab\/panlingua\" target=\"_blank\" rel=\"noopener\">open source<\/a>, and we welcome feedback from users about how you\u2019re using the tool or how it could be improved.<\/p>\n<p>Over the last five years, researchers in the life sciences have embraced preprints <a href=\"https:\/\/doi.org\/10.7554\/eLife.45133\" target=\"_blank\" rel=\"noopener\">like never before<\/a>, sharing their results online before publication in conventional journals. The <a href=\"https:\/\/asapbio.org\/preprint-info\/biology-preprints-over-time\">wide majority<\/a> of these manuscripts now appear on bioRxiv, where one of the only requirements for acceptance is that <a href=\"https:\/\/www.biorxiv.org\/about\/FAQ\" target=\"_blank\" rel=\"noopener\">all manuscripts must be written in English<\/a>, a criterion shared by most of the world\u2019s largest and most popular scientific journals. So while there is much discussion about broadening access to scientific literature, the debate is essentially about access to <em>English-language<\/em> scientific literature, an omission that obfuscates a key <a href=\"https:\/\/www.tandfonline.com\/doi\/abs\/10.2167\/cilp084.0\" target=\"_blank\" rel=\"noopener\">obstacle<\/a> to those seeking the knowledge available within scholarly articles.<\/p>\n<p>There is much work needed to balance the asymmetries of scientific discourse, which currently flows mostly from the North to the Global South. Automatic translation platforms have greatly improved their efficacy in the last few years and represent a valuable opportunity for readers to grasp the essence of a text written in a language they may not speak, as observed last year by Daniel Prieto, our colleague from Uruguay, in his call for the scientific community \u201cto develop <a href=\"https:\/\/www.nature.com\/articles\/d41586-018-05844-0\" target=\"_blank\" rel=\"noopener\">a comprehensive <strong>multi-language translation<\/strong> <strong>tool<\/strong><\/a> with the help of services such as Google Translate\u2026 [to] enable international researchers to access regional databases not compiled in English.\u201d<\/p>\n<p><a href=\"https:\/\/translate.google.com\" target=\"_blank\" rel=\"noopener\">Google Translate<\/a> is already capable of providing passable translations of individual articles, but readers do not have a convenient way to <em>search<\/em> the vast collection of scholarly outputs in other languages. This is the modest improvement offered by PanLingua in the broader challenge of reducing language barriers in <a href=\"https:\/\/journals.plos.org\/plosbiology\/article?id=10.1371\/journal.pbio.2000933\" target=\"_blank\" rel=\"noopener\">scientific dialogue<\/a>. We encourage others to develop similar tools \u2014 not only to broaden access to English-language scientific works, but also the evident counter-platform, which would help English speakers search the millions of non-English scientific literature available \u2014 for instance, the 79 million articles published in <a href=\"https:\/\/www.nature.com\/articles\/d41586-018-05235-5\" target=\"_blank\" rel=\"noopener\">Chinese<\/a>, or the 1.5 million open-access works in Portuguese, Spanish, French and other languages available at <a href=\"http:\/\/www.lareferencia.info\/en\/\" target=\"_blank\" rel=\"noopener\">LA Referencia<\/a>.<\/p>\n<p><strong>How it works<\/strong><\/p>\n<p>In short, most of the work is done by Google and bioRxiv:<\/p>\n<ol class=\"wp-block-list\">\n<li>A user arrives at <a href=\"https:\/\/panlingua.rxivist.org\/\" target=\"_blank\" rel=\"noopener\">panlingua.rxivist.org<\/a>. They are presented with a search box and a list of languages supported by the Google Cloud Translate API.<\/li>\n<li>The user inputs a search term in their chosen language and submits the form.<\/li>\n<li>The user\u2019s input is sent to the Google Cloud Translate API, which provides an English translation of the search term.<\/li>\n<li>The translated search term is used to generate a URL of the standard bioRxiv search.<\/li>\n<li>The generated bioRxiv URL is passed to translate.google.com, which provides a translated version of that page in whatever language was originally selected by the user.<\/li>\n<li>The user is redirected to the translate.google.com page with the search results.<\/li>\n<li>Links within the translated search results point to the translated versions of each paper, which means the user is now in an environment where full translated versions of the selected articles are available on their own language.<\/li>\n<\/ol>\n<figure class=\"wp-block-image size-large\"><img fetchpriority=\"high\" decoding=\"async\" width=\"1024\" height=\"347\" src=\"https:\/\/asapbio.org\/wp-content\/uploads\/2025\/03\/Screenshot-2020-01-09-at-19.30.53-2-1024x347.png\" alt=\"PanLingua homepage with 'Patogenos resistentes' typed into search bar for Spanish language search\" class=\"wp-image-4086\" srcset=\"https:\/\/asapbio.org\/wp-content\/uploads\/2025\/03\/Screenshot-2020-01-09-at-19.30.53-2-1024x347.png 1024w, https:\/\/asapbio.org\/wp-content\/uploads\/2025\/03\/Screenshot-2020-01-09-at-19.30.53-2-300x102.png 300w, https:\/\/asapbio.org\/wp-content\/uploads\/2025\/03\/Screenshot-2020-01-09-at-19.30.53-2-768x261.png 768w, https:\/\/asapbio.org\/wp-content\/uploads\/2025\/03\/Screenshot-2020-01-09-at-19.30.53-2-1536x521.png 1536w, https:\/\/asapbio.org\/wp-content\/uploads\/2025\/03\/Screenshot-2020-01-09-at-19.30.53-2.png 1910w\" sizes=\"(max-width: 1024px) 100vw, 1024px\"><\/figure>\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"669\" src=\"https:\/\/asapbio.org\/wp-content\/uploads\/2025\/03\/Screenshot-2020-01-09-at-19.31.18-2-1024x669.png\" alt=\"google search bar in english at top with biorxiv results page in spanish below\" class=\"wp-image-4087\" srcset=\"https:\/\/asapbio.org\/wp-content\/uploads\/2025\/03\/Screenshot-2020-01-09-at-19.31.18-2-1024x669.png 1024w, https:\/\/asapbio.org\/wp-content\/uploads\/2025\/03\/Screenshot-2020-01-09-at-19.31.18-2-300x196.png 300w, https:\/\/asapbio.org\/wp-content\/uploads\/2025\/03\/Screenshot-2020-01-09-at-19.31.18-2-768x502.png 768w, https:\/\/asapbio.org\/wp-content\/uploads\/2025\/03\/Screenshot-2020-01-09-at-19.31.18-2-1536x1004.png 1536w, https:\/\/asapbio.org\/wp-content\/uploads\/2025\/03\/Screenshot-2020-01-09-at-19.31.18-2.png 1974w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\"><\/figure>\n<p>PanLingua owes its name to <a href=\"https:\/\/en.wikipedia.org\/wiki\/Xul_Solar\" target=\"_blank\" rel=\"noopener\">Xul Solar<\/a>, an Argentine artist, writer, and inventor who <a href=\"http:\/\/universossonoros.blogspot.com\/2012\/11\/xul-solar-la-musica-traves-del-lienzo_26.html\" target=\"_blank\" rel=\"noopener\">described <\/a>himself as \u201cmaster of a script that nobody yet reads\u2026<strong> creator of a universal language called Panlingua based on numbers and astrology that will help people know each other better<\/strong>.\u201d While our tool does not take Xul\u2019s route of enabling dialogue through a universal language, we believe available translation technology can serve as an effective intermediary to bond together the <a href=\"https:\/\/www.helsinki-initiative.org\/en\" target=\"_blank\" rel=\"noopener\">diversity of languages<\/a> linked to scientific endeavor.<\/p>\n<p>The use of machine translation in science is predicated on a straightforward notion: on the contrary to non-scientific literature, where translation is judged by aesthetics, the central goal of translated scientific literature is <strong>legibility<\/strong>. There are innumerable ways to translate a poem, for example: its richness goes beyond words and involves nuanced aesthetic resources requiring human discretion to interpolate. Scientific papers, on the other hand, can be comprehended with a much more literal translation \u2014 something machines can already do, at least for some of Google Translate\u2019s 104 supported languages. \u201cClose enough\u201d is not ideal, and aspects of the original text can be lost, confounded, or tergiversated, but automatic translation at least allows users to arrive at the existence of the translated work and get a general idea of its content. Tools such as Google Translate are <a href=\"https:\/\/www.wired.com\/2016\/09\/google-claims-ai-breakthrough-machine-translation\/\" target=\"_blank\" rel=\"noopener\">evolving<\/a> at such speed that is not na\u00efve to believe that they have <a href=\"https:\/\/www.cambridge.org\/core\/journals\/political-analysis\/article\/no-longer-lost-in-translation-evidence-that-google-translate-works-for-comparative-bagofwords-text-applications\/43CB03805973BB8AD567F7AE50E72CA6\" target=\"_blank\" rel=\"noopener\">grasped<\/a> the threshold of basic legibility.<\/p>\n<p>Science is a shared enterprise, a global endeavor enriched by a multiplicity of visions, realities and languages. Everyone benefits from the development of a more inclusive ecosystem, and seamless international scholarly discourse is a real possibility. Many barriers are stopping this utopia; let us remember language.<\/p>\n<p><em>Find PanLingua at <\/em><a href=\"https:\/\/panlingua.rxivist.org\/\" target=\"_blank\" rel=\"noopener\">https:\/\/panlingua.rxivist.org\/<\/a><em>. The code for PanLingua is available from GitHub (<a href=\"https:\/\/github.com\/blekhmanlab\/panlingua\" target=\"_blank\" rel=\"noopener\">https:\/\/github.com\/blekhmanlab\/panlingua<\/a>) and archived on Zenodo, DOI: <a href=\"https:\/\/doi.org\/10.5281\/zenodo.3601512\" target=\"_blank\" rel=\"noopener\">10.5281\/zenodo.3601512<\/a>.<\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Humberto Debat1 &amp; Richard Abdill2 1 National Institute of Agricultural Technology (IPAVE-CIAP-INTA), C\u00f3rdoba, Argentina; 0000-0003-3056-3739; @humbertodebat&nbsp;2 University of Minnesota, Minneapolis, Minnesota, United States; 0000-0001-9565-5832; @richabdill&nbsp; The majority of scholarly work [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":2273,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[42],"tags":[],"class_list":["post-3504","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-guest-posts"],"acf":[],"_links":{"self":[{"href":"https:\/\/asapbio.org\/wp-json\/wp\/v2\/posts\/3504","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/asapbio.org\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/asapbio.org\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/asapbio.org\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/asapbio.org\/wp-json\/wp\/v2\/comments?post=3504"}],"version-history":[{"count":1,"href":"https:\/\/asapbio.org\/wp-json\/wp\/v2\/posts\/3504\/revisions"}],"predecessor-version":[{"id":3505,"href":"https:\/\/asapbio.org\/wp-json\/wp\/v2\/posts\/3504\/revisions\/3505"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/asapbio.org\/wp-json\/wp\/v2\/media\/2273"}],"wp:attachment":[{"href":"https:\/\/asapbio.org\/wp-json\/wp\/v2\/media?parent=3504"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/asapbio.org\/wp-json\/wp\/v2\/categories?post=3504"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/asapbio.org\/wp-json\/wp\/v2\/tags?post=3504"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}