Selected publications
Full list in: Google Scholar, ACL Anthology or Semantic Scholar
- Revisiting Syllables in Language Modelling and Their Application on Low-Resource Machine Translation
Arturo Oncevay, Kervy Dante Rivas Rojas, Liz Karen Chavez Sanchez, Roberto Zariquiey. In COLING 2022. 2022.
- Quantifying Synthesis and Fusion and their Impact on Machine Translation
Arturo Oncevay, Duygu Ataman, Niels van Berkel, Barry Haddow, Alexandra Birch, Johannes Bjerva. In NAACL 2022. 2022.
- BPE vs Morphological Segmentation: A Case Study on Machine Translation of Four Polysynthetic Languages
Manuel Mager, Arturo Oncevay, Elisabeth Mager, Katharina Kann, Ngoc Thang Vu. In Findings of the Association for Computational Linguistics: ACL 2022. 2022.
- AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages
Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John Ortega, Ricardo Ramos, Annette Rios, Ivan Vladimir Meza Ruiz, Gustavo Giménez-Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando Coto-Solano, Thang Vu, Katharina Kann. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2022.
- Findings of the AmericasNLP 2021 Shared Task on Open Machine Translation for Indigenous Languages of the Americas
Manuel Mager, Arturo Oncevay, Abteen Ebrahimi, John Ortega, Annette Rios, Angela Fan, Ximena Gutierrez-Vasques, Luis Chiruzzo, Gustavo Giménez-Lugo, Ricardo Ramos, Ivan Vladimir Meza Ruiz, Rolando Coto-Solano, Alexis Palmer, Elisabeth Mager-Hois, Vishrav Chaudhary, Graham Neubig, Ngoc Thang Vu, Katharina Kann. In Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas. 2021.
- Peru is Multilingual, Its Machine Translation Should Be Too?
Arturo Oncevay. In Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas. 2021.
- Bridging Linguistic Typology and Multilingual Machine Translation with Multi-View Language Representations
Arturo Oncevay, Barry Haddow, Alexandra Birch. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. 2020.
- Efficient Strategies for Hierarchical Text Classification: External Knowledge and Auxiliary Tasks
Kervy Rivas Rojas, Gina Bustamante, Arturo Oncevay, Marco Antonio Sobrevilla Cabezudo. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020.
- No Data to Crawl? Monolingual Corpus Creation from PDF Files of Truly low-Resource Languages in Peru
Gina Bustamante, Arturo Oncevay, Roberto Zariquiey. In Proceedings of the 12th Language Resources and Evaluation Conference. 2020.