List of publications and related works
2024
Introduction to Multilingual and Multicultural NLP
Cristina España-Bonet
Tutorial at the DisAI Summer School on trustworthy, multilingual and multimodal AI, September 3rd-6th, Bratislava, Slovakia, 2024.
Sign Language Translation with Sentence Embedding Supervision
Yasser Hamidullah, Josef van Genabith and Cristina España-Bonet
In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Association for Computational Linguistics, pages 425-434. Bangkok, Thailand, August 2024.
[
Abstract
PDF
Poster
Slides
BibTeX
]
State-of-the-art sign language translation systems facilitate the learning task through gloss annotations, either in an end2end manner or by involving an intermediate step. Unfortunately, gloss labelled sign language data is usually not available at scale and, when available, gloss annotations widely differ from dataset to dataset. We present a novel approach using sentence embeddings of the target sentences at training time that take the role of glosses. The new kind of supervision does not need any manual human annotation but is learned on raw textual data. As our approach easily facilitates multilinguality, we evaluate it on datasets covering German (PHOENIX-2014T) and American (How2Sign) sign languages and experiment with mono- and multilingual sentence embeddings and translation systems. Our approach significantly outperforms other gloss-free approaches, setting the new state-of-the-art for data sets where glosses are not available, and diminishing the gap between gloss-free and gloss-dependent systems.
@inproceedings{alt-sentence-embeddings,
title = "Sign Language Translation with Sentence Embedding Supervision",
author = {Hamidullah, Yasser and van Genabith, Josef and Espa{\~n}a-Bonet, Cristina},
booktitle = "Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)",
month = aug,
year = "2024",
address = "Bangkok, Thailand",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.acl-short.40",
pages = "425--434"
}
Elote, Choclo and Mazorca: on the Varieties of Spanish
Cristina España-Bonet and Alberto Barrón-Cedeño
In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2024), Association for Computational Linguistics, pages 3689-3711. Mexico City, Mexico, June 2024.
[
Abstract
PDF
Poster
Slides
BibTeX
]
Spanish is one of the most widespread languages: the official language in 20 countries and the second most-spoken native language. Its contact with other languages across different regions and the rich regional and cultural diversity has produced varieties which divert from each other, particularly in terms of lexicon. Still, available corpora, and models trained upon them, generally treat Spanish as one monolithic language, which hampers prediction and generation power when dealing with different varieties. To alleviate the situation, we compile and curate datasets in the different varieties of Spanish around the world at an unprecedented scale and create the CEREAL corpus. With such a resource at hand, we perform a stylistic analysis to identify and characterise varietal differences. We implement a classifier specially designed to deal with long documents and identify Spanish varieties (and therefore expand CEREAL further). We produce varietal-specific embeddings, and analyse the cultural differences that they encode. We make data, code and models publicly available.
@inproceedings{elote-varieties,
title = "Elote, Choclo and Mazorca: on the Varieties of Spanish",
author = "Espa{\~n}a-Bonet, Cristina and Barr\'on-Cede{\~n}o, Alberto",
booktitle = "Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",
month = jun,
year = "2024",
address = "Mexico City, Mexico",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.naacl-long.204.pdf",
pages = "3689--3711"
}
When Elote, Choclo and Mazorca are not the Same. Isomorphism-Based Perspective to the Spanish Varieties Divergences
Cristina España-Bonet, Ankur Bhatt, Koel Dutta Chowdhury, Alberto Barrón-Cedeño
In Proceedings of the Eleventh Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2024), Association for Computational Linguistics, pages 56-77. Mexico City, Mexico, June 2024.
[
Abstract
PDF
Slides
BibTeX
]
Spanish is an official language in 20 countries; in 19 of them, it arrived by means of overseas colonisation. Its close contact with several coexisting languages and the rich regional and cultural diversity has produced varieties that divert from each other. We study these divergences with a data-based approach and according to their qualitative and quantitative effects in word embeddings. We generate embeddings for Spanish in 24 countries and examine the topology of the spaces. Due to the similarities between varieties —in contrast to what happens to different languages in bilingual topological studies— we first scrutinise the behaviour of three isomorphism measures in (quasi-)isomorphic settings: relational similarity, Eigenvalue similarity, and Gromov-Hausdorff distance. We then use the most trustworthy measure to quantify the divergences among varieties. Finally, we use the departures from isomorphism to build relational trees for the Spanish varieties by hierarchical clustering, and observe that voseo is the phenomenon that leaves the strongest imprint in the embeddings.
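The Eigenvalue similarity mentioned in the abstract can be illustrated with a short sketch: compare the spectra of the graph Laplacians built from two embedding spaces, where identical spectra indicate (quasi-)isomorphic spaces. This is not the paper's implementation; the cosine-weighted similarity graph and the parameter k are assumptions made here for illustration only.

```python
import numpy as np

def eigenvalue_similarity(X, Y, k=10):
    """Sum of squared differences between the k smallest Laplacian
    eigenvalues of the similarity graphs of two embedding matrices.
    Lower values suggest more nearly isomorphic spaces."""
    def laplacian_eigs(E):
        # cosine-similarity graph over the embedding rows
        E = E / np.linalg.norm(E, axis=1, keepdims=True)
        A = E @ E.T
        np.fill_diagonal(A, 0.0)
        A = np.clip(A, 0.0, None)          # keep positive weights only
        L = np.diag(A.sum(axis=1)) - A     # unnormalised graph Laplacian
        return np.linalg.eigvalsh(L)       # ascending eigenvalues
    ex, ey = laplacian_eigs(X), laplacian_eigs(Y)
    m = min(k, len(ex), len(ey))
    return float(np.sum((ex[:m] - ey[:m]) ** 2))
```

Comparing a space with itself yields 0; larger values quantify the departure from isomorphism that the paper uses to cluster varieties.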
@inproceedings{elote-isomorphism,
title = "When Elote, Choclo and Mazorca are not the Same. Isomorphism-Based Perspective to the Spanish Varieties Divergences",
author = "Espa{\~n}a-Bonet, Cristina and Bhatt, Ankur and Chowdhury, Koel Dutta and Barr\'on-Cede{\~n}o, Alberto",
booktitle = "Proceedings of the Eleventh Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2024)",
month = jun,
year = "2024",
address = "Mexico City, Mexico",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.vardial-1.5.pdf",
pages = "56--77"
}
Mitigating Translationese with GPT-4: Strategies and Performance
Maria Kunilovskaya, Koel Dutta Chowdhury, Heike Przybyl, Cristina España-Bonet, Josef van Genabith
In Proceedings of the 25th Annual Conference of the European Association for Machine Translation (EAMT 2024), pages 411-430. Sheffield, United Kingdom, June 2024.
[
Abstract
PDF
Poster
Slides
BibTeX
]
Translations into a language differ in systematic ways from text originally authored in the same language. These differences, collectively known as translationese, can pose challenges in cross-lingual natural language processing: models trained or tested on translated input might struggle when presented with non-translated language. This study investigates the generative capacities of GPT-4 to reduce translationese in human-translated texts. The task is framed as a rewriting process aimed at a translation variant indistinguishable from the original text in the target language. The focus of the paper is on prompt engineering that tests the utility of linguistic knowledge as part of the instruction for the LLM. Through a series of prompt design experiments, we show that the GPT4-generated revisions are more similar to originals in the target language when the prompts incorporate specific linguistic instructions instead of relying solely on the model's internal knowledge. We release the segment-aligned bidirectional German-English data built from the Europarl corpus that underpins this study.
@inproceedings{mitigating-translationese-gpt4,
title = "Mitigating Translationese with GPT-4: Strategies and Performance",
author = "Kunilovskaya, Maria and Chowdhury, Koel Dutta and Przybyl, Heike and Espa{\~n}a-Bonet, Cristina and van Genabith, Josef",
booktitle = "Proceedings of the 25th Annual Conference of the European Association for Machine Translation",
month = jun,
year = "2024",
address = "Sheffield, United Kingdom",
publisher = "European Association for Machine Translation",
url = "https://eamt2024.github.io/proceedings/vol1.pdf",
pages = "411--430"
}
When Your Cousin Has the Right Connections: Unsupervised Bilingual Lexicon Induction for Related Data-Imbalanced Languages (BEST STUDENT PAPER AWARD)
Niyati Bafna, Cristina España-Bonet, Josef van Genabith, Benoît Sagot, Rachel Bawden
In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 17544–17556, Torino, Italia. ELRA and ICCL, May 2024.
[
Abstract
PDF
Poster
Slides
BibTeX
]
Most existing approaches for unsupervised bilingual lexicon induction (BLI) depend on good quality static or contextual embeddings requiring large monolingual corpora for both languages. However, unsupervised BLI is most likely to be useful for low-resource languages (LRLs), where large datasets are not available. Often we are interested in building bilingual resources for LRLs against related high-resource languages (HRLs), resulting in severely imbalanced data settings for BLI. We first show that state-of-the-art BLI methods in the literature exhibit near-zero performance for severely data-imbalanced language pairs, indicating that these settings require more robust techniques. We then present a new method for unsupervised BLI between a related LRL and HRL that only requires inference on a masked language model of the HRL, and demonstrate its effectiveness on truly low-resource languages Bhojpuri and Magahi (with <5M monolingual tokens each), against Hindi. We further present experiments on (mid-resource) Marathi and Nepali to compare approach performances by resource range, and release our resulting lexicons for five low-resource Indic languages: Bhojpuri, Magahi, Awadhi, Braj, and Maithili, against Hindi.
@inproceedings{bafna-etal-2024-cousin-right,
title = "When Your Cousin Has the Right Connections: Unsupervised Bilingual Lexicon Induction for Related Data-Imbalanced Languages",
author = "Bafna, Niyati and
Espa{\~n}a-Bonet, Cristina and
van Genabith, Josef and
Sagot, Beno{\^\i}t and
Bawden, Rachel",
editor = "Calzolari, Nicoletta and
Kan, Min-Yen and
Hoste, Veronique and
Lenci, Alessandro and
Sakti, Sakriani and
Xue, Nianwen",
booktitle = "Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)",
month = may,
year = "2024",
address = "Torino, Italia",
publisher = "ELRA and ICCL",
url = "https://aclanthology.org/2024.lrec-main.1526",
pages = "17544--17556"
}
DGS-Fabeln-1: A Multi-Angle Parallel Corpus of Fairy Tales between German Sign Language and German Text
Fabrizio Nunnari, Eleftherios Avramidis, Cristina España-Bonet, Marco González, Anna Hennes, and Patrick Gebhard
In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 4847–4857, Torino, Italia. ELRA and ICCL, May 2024.
[
Abstract
PDF
Poster
Slides
BibTeX
]
We present the acquisition process and the data of DGS-Fabeln-1, a parallel corpus of German text and videos containing German fairy tales interpreted into German Sign Language (DGS) by a native DGS signer. The corpus contains 573 segments of videos with a total duration of 1 hour and 32 minutes, corresponding to 1428 written sentences. It is the first corpus of semi-naturally expressed DGS that has been filmed from 7 angles, and one of the few sign language (SL) corpora globally which have been filmed from more than 3 angles and where the listener has been simultaneously filmed. The corpus aims at aiding research in SL linguistics, SL machine translation and affective computing, and is freely available for research purposes at the following address: https://doi.org/10.5281/zenodo.10822097.
@inproceedings{nunnari-etal-2024-dgs-fabeln,
title = "{DGS}-Fabeln-1: A Multi-Angle Parallel Corpus of Fairy Tales between {G}erman {S}ign {L}anguage and {G}erman Text",
author = "Nunnari, Fabrizio and
Avramidis, Eleftherios and
Espa{\~n}a-Bonet, Cristina and
Gonz{\'a}lez, Marco and
Hennes, Anna and
Gebhard, Patrick",
editor = "Calzolari, Nicoletta and
Kan, Min-Yen and
Hoste, Veronique and
Lenci, Alessandro and
Sakti, Sakriani and
Xue, Nianwen",
booktitle = "Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)",
month = may,
year = "2024",
address = "Torino, Italia",
publisher = "ELRA and ICCL",
url = "https://aclanthology.org/2024.lrec-main.434",
pages = "4847--4857"
}
2023
Multilingual Coarse Political Stance Classification of Media. The Editorial Line of a ChatGPT and Bard Newspaper
Cristina España-Bonet
In Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 11757–11777, Singapore (hybrid). Association for Computational Linguistics, December 2023.
[
Abstract
PDF
Poster
Slides
BibTeX
]
Neutrality is difficult to achieve and, in politics, subjective. Traditional media typically adopt an editorial line that can be used by their potential readers as an indicator of the media bias. Several platforms currently rate news outlets according to their political bias. The editorial line and the ratings help readers in gathering a balanced view of news. But in the advent of instruction-following language models, tasks such as writing a newspaper article can be delegated to computers. Without imposing a biased persona, where would an AI-based news outlet lie within the bias ratings? In this work, we use the ratings of authentic news outlets to create a multilingual corpus of news with coarse stance annotations (Left and Right) along with automatically extracted topic annotations. We show that classifiers trained on this data are able to identify the editorial line of most unseen newspapers in English, German, Spanish and Catalan. We then apply the classifiers to 101 newspaper-like articles written by ChatGPT and Bard in the 4 languages at different time periods. We observe that, similarly to traditional newspapers, ChatGPT editorial line evolves with time and, being a data-driven system, the stance of the generated articles differs among languages.
@inproceedings{espana-bonet:2023,
title = "Multilingual Coarse Political Stance Classification of Media. The Editorial Line of a {C}hat{GPT} and Bard Newspaper",
author = "Espa{\~n}a-Bonet, Cristina",
editor = "Bouamor, Houda and Pino, Juan and Bali, Kalika",
booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2023",
month = dec,
year = "2023",
address = "Singapore",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.findings-emnlp.787",
doi = "10.18653/v1/2023.findings-emnlp.787",
pages = "11757--11777"
}
Translating away Translationese without Parallel Data
Rricha Jalota, Koel Chowdhury, Cristina España-Bonet and Josef van Genabith
In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 7086–7100, Singapore (hybrid). Association for Computational Linguistics, December 2023.
[
Abstract
PDF
BibTeX
]
Translated texts exhibit systematic linguistic differences compared to original texts in the same language, and these differences are referred to as translationese. Translationese has effects on various cross-lingual natural language processing tasks, potentially leading to biased results. In this paper, we explore a novel approach to reduce translationese in translated texts: translation-based style transfer. As there are no parallel human-translated and original data in the same language, we use a self-supervised approach that can learn from comparable (rather than parallel) mono-lingual original and translated data. However, even this self-supervised approach requires some parallel data for validation. We show how we can eliminate the need for parallel validation data by combining the self-supervised loss with an unsupervised loss. This unsupervised loss leverages the original language model loss over the style-transferred output and a semantic similarity loss between the input and style-transferred output. We evaluate our approach in terms of original vs. translationese binary classification in addition to measuring content preservation and target-style fluency. The results show that our approach is able to reduce translationese classifier accuracy to a level of a random classifier after style transfer while adequately preserving the content and fluency in the target original style.
@inproceedings{jalotaEtAl:2023,
title = "Translating away Translationese without Parallel Data",
author = "Jalota, Rricha and Chowdhury, Koel and Espa{\~n}a-Bonet, Cristina and van Genabith, Josef",
editor = "Bouamor, Houda and Pino, Juan and Bali, Kalika",
booktitle = "Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing",
month = dec,
year = "2023",
address = "Singapore",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.emnlp-main.438",
doi = "10.18653/v1/2023.emnlp-main.438",
pages = "7086--7100"
}
Findings of the Second WMT Shared Task on Sign Language Translation (WMT-SLT23)
Mathias Müller, Malihe Alikhani, Eleftherios Avramidis, Richard Bowden, Annelies Braffort, Necati Cihan Camgöz, Sarah Ebling, Cristina España-Bonet, Anne Göhring, Roman Grundkiewicz, Mert Inan, Zifan Jiang, Oscar Koller, Amit Moryossef, Annette Rios, Dimitar Shterionov, Sandra Sidler-Miserez, Katja Tissi and Davy Van Landuyt
In Proceedings of the Eighth Conference on Machine Translation (WMT), pages 68–94, Singapore (hybrid). Association for Computational Linguistics. December 2023.
[
Abstract
PDF
BibTeX
]
This paper presents the results of the Second WMT Shared Task on Sign Language Translation (WMT-SLT23; https://www.wmt-slt.com/). This shared task is concerned with automatic translation between signed and spoken languages. The task is unusual in the sense that it requires processing visual information (such as video frames or human pose estimation) beyond the well-known paradigm of text-to-text machine translation (MT). The task offers four tracks involving the following languages: Swiss German Sign Language (DSGS), French Sign Language of Switzerland (LSF-CH), Italian Sign Language of Switzerland (LIS-CH), German, French and Italian. Four teams (including one working on a baseline submission) participated in this second edition of the task, all submitting to the DSGS-to-German track. Besides a system ranking and system papers describing state-of-the-art techniques, this shared task makes the following scientific contributions: novel corpora and reproducible baseline systems. Finally, the task also resulted in publicly available sets of system outputs and more human evaluation scores for sign language translation.
@inproceedings{muller-etal-2023-findings,
title = "Findings of the Second {WMT} Shared Task on Sign Language Translation ({WMT}-{SLT}23)",
author = {M{\"u}ller, Mathias and
Alikhani, Malihe and
Avramidis, Eleftherios and
Bowden, Richard and
Braffort, Annelies and
Cihan Camg{\"o}z, Necati and
Ebling, Sarah and
Espa{\~n}a-Bonet, Cristina and
G{\"o}hring, Anne and
Grundkiewicz, Roman and
Inan, Mert and
Jiang, Zifan and
Koller, Oscar and
Moryossef, Amit and
Rios, Annette and
Shterionov, Dimitar and
Sidler-Miserez, Sandra and
Tissi, Katja and
Van Landuyt, Davy},
editor = "Koehn, Philipp and
Haddow, Barry and
Kocmi, Tom and
Monz, Christof",
booktitle = "Proceedings of the Eighth Conference on Machine Translation",
month = dec,
year = "2023",
address = "Singapore",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.wmt-1.4",
doi = "10.18653/v1/2023.wmt-1.4",
pages = "68--94"
}
Measuring Spurious Correlation in Classification: ``Clever Hans'' in Translationese
Angana Borah, Daria Pylypenko, Cristina España-Bonet and Josef van Genabith
In Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing (RANLP), pages 196-206, Varna, Bulgaria (hybrid), September 2023.
[
Abstract
PDF
BibTeX
]
Recent work has shown evidence of ``Clever Hans'' behavior in high-performance neural translationese classifiers, where BERT-based classifiers capitalize on spurious correlations, in particular topic information, between data and target classification labels, rather than genuine translationese signals. Translationese signals are subtle (especially for professional translation) and compete with many other signals in the data such as genre, style, author, and, in particular, topic. This raises the general question of how much of the performance of a classifier is really due to spurious correlations in the data versus the signals actually targeted for by the classifier, especially for subtle target signals and in challenging (low resource) data settings. We focus on topic-based spurious correlation and approach the question from two directions: (i) where we have no knowledge about spurious topic information and its distribution in the data, (ii) where we have some indication about the nature of spurious topic correlations. For (i) we develop a measure from first principles capturing alignment of unsupervised topics with target classification labels as an indication of spurious topic information in the data. We show that our measure is the same as purity in clustering and propose a ``topic floor'' (as in a ``noise floor'') for classification. For (ii) we investigate masking of known spurious topic carriers in classification. Both (i) and (ii) contribute to quantifying and (ii) to mitigating spurious correlations.
@inproceedings{borahEtAl2023,
title = "Measuring Spurious Correlation in Classification: {``}Clever Hans{''} in Translationese",
author = "Borah, Angana and
Pylypenko, Daria and
Espa{\~n}a-Bonet, Cristina and
van Genabith, Josef",
editor = "Mitkov, Ruslan and
Angelova, Galia",
booktitle = "Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing",
month = sep,
year = "2023",
address = "Varna, Bulgaria",
publisher = "INCOMA Ltd., Shoumen, Bulgaria",
url = "https://aclanthology.org/2023.ranlp-1.22",
pages = "196--206"
}
Human Biases in Multilingual Models
Cristina España-Bonet
Invited talk at the Language In The Human Machine Era Workshop: Bridging the gap between technology and professionals, LITHME WG1-WG7, August 28th-30th, Budapest, Hungary, 2023.
[
Slides
Abstract
]
Some human preferences are universal. The odor of vanilla is perceived as pleasant all around the world. We expect neural models trained on human texts to exhibit these kinds of preferences, i.e. biases, but we show that this is not always the case. We explore multilingual embedding models in 9 languages and, when possible, compare them under similar training conditions. We introduce and release CA-WEAT, multilingual cultural aware tests to quantify biases, and compare them to previous English-centric tests. Monolingual static embeddings do exhibit these universal human biases, but values differ across languages, being indeed far from universal. Biases are less evident in contextual models, to the point that the original human association might be reversed. Multilinguality proves to be another variable that attenuates and even reverses the effect of the bias, especially in contextual multilingual models. In order to explain this variance among models and languages, we examine the effect of asymmetries in the training corpus, departures from isomorphism in multilingual embedding spaces and discrepancies in the testing measures between languages.
Enriching Wayúunaiki–Spanish Neural Machine Translation with Linguistic Information
Nora Graichen, Josef van Genabith and Cristina España-Bonet
In Proceedings of the Third Workshop on Natural Language Processing for Indigenous Languages of the Americas (AmericasNLP), pages 67-83, July 14th, Toronto, Canada (hybrid), 2023.
[
Abstract
PDF
BibTeX
]
We present the first neural machine translation system for the low-resource language pair Wayúunaiki–Spanish and explore strategies to inject linguistic knowledge into the model to improve translation quality. We explore a wide range of methods and combine complementary approaches. Results indicate that incorporating linguistic information through linguistically motivated subword segmentation, factored models, and pretrained embeddings helps the system to generate improved translations, with the segmentation contributing the most. In order to evaluate translation quality in a general domain and go beyond the available religious domain data, we gather and make publicly available a new test set and supplementary material. Although translation quality as measured with automatic metrics is low, we hope these resources will facilitate and support further research on Wayúunaiki.
@inproceedings{graichenEtAl2023,
title = "Enriching {W}ay{\'u}naiki--{S}panish Neural Machine Translation with Linguistic Information",
author = "Graichen, Nora and van Genabith, Josef and Espa{\~n}a-Bonet, Cristina",
booktitle = "Proceedings of the Workshop on Natural Language Processing for Indigenous Languages of the Americas (AmericasNLP)",
month = jul,
year = "2023",
address = "Toronto, Canada",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.americasnlp-1.9",
doi = "10.18653/v1/2023.americasnlp-1.9",
pages = "67--83",
abstract = "We present the first neural machine translation system for the low-resource language pair Way{\'u}unaiki--{S}panish and explore strategies to inject linguistic knowledge into the model to improve translation quality. We explore a wide range of methods and combine complementary approaches. Results indicate that incorporating linguistic information through linguistically motivated subword segmentation, factored models, and pretrained embeddings helps the system to generate improved translations, with the segmentation contributing the most. In order to evaluate translation quality in a general domain and go beyond the available religious domain data, we gather and make publicly available a new test set and supplementary material. Although translation quality as measured with automatic metrics is low, we hope these resources will facilitate and support further research on Way{\'u}unaiki.",
}
Towards Incorporating 3D Space-Awareness Into an Augmented Reality Sign Language Interpreter
Fabrizio Nunnari, Eleftherios Avramidis, Vemburaj Yadav, Alain Pagani, Yasser Hamidullah, Sepideh Mollanorozy, Cristina España-Bonet, Emil Woop and Patrick Gebhard
In Proceedings of the Eighth International Workshop on Sign Language Translation and Avatar Technology (SLTAT 2023), located at ICASSP 2023, IEEE, pages 1-5, June 10th, Rhodes, Greece, 2023.
[
Abstract
PDF
BibTeX
]
This paper describes the concept and the software architecture of a fully integrated system supporting a dialog between a deaf person and a hearing person through a virtual sign language interpreter (aka avatar) projected in the real space by an Augmented Reality device. In addition, a Visual Simultaneous Localization and Mapping system provides information about the 3D location of the objects recognized in the surrounding environment, allowing the avatar to orient, look and point towards the real location of discourse entities during the translation. The goal being to provide a modular architecture to test single software components in a fully integrated framework and move virtual sign language interpreters beyond the standard ``front-facing'' interaction paradigm.
@InProceedings{NunnariEtal:SLAT:2023,
author = {Nunnari, Fabrizio and Avramidis, Eleftherios and Yadav, Vemburaj and Pagani, Alain and Hamidullah, Yasser and Mollanorozy, Sepideh and Espa{\~n}a-Bonet, Cristina and Woop, Emil and Gebhard, Patrick},
title = "Towards Incorporating 3D Space-Awareness Into an Augmented Reality Sign Language Interpreter",
booktitle = "Proceedings of the Eighth International Workshop on Sign Language Translation and Avatar Technology (SLTAT-2023), located at ICASSP 2023",
month = jun,
year = "2023",
address = "Rhodes, Greece",
publisher = "IEEE",
pages = "1--5"
}
Are the Best Multilingual Document Embeddings simply Based on Sentence Embeddings?
Sonal Sannigrahi, Josef van Genabith and Cristina España-Bonet
In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (Findings), pages 2306-2316, May 2nd-4th, Dubrovnik, Croatia (hybrid), 2023.
[
Abstract
PDF
BibTeX
arXiv
]
Dense vector representations for textual data are crucial in modern NLP. Word embeddings and sentence embeddings estimated from raw texts are key in achieving state-of-the-art results in various tasks requiring semantic understanding. However, obtaining embeddings at the document level is challenging due to computational requirements and lack of appropriate data. Instead, most approaches fall back on computing document embeddings based on sentence representations. Although there exist architectures and models to encode documents fully, they are in general limited to English and few other high-resourced languages. In this work, we provide a systematic comparison of methods to produce document-level representations from sentences based on LASER, LaBSE, and Sentence BERT pre-trained multilingual models. We compare input token number truncation, sentence averaging as well as some simple windowing and in some cases new augmented and learnable approaches, on 3 multi- and cross-lingual tasks in 8 languages belonging to 3 different language families. Our task-based extrinsic evaluations show that, independently of the language, a clever combination of sentence embeddings is usually better than encoding the full document as a single unit, even when this is possible. We demonstrate that while a simple sentence average results in a strong baseline for classification tasks, more complex combinations are necessary for semantic tasks.
@InProceedings{SannigrahiEtal:EACL:2023,
author = {Sannigrahi, Sonal and van Genabith, Josef and Espa{\~n}a-Bonet, Cristina},
title = "Are the Best Multilingual Document Embeddings simply Based on Sentence Embeddings?",
booktitle = "Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (Findings)",
month = may,
year = "2023",
address = "Dubrovnik, Croatia (hybrid)",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/",
pages = "2306--2316"
}
Cross-lingual Strategies for Low-resource Language Modeling: A Study on Five Indic Dialects
Niyati Bafna, Cristina España-Bonet, Josef van Genabith, Benoît Sagot and Rachel Bawden
In Proceedings of the 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN), pages 28-42, June 5-9, Paris, France, 2023.
[
Abstract
PDF
BibTeX
]
Neural language models play an increasingly central role for language processing, given their success for a range of NLP tasks. In this study, we compare some canonical strategies in language modeling for low-resource scenarios, evaluating all models by their (finetuned) performance on a POS-tagging downstream task. We work with five (extremely) low-resource dialects from the Indic dialect continuum (Braj, Awadhi, Bhojpuri, Magahi, Maithili), which are closely related to each other and the standard mid-resource dialect, Hindi. The strategies we evaluate broadly include from-scratch pretraining, and cross-lingual transfer between the dialects as well as from different kinds of off-the-shelf multilingual models; we find that a model pretrained on other mid-resource Indic dialects and languages, with extended pretraining on target dialect data, consistently outperforms other models. We interpret our results in terms of dataset sizes, phylogenetic relationships, and corpus statistics, as well as particularities of this linguistic system.
@InProceedings{BafnaEtal:TALN:2023,
title="{Cross-lingual Strategies for Low-resource Language Modeling: A Study on Five Indic Dialects}",
author={Bafna, Niyati and Espa{\~n}a-Bonet, Cristina and van Genabith, Josef and Sagot, Beno{\^\i}t and Bawden, Rachel},
booktitle={30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN)},
pages={28--42},
year={2023},
address = {Paris, France},
organization={ATALA}
}
Tailoring and Evaluating the Wikipedia for in-Domain Comparable Corpora Extraction
Cristina España-Bonet, Alberto Barrón-Cedeño, Lluís Màrquez
Knowledge and Information Systems, Volume 65, pages 1365-1397. 2023. Springer-Verlag, London Ltd. https://doi.org/10.1007/s10115-022-01767-5
[
Abstract
PDF
BibTeX
arXiv (pre-review)
]
We propose a language-independent graph-based method to build à-la-carte article collections on user-defined domains from the Wikipedia. The core model is based on the exploration of the encyclopedia's category graph and can produce both mono- and multilingual comparable collections. We run thorough experiments to assess the quality of the obtained corpora in 10 languages and 743 domains. According to an extensive manual evaluation, our graph model reaches an average precision of 84% on in-domain articles, outperforming an alternative model based on information retrieval techniques. As manual evaluations are costly, we introduce the concept of domainness and design several automatic metrics to account for the quality of the collections. Our best metric for domainness shows a strong correlation with human judgments, representing a reasonable automatic alternative to assess the quality of domain-specific corpora. We release the WikiTailor toolkit with the implementation of the extraction methods, the evaluation measures and several utilities.
@article{EspanaBonetEtal:2022,
author = {{Espa{\~n}a-Bonet}, Cristina and {Barr\'on-Cede{\~n}o}, Alberto and {M\`arquez}, Llu\'{i}s},
title = "{Tailoring and Evaluating the Wikipedia for in-Domain Comparable Corpora Extraction}",
journal = {Knowledge and Information Systems},
publisher = {Springer-Verlag},
address = {London, England},
keywords = {Comparable corpora, Wikipedia category graph, Domain-specific corpora, Domainness metrics},
doi = {10.1007/s10115-022-01767-5},
year = 2023,
volume = {65},
pages = {1365--1397}
}
2022
The (Undesired) Attenuation of Human Biases by Multilinguality
Cristina España-Bonet and Alberto Barrón-Cedeño
In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022), pages 2056-2077, Abu Dhabi, UAE (hybrid), 9-11 December, 2022.
[
Abstract
PDF
BibTeX
]
Some human preferences are universal. The odor of vanilla is perceived as pleasant all around the world. We expect neural models trained on human texts to exhibit these kinds of preferences, i.e. biases, but we show that this is not always the case. We explore 16 static and contextual embedding models in 9 languages and, when possible, compare them under similar training conditions. We introduce and release CA-WEAT, multilingual cultural aware tests to quantify biases, and compare them to previous English-centric tests. Our experiments confirm that monolingual static embeddings do exhibit human biases, but values differ across languages, being far from universal. Biases are less evident in contextual models, to the point that the original human association might be reversed. Multilinguality proves to be another variable that attenuates and even reverses the effect of the bias, especially in contextual multilingual models. In order to explain this variance among models and languages, we examine the effect of asymmetries in the training corpus, departures from isomorphism in multilingual embedding spaces and discrepancies in the testing measures between languages.
@inproceedings{espana-bonet-etal-2022-attenuation,
title = "The (Undesired) Attenuation of Human Biases by Multilinguality",
author = "Espa{\~n}a-Bonet, Cristina and Barr\'on-Cede{\~n}o, Alberto",
booktitle = "Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP)",
month = dec,
year = "2022",
address = "Abu Dhabi, UAE (hybrid)",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2022.emnlp-main.133",
pages = "2056--2077"
}
Explaining Translationese: why are Neural Classifiers Better and what do they Learn?
Kwabena Amponsah-Kaakyire, Daria Pylypenko, Josef van Genabith and Cristina España-Bonet
In Proceedings of the fifth BlackBoxNLP Workshop, pages 281-296, December 8th, Abu Dhabi, UAE (hybrid), 2022.
[
Abstract
PDF
BibTeX
arXiv
]
Recent work has shown that neural feature- and representation-learning, e.g. BERT, achieves superior performance over traditional manual feature engineering based approaches, with e.g. SVMs, in translationese classification tasks. Previous research did not show (i) whether the difference is because of the features, the classifiers or both, and (ii) what the neural classifiers actually learn. To address (i), we carefully design experiments that swap features between BERT- and SVM-based classifiers. We show that an SVM fed with BERT representations performs at the level of the best BERT classifiers, while BERT learning and using handcrafted features performs at the level of an SVM using handcrafted features. This shows that the performance differences are due to the features. To address (ii) we use integrated gradients and find that (a) there is indication that information captured by hand-crafted features is only a subset of what BERT learns, and (b) part of BERT's top performance results are due to BERT learning topic differences and spurious correlations with translationese.
@InProceedings{AmponsahEtal:Blackbox:2022,
author = {Amponsah-Kaakyire, Kwabena and Pylypenko, Daria and van Genabith, Josef and Espa{\~n}a-Bonet, Cristina},
title = "Explaining Translationese: why are Neural Classifiers Better and what do they Learn?",
booktitle = "Proceedings of the fifth BlackBoxNLP Workshop",
month = dec,
year = "2022",
address = "Abu Dhabi, UAE (hybrid)",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2022.blackboxnlp-1.23.pdf",
pages = "281--296"
}
Combining Noisy Semantic Signals with Orthographic Cues: Cognate Induction for the Indic Dialect Continuum
Niyati Bafna, Josef van Genabith, Cristina España-Bonet and Zdeněk Žabokrtský
In Proceedings of the Conference on Computational Natural Language Learning (CoNLL 2022), pages 110-131, December 7-8, Abu Dhabi, UAE (hybrid), 2022.
[
Abstract
PDF
BibTeX
]
We present a novel method for unsupervised cognate/borrowing identification from monolingual corpora designed for low and extremely low resource scenarios, based on combining noisy semantic signals from joint bilingual spaces with orthographic cues modelling sound change. We apply our method to the North Indian dialect continuum, containing several dozens of dialects and languages spoken by more than 100 million people. Many of these languages are zero-resource and therefore natural language processing for them is non-existent. We first collect monolingual data for 26 Indic languages, 16 of which were previously zero-resource, and perform exploratory character, lexical and subword cross-lingual alignment experiments for the first time at this scale on this dialect continuum. We create bilingual evaluation lexicons against Hindi for 20 of the languages. We then apply our cognate identification method on the data, and show that our method outperforms both traditional orthography baselines as well as EM-style learnt edit distance matrices. To the best of our knowledge, this is the first work to combine traditional orthographic cues with noisy bilingual embeddings to tackle unsupervised cognate detection in a (truly) low-resource setup, showing that even noisy bilingual embeddings can act as good guides for this task. We release our multilingual dialect corpus, called HinDialect, as well as our scripts for evaluation data collection and cognate induction.
@InProceedings{BafnaEtal:CoNLL:2022,
author = {Niyati Bafna and Josef van Genabith and Cristina Espa{\~n}a-Bonet and Zden\v{e}k \v{Z}abokrtsk\'{y}},
title = "Combining Noisy Semantic Signals with Orthographic Cues: Cognate Induction for the Indic Dialect Continuum",
booktitle = "Proceedings of the 2022 Conference on Computational Natural Language Learning (CoNLL 2022)",
month = dec,
year = "2022",
address = "Abu Dhabi, UAE (hybrid)",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2022.conll-1.9.pdf",
pages = "110--131"
}
Findings of the WMT 2022 Shared Task on Sign Language Translation
Mathias Müller, Sarah Ebling, Eleftherios Avramidis, Alessia Battisti, Michèle Berger, Richard Bowden,
Annelies Braffort, Necati Cihan Camgöz, Cristina España-Bonet, Roman Grundkiewicz, Zifan Jiang,
Oscar Koller, Amit Moryossef, Regula Perrollaz, Sabine Reinhard, Annette Rios, Dimitar Shterionov,
Sandra Sidler-Miserez, Katja Tissi and Davy Van Landuyt
In Proceedings of the Seventh Conference on Machine Translation (WMT 2022), pages 744-772, December 7-8, Abu Dhabi, UAE (hybrid), 2022.
[
Abstract
PDF
BibTeX
]
This paper presents the results of the First WMT Shared Task on Sign Language Translation (WMT-SLT22). This shared task is concerned with automatic translation between signed and spoken languages. The task is novel in the sense that it requires processing visual information (such as video frames or human pose estimation) beyond the well-known paradigm of text-to-text machine translation (MT). The task featured two tracks, translating from Swiss German Sign Language (DSGS) to German and vice versa. Seven teams participated in this first edition of the task, all submitting to the DSGS-to-German track.
Besides a system ranking and system papers describing state-of-the-art techniques, this shared task makes the following scientific contributions: novel corpora, reproducible baseline systems and new protocols and software for human evaluation. Finally, the task also resulted in the first publicly available set of system outputs and human evaluation scores for sign language translation.
@InProceedings{mullerEtAl:WMT:2022,
author = {M\"uller, Mathias and Ebling, Sarah and Avramidis, Eleftherios and Battisti, Alessia and Berger, Mich{\`e}le and Bowden, Richard
and Braffort, Annelies and Camg{\"o}z, Necati Cihan and Espa{\~n}a-Bonet, Cristina and Grundkiewicz, Roman and Jiang, Zifan
and Koller, Oscar and Moryossef, Amit and Perrollaz, Regula and Reinhard, Sabine and Rios, Annette and Shterionov, Dimitar and
Sidler-Miserez, Sandra and Tissi, Katja and Van Landuyt, Davy},
title = "Findings of the {WMT} 2022 Shared Task on Sign Language Translation",
booktitle = {Proceedings of the Seventh Conference on Machine Translation},
key = {WMT 2022},
pages = {744--772},
year = {2022},
month = {December},
address = {Abu Dhabi, UAE (hybrid)},
publisher = {Association for Computational Linguistics}
}
DFKI-MLT at WMT-SLT22: Spatio-temporal Sign Language Representation and Translation
Yasser Hamidullah, Josef van Genabith and Cristina España-Bonet
In Proceedings of the Seventh Conference on Machine Translation (WMT 2022), pages 977-982, December 7-8, Abu Dhabi, UAE (hybrid), 2022.
[
Abstract
PDF
BibTeX
]
This paper describes the DFKI-MLT submission to the WMT-SLT 2022 sign language translation (SLT) task from Swiss German Sign Language (video) into German (text).
State-of-the-art techniques for SLT use a generic seq2seq architecture with customized input embeddings. Instead of word embeddings as used in textual machine translation, SLT systems use features extracted from video frames. Standard approaches often do not benefit from temporal features. In our participation, we present a system that learns spatio-temporal feature representations and translation in a single model, resulting in a real end-to-end architecture expected to better generalize to new data sets. Our best system achieved 5±1 BLEU points on the development set, but the performance on the test set dropped to 0.11±0.06 BLEU points.
@InProceedings{hamidullaEtAl:WMT:2022,
author = {Yasser Hamidullah and Josef van Genabith and Cristina Espa{\~n}a-Bonet},
title = "{DFKI-MLT at WMT-SLT22: Spatio-temporal Sign Language Representation and Translation}",
booktitle = {Proceedings of the Seventh Conference on Machine Translation},
key = {WMT 2022},
pages = {977--982},
year = {2022},
month = {December},
address = {Abu Dhabi, UAE (hybrid)},
publisher = {Association for Computational Linguistics}
}
Towards Automated Sign Language Production: A Pipeline for Creating Inclusive Virtual Humans
Lucas Bernhard, Fabrizio Nunnari, Amelie Unger, Judith Bauerdiek, Christian Dold, Marcel Hauck, Alexander Stricker, Tobias Baur, Alexander Heimerl, Elisabeth André, Melissa Reinecker, Cristina España-Bonet, Yasser Hamidullah, Stephan Busemann, Patrick Gebhard, Corinna Jäger, Sonja Wecker, Yvonne Kossel, Henrik Müller, Kristoffer Waldow, Arnulph Fuhrmann, Martin Misiak and Dieter Wallach
In Proceedings of the 15th International Conference on PErvasive Technologies Related to Assistive Environments (PETRA '22), pages 26-35. Association for Computing Machinery, New York, NY, USA.
[
Abstract
PDF
BibTeX
arXiv
]
In everyday life, Deaf People face barriers because information is often only available in spoken or written language. Producing sign language videos showing a human interpreter is often not feasible due to the amount of data required or because the information changes frequently. The ongoing AVASAG project addresses this issue by developing a 3D sign language avatar for the automatic translation of texts into sign language for public services. The avatar is trained using recordings of human interpreters translating text into sign language. For this purpose, we create a corpus with video and motion capture data and an annotation scheme that allows for real-time translation and subsequent correction without requiring to correct the animation frames manually. This paper presents the general translation pipeline focusing on innovative points, such as adjusting an existing annotation system to the specific requirements of sign language and making it usable to annotators from the Deaf communities.
@inproceedings{10.1145/3529190.3529202,
author = {Bernhard, Lucas and Nunnari, Fabrizio and Unger, Amelie and Bauerdiek, Judith and Dold, Christian and Hauck, Marcel and Stricker, Alexander and Baur, Tobias and Heimerl, Alexander and Andr\'{e}, Elisabeth and Reinecker, Melissa and Espa\~{n}a-Bonet, Cristina and Hamidullah, Yasser and Busemann, Stephan and Gebhard, Patrick and J\"{a}ger, Corinna and Wecker, Sonja and Kossel, Yvonne and M\"{u}ller, Henrik and Waldow, Kristoffer and Fuhrmann, Arnulph and Misiak, Martin and Wallach, Dieter},
title = {Towards Automated Sign Language Production: A Pipeline for Creating Inclusive Virtual Humans},
year = {2022},
isbn = {9781450396318},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3529190.3529202},
doi = {10.1145/3529190.3529202},
abstract = {In everyday life, Deaf People face barriers because information is often only available in spoken or written language. Producing sign language videos showing a human interpreter is often not feasible due to the amount of data required or because the information changes frequently. The ongoing AVASAG project addresses this issue by developing a 3D sign language avatar for the automatic translation of texts into sign language for public services. The avatar is trained using recordings of human interpreters translating text into sign language. For this purpose, we create a corpus with video and motion capture data and an annotation scheme that allows for real-time translation and subsequent correction without requiring to correct the animation frames manually. This paper presents the general translation pipeline focusing on innovative points, such as adjusting an existing annotation system to the specific requirements of sign language and making it usable to annotators from the Deaf communities.},
booktitle = {Proceedings of the 15th International Conference on PErvasive Technologies Related to Assistive Environments},
pages = {26--35},
keywords = {sign language production, automatic translation, annotation, motion capture, corpus},
location = {Corfu, Greece},
series = {PETRA '22}
}
Multilingual Neural Machine Translation
Cristina España-Bonet
Invited talk at the 11th Advanced Summer School on NLP (IASNLP-2022), IIIT Hyderabad, India, 23rd June 2022.
Towards Debiasing Translation Artifacts
Koel Dutta Chowdhury, Rricha Jalota, Cristina España-Bonet and Josef van Genabith
Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2022), pages 3983-3991, July 10-15, Seattle, 2022.
[
Abstract
PDF
BibTeX
arXiv
]
Cross-lingual natural language processing relies on translation, either by humans or machines, at different levels, from translating training data to translating test sets. However, compared to original texts in the same language, translations possess distinct qualities referred to as translationese. Previous research has shown that these translation artifacts influence the performance of a variety of cross-lingual tasks. In this work, we propose a novel approach to reducing translationese by extending an established bias-removal technique. We use the Iterative Null-space Projection (INLP) algorithm, and show by measuring classification accuracy before and after debiasing, that translationese is reduced at both sentence and word level. We evaluate the utility of debiasing translationese on a natural language inference (NLI) task, and show that by reducing this bias, NLI accuracy improves. To the best of our knowledge, this is the first study to debias translationese as represented in latent embedding space.
@InProceedings{DuttaEtal:NAACL:2022,
author = {Dutta Chowdhury, Koel and Jalota, Rricha and Espa{\~n}a-Bonet, Cristina and van Genabith, Josef},
title = "Towards Debiasing Translation Artifacts",
booktitle = "Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2022)",
month = jul,
year = "2022",
address = "Seattle, United States",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2022.naacl-main.292",
pages = "3983--3991"
}
Exploiting Social Media Content for Self-Supervised Style Transfer
Dana Ruiter, Thomas Kleinbauer, Cristina España-Bonet, Dietrich Klakow, Josef van Genabith
10th International Workshop on Natural Language Processing for Social Media (SocialNLP2022), pages 11-23, July 14-15, Seattle, USA, 2022.
[
Abstract
PDF
BibTeX
arXiv
]
Recent research on style transfer takes inspiration from unsupervised neural machine translation (UNMT), learning from large amounts of non-parallel data by exploiting cycle consistency loss, back-translation, and denoising autoencoders. By contrast, the use of self-supervised NMT (SSNMT), which leverages (near) parallel instances hidden in non-parallel data more efficiently than UNMT, has not yet been explored for style transfer. In this paper we present a novel Self-Supervised Style Transfer (3ST) model, which augments SSNMT with UNMT methods in order to identify and efficiently exploit supervisory signals in non-parallel social media posts. We compare 3ST with state-of-the-art (SOTA) style transfer models across civil rephrasing, formality and polarity tasks. We show that 3ST is able to balance the three major objectives (fluency, content preservation, attribute transfer accuracy) the best, outperforming SOTA models on averaged performance across their tested tasks in automatic and human evaluation.
@InProceedings{RuiterEtal:2022,
author = {Dana Ruiter and Thomas Kleinbauer and Cristina Espa{\~n}a-Bonet and Josef van Genabith and Dietrich Klakow},
title = "{Exploiting Social Media Content for Self-Supervised Style Transfer}",
booktitle = {Proceedings of the 10th International Workshop on Natural Language Processing for Social Media},
year = 2022,
month = jul,
address = "Seattle, USA",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2022.socialnlp-1.2.pdf",
pages = {11--23}
}
Low-resource Natural Language Processing (or a Bit of it!)
Cristina España-Bonet
Invited talk at the 3rd AfricaNLP workshop collocated with ICLR, 29th April 2022.
[
Slides
Abstract
]
Under this very generic title I will summarise some work in our group related to embeddings, machine translation and evaluation that has been done for languages from Sub-Saharan Africa. I will start by defining a low-resource setting and we will see how, in this case, the (language-dependent) curation of the data is crucial for some tasks. Afterwards I will focus on neural machine translation (NMT) and compare several approaches when only a limited amount of parallel data is available. Using one of the models as example, self-supervised NMT, we will discuss the evaluation of such models to see that, in low-resource settings, not only trainings but also evaluations are a challenge.
2021
Low-Resource NLP: Multilinguality and Machine Translation
Cristina España-Bonet
LT-BRIDGE Webinar Series, Summer 2021.
[
Topics
Session 1
YoutubeS1
Session 2
YoutubeS2
Session 3
YoutubeS3
Session 4
YoutubeS4
Session 5
YoutubeS5
]
Session 1
- Motivation
Session 2
- Recap on LR-NLP
- Cross-lingual Embeddings
- Unsupervised Neural Machine Translation
Session 3
- Recap on CL-WE and UNMT
- Neural Machine Translation
- Low-Resource Setting for NMT
- Multilingual Neural Machine Translation
Session 4
- Recap on Multilingual NMT
- Self-Supervised Neural Machine Translation
- Sentence Embeddings with LASER
- Pretrained Language Models and Seq2Seq systems
Session 5
- State-of-the-art: WMT Evaluations
- Multilingual Low-Resource Translation for Indo-European Languages @WMT21
Findings of the 2021 Conference on Machine Translation (WMT21)
Farhad Akhbardeh, Arkady Arkhangorodsky, Magdalena Biesialska, Ondrej Bojar, Rajen Chatterjee, Vishrav Chaudhary, Marta R. Costa-jussa, Cristina España-Bonet, Angela Fan, Christian Federmann, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Leonie Harter, Kenneth Heafield, Christopher Homan, Matthias Huck, Kwabena Amponsah-Kaakyire, Jungo Kasai, Daniel Khashabi, Kevin Knight, Tom Kocmi, Philipp Koehn, Nicholas Lourie, Christof Monz, Makoto Morishita, Masaaki Nagata, Ajay Nagesh, Toshiaki Nakazawa, Matteo Negri, Santanu Pal, Allahsera Auguste Tapo, Marco Turchi, Valentin Vydrin and Marcos Zampieri
In Proceedings of the Sixth Conference on Machine Translation (WMT), pages 1-93, Punta Cana (online), November 2021.
[
Abstract
PDF
BibTeX
]
This paper presents the results of the news translation task, the multilingual low-resource translation for Indo-European languages, the triangular translation task, and the automatic post-editing task organised as part of the Conference on Machine Translation (WMT) 2021. In the news task, participants were asked to build machine translation systems for any of 10 language pairs, to be evaluated on test sets consisting mainly of news stories. The task was also opened up to additional test suites to probe specific aspects of translation. In the Similar Language Translation (SLT) task, participants were asked to develop systems to translate between pairs of similar languages from the Dravidian and Romance families as well as French to two similar low-resource Manding languages (Bambara and Maninka). In the Triangular MT translation task, participants were asked to build a Russian to Chinese translator, given parallel data in Russian-Chinese, Russian-English and English-Chinese. In the multilingual low-resource translation for Indo-European languages task, participants built multilingual systems to translate among Romance and North-Germanic languages. The task was designed to deal with the translation of documents in the cultural heritage domain for relatively low-resourced languages. In the automatic post-editing (APE) task, participants were asked to develop systems capable of correcting the errors made by an unknown machine translation system.
@InProceedings{wmt:2021,
author = {Akhbardeh, Farhad and Arkhangorodsky, Arkady and Biesialska, Magdalena and Bojar, Ond{\v{r}}ej and Chatterjee, Rajen and Chaudhary, Vishrav and Costa-juss{\`a}, Marta R. and
Espa{\~n}a-Bonet, Cristina and Fan, Angela and Federmann, Christian and Freitag, Markus and Graham, Yvette and Grundkiewicz, Roman and Haddow, Barry and Harter, Leonie and
Heafield, Kenneth and Homan, Christopher and Huck, Matthias and Amponsah-Kaakyire, Kwabena and Kasai, Jungo and Khashabi, Daniel and Knight, Kevin and Kocmi, Tom and
Koehn, Philipp and Lourie, Nicholas and Monz, Christof and Morishita, Makoto and Nagata, Masaaki and Nagesh, Ajay and Nakazawa, Toshiaki and Negri, Matteo and
Pal, Santanu and Tapo, Allahsera Auguste and Turchi, Marco and Vydrin, Valentin and Zampieri, Marcos},
title = "Findings of the 2021 Conference on Machine Translation (WMT21)",
booktitle = "Proceedings of the Sixth Conference on Machine Translation (WMT)",
month = nov,
year = "2021",
address = "Punta Cana (Online)",
publisher = "Association for Computational Linguistics",
url = "http://statmt.org/wmt21/pdf/2021.wmt-1.1.pdf",
pages = "1--93"
}
Comparing Feature-Engineering and Feature-Learning Approaches for Multilingual Translationese Classification
Daria Pylypenko, Kwabena Amponsah-Kaakyire, Koel Dutta Chowdhury, Josef van Genabith and Cristina España-Bonet
In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), pages 8596-8611, Punta Cana (online), November 2021.
[
Abstract
PDF
BibTeX
arXiv
]
Traditional hand-crafted linguistically-informed features have often been used for distinguishing between translated and original non-translated texts. By contrast, to date, neural architectures without manual feature engineering have been less explored for this task. In this work, we (i) compare the traditional feature-engineering-based approach to the feature-learning-based one and (ii) analyse the neural architectures in order to investigate how well the hand-crafted features explain the variance in the neural models' predictions. We use pre-trained neural word embeddings, as well as several end-to-end neural architectures in both monolingual and multilingual settings and compare them to feature-engineering-based SVM classifiers. We show that (i) neural architectures outperform other approaches by more than 20 accuracy points, with the BERT-based model performing the best in both the monolingual and multilingual settings; (ii) while many individual hand-crafted translationese features correlate with neural model predictions, feature importance analysis shows that the most important features for neural and classical architectures differ; and (iii) our multilingual experiments provide empirical evidence for translationese universals across languages.
@InProceedings{PylypenkoEtal:EMNLP:2021,
author = {Pylypenko, Daria and Amponsah-Kaakyire, Kwabena and Dutta Chowdhury, Koel and van Genabith, Josef and Espa{\~n}a-Bonet, Cristina},
title = "Comparing Feature-Engineering and Feature-Learning Approaches for Multilingual Translationese Classification",
booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP21)",
month = nov,
year = "2021",
address = "Punta Cana (Online)",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2021.emnlp-main.676.pdf",
pages = "8596--8611"
}
Tracing Source Language Interference in Translation with Graph-Isomorphism Measures
Koel Dutta Chowdhury, Cristina España-Bonet and Josef van Genabith
Proceedings of Recent Advances in Natural Language Processing (RANLP 2021), pages 380-390, September 1-3, Virtual, 2021.
[
Abstract
PDF
BibTeX
arXiv
]
Previous research has used linguistic features to show that translations exhibit traces of source language interference and that phylogenetic trees between languages can be reconstructed from the results of translations into the same language. Recent research has shown that instances of translationese (source language interference) can even be detected in embedding spaces, comparing embedding spaces of original language data with embedding spaces resulting from translations into the same language, using a simple Eigenvector-based divergence from isomorphism measure. To date it remains an open question whether alternative graph-isomorphism measures can produce better results. In this paper, we (i) explore Gromov-Hausdorff distance, (ii) present a novel spectral version of the Eigenvector-based method, and (iii) evaluate all approaches against a broad linguistic typological database (URIEL). We show that language distances resulting from our spectral isomorphism approaches can reproduce genetic trees at par with previous work without requiring any explicit linguistic information and that the results can be extended to non-Indo-European languages. Finally, we show that the methods are robust under a variety of modeling conditions.
@InProceedings{DuttaEtal:RANLP:2021,
author = {Dutta Chowdhury, Koel and Espa{\~n}a-Bonet, Cristina and van Genabith, Josef},
title = "Tracing Source Language Interference in Translation with Graph-Isomorphism Measures",
booktitle = "Proceedings of the International Conference Recent Advances in Natural Language Processing, {RANLP} 2021",
editor = "Mitkov, Ruslan and Angelova, Galia",
month = sep,
year = "2021",
address = "Varna, Bulgaria",
publisher = "INCOMA Ltd.",
doi = "10.26615/978-954-452-072-4_044",
pages = "380--390"
}
Integrating Unsupervised Data Generation into Self-Supervised Neural Machine Translation for Low-Resource Languages
Dana Ruiter, Dietrich Klakow, Josef van Genabith, Cristina España-Bonet
The 18th biennial conference of the International Association of Machine Translation, MT Summit XVIII, Vol 1: MT Research Track, pages 76-91, August 16-20, Virtual, 2021.
[
Abstract
PDF
BibTeX
arXiv
]
For most language combinations, parallel data is either scarce or simply unavailable. To address this, unsupervised machine translation (UMT) exploits large amounts of monolingual data by using synthetic data generation techniques such as back-translation and noising, while self-supervised NMT (SSNMT) identifies parallel sentences in smaller comparable data and trains on them. To date, the inclusion of UMT data generation techniques in SSNMT has not been investigated. We show that including UMT techniques into SSNMT significantly outperforms SSNMT and UMT on all tested language pairs, with improvements of up to +4.3 BLEU, +50.8 BLEU and +51.5 BLEU over SSNMT, statistical UMT and hybrid UMT, respectively, on Afrikaans to English. We further show that the combination of multilingual denoising autoencoding, SSNMT with backtranslation and bilingual finetuning enables us to learn machine translation even for distant language pairs for which only small amounts of monolingual data are available, e.g. yielding BLEU scores of 11.6 (English to Swahili).
@InProceedings{RuiterEtal:2021,
author = {Dana Ruiter and Dietrich Klakow and Josef van Genabith and Cristina Espa{\~n}a-Bonet},
title = "{Integrating Unsupervised Data Generation into Self-Supervised Neural Machine Translation for Low-Resource Languages}",
booktitle = {Proceedings of the 18th biennial conference of the International Association of Machine Translation, MT Summit XVIII, Vol 1: MT Research Track},
year = 2021,
month = aug,
address = "Virtual",
publisher = "Association for Machine Translation in the Americas",
url = "https://aclanthology.org/2021.mtsummit-research.7",
pages = {76--91}
}
The Effect of Domain and Diacritics in Yorùbá-English Neural Machine Translation
David I. Adelani, Dana Ruiter, Jesujoba O. Alabi, Damilola Adebonojo, Adesina Ayeni, Mofe Adeyemi, Ayodele Awokoya, Cristina España-Bonet
The 18th biennial conference of the International Association of Machine Translation, MT Summit XVIII, Vol 1: MT Research Track, pages 62-75, August 16-20, Virtual, 2021.
[
Abstract
PDF
BibTeX
arXiv
]
Massively multilingual machine translation (MT) has shown impressive capabilities, including zero and few-shot translation between low-resource language pairs. However, these models are often evaluated on high-resource languages with the assumption that they generalize to low-resource ones. The difficulty of evaluating MT models on low-resource pairs is often due to the lack of standardized evaluation datasets. In this paper, we present MENYO-20k, the first multi-domain parallel corpus with a special focus on clean orthography for Yorùbá-English with standardized train-test splits for benchmarking. We provide several neural MT benchmarks and compare them to the performance of popular pre-trained (massively multilingual) MT models both for the heterogeneous test set and its subdomains. Since these pre-trained models use huge amounts of data with uncertain quality, we also analyze the effect of diacritics, a major characteristic of Yorùbá, in the training data. We investigate how and when this training condition affects the final quality and intelligibility of a translation. Our models outperform massively multilingual models such as Google (+8.7 BLEU) and Facebook M2M (+9.1 BLEU) when translating to Yorùbá, setting a high quality benchmark for future research.
@InProceedings{AdelaniEtal:2021,
author = {David I. Adelani and Dana Ruiter and Jesujoba O. Alabi and Damilola Adebonojo and Adesina Ayeni and Mofe Adeyemi and Ayodele Awokoya and Cristina Espa{\~n}a-Bonet},
title = "{The Effect of Domain and Diacritics in Yor\`ub\'a--English Neural Machine Translation}",
booktitle = {Proceedings of the 18th biennial conference of the International Association of Machine Translation, MT Summit XVIII, Vol 1: MT Research Track},
year = 2021,
month = aug,
address = "Virtual",
publisher = "Association for Machine Translation in the Americas",
url = "https://aclanthology.org/2021.mtsummit-research.6",
pages = {62--75}
}
AVASAG: A German Sign Language Translation System for Public Services
Fabrizio Nunnari, Judith Bauerdiek, Lucas Bernhard, Cristina España-Bonet, Corinna Jäger, Amelie Unger, Kristoffer Waldow, Sonja Wecker, Elisabeth André, Stephan Busemann, Christian Dold, Arnulph Fuhrmann, Patrick Gebhard, Yasser Hamidullah, Marcel Hauck, Yvonne Kossel, Martin Misiak, Dieter Wallach, Alexander Stricker
Proceedings of the 1st International Workshop on Automatic Translation for Signed and Spoken Languages (AT4SSL), pages 43-48, August 16-20, Virtual, 2021.
[
Abstract
PDF
BibTeX
]
This paper presents an overview of AVASAG; an ongoing applied-research project developing a text-to-sign-language translation system for public services. We describe the scientific innovation points (geometry-based SL-description, 3D animation and video corpus, simplified annotation scheme, motion capture strategy) and the overall translation pipeline.
@InProceedings{NunnariEtal:2021,
title = "{AVASAG}: A {G}erman {S}ign {L}anguage Translation System for Public Services",
author = {Nunnari, Fabrizio and
Bauerdiek, Judith and
Bernhard, Lucas and
Espa{\~n}a-Bonet, Cristina and
J{\"a}ger, Corinna and
Unger, Amelie and
Waldow, Kristoffer and
Wecker, Sonja and
Andr{\'e}, Elisabeth and
Busemann, Stephan and
Dold, Christian and
Fuhrmann, Arnulph and
Gebhard, Patrick and
Hamidullah, Yasser and
Hauck, Marcel and
Kossel, Yvonne and
Misiak, Martin and
Wallach, Dieter and
Stricker, Alexander},
booktitle = "Proceedings of the 1st International Workshop on Automatic Translation for Signed and Spoken Languages (AT4SSL)",
month = aug,
year = "2021",
address = "Virtual",
publisher = "Association for Machine Translation in the Americas",
url = "https://aclanthology.org/2021.mtsummit-at4ssl.5",
pages = "43--48"
}
A Data Augmentation Approach for Sign-language-to-text Translation In-the-wild (BEST POSTER AWARD)
Fabrizio Nunnari, Cristina España-Bonet and Eleftherios Avramidis
Proceedings of the 3rd Conference on Language, Data and Knowledge (LDK2021), Open Access Series in Informatics (OASIcs), Vol. 93, pages 36:1-36:8, September 2021.
[
Abstract
PDF
Poster
BibTeX
arXiv
]
In this paper, we describe the current main approaches to sign language translation which use deep neural networks with videos as input and text as output. We highlight that, under our point of view, their main weakness is the lack of generalization in daily life contexts. Our goal is to build a state-of-the-art system for the automatic interpretation of sign language in unpredictable video framing conditions. Our main contribution is the shift from image features to landmark positions in order to diminish the size of the input data and facilitate the combination of data augmentation techniques for landmarks. We describe the set of hypotheses to build such a system and the list of experiments that will lead us to their verification.
@InProceedings{NunnariEtal:LDK:2021,
author = {Nunnari, Fabrizio and Espa\~{n}a-Bonet, Cristina and Avramidis, Eleftherios},
title = {{A Data Augmentation Approach for Sign-Language-To-Text Translation In-The-Wild}},
booktitle = {3rd Conference on Language, Data and Knowledge (LDK 2021)},
pages = {36:1--36:8},
series = {Open Access Series in Informatics (OASIcs)},
ISBN = {978-3-95977-199-3},
ISSN = {2190-6807},
year = {2021},
volume = {93},
editor = {Gromann, Dagmar and S\'{e}rasset, Gilles and Declerck, Thierry and McCrae, John P. and Gracia, Jorge and Bosque-Gil, Julia and Bobillo, Fernando and Heinisch, Barbara},
publisher = {Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
address = {Dagstuhl, Germany},
URL = {https://drops.dagstuhl.de/opus/volltexte/2021/14572},
URN = {urn:nbn:de:0030-drops-145728},
doi = {10.4230/OASIcs.LDK.2021.36},
annote = {Keywords: sign language, video recognition, end-to-end translation, data augmentation}
}
Do not Rely on Relay Translations: Multilingual Parallel Direct Europarl
Kwabena Amponsah-Kaakyire, Daria Pylypenko, Cristina España-Bonet and Josef van Genabith
Proceedings of the Workshop on Modelling Translation: Translatology in the Digital Age (MoTra21), pages 1-7, Iceland (online), May 2021.
[
Abstract
PDF
BibTeX
arXiv
]
Translationese data is a scarce and valuable resource. Traditionally, the proceedings of the European Parliament have been used for studying translationese phenomena since their metadata allows to distinguish between original and translated texts. However, translations are not always direct and we hypothesise that a pivot (also called "relay") language might alter the conclusions on translationese effects. In this work, we (i) isolate translations that have been done without an intermediate language in the Europarl proceedings from those that might have used a pivot language, and (ii) build comparable and parallel corpora with data aligned across multiple languages that therefore can be used for both machine translation and translation studies.
@InProceedings{AmposahEtal:MOTRA:2021,
author = {Amponsah-Kaakyire, Kwabena and Pylypenko, Daria and Espa{\~n}a-Bonet, Cristina and van Genabith, Josef},
title = "Do not Rely on Relay Translations: Multilingual Parallel Direct Europarl",
booktitle = "Proceedings of the Workshop on Modelling Translation: Translatology in the Digital Age (MoTra21)",
month = may,
year = "2021",
address = "Iceland (Online)",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/2021.motra-1.1",
pages = "1--7"
}
MENYO-20k: A Multi-domain English-Yorùbá Corpus for Machine Translation
David I. Adelani, Dana Ruiter, Jesujoba O. Alabi, Damilola Adebonojo, Adesina Ayeni, Mofe Adeyemi, Ayodele Awokoya, Cristina España-Bonet
arXiv pre-print 2103.08647, March 2021. Accepted to the AfricaNLP 2021 Workshop.
[
Abstract
PDF
BibTeX
arXiv
]
Massively multilingual machine translation (MT) has shown impressive capabilities, including zero and few-shot translation between low-resource language pairs. However, these models are often evaluated on high-resource languages with the assumption that they generalize to low-resource ones. The difficulty of evaluating MT models on low-resource pairs is often due to the lack of standardized evaluation datasets. In this paper, we present MENYO-20k, the first multi-domain parallel corpus for the low-resource Yorùbá-English (yo-en) language pair with standardized train-test splits for benchmarking. We provide several neural MT (NMT) benchmarks on this dataset and compare to the performance of popular pre-trained (massively multilingual) MT models, showing that, in almost all cases, our simple benchmarks outperform the pre-trained MT models. A major gain of BLEU +9.9 and +8.6 (en2yo) is achieved in comparison to Facebook's M2M-100 and Google multilingual NMT respectively when we use MENYO-20k to fine-tune generic models.
@Article{AdelaniEtal:2020,
author = {David I. Adelani and Dana Ruiter and Jesujoba O. Alabi and Damilola Adebonojo and Adesina Ayeni and Mofe Adeyemi and Ayodele Awokoya and Cristina Espa{\~n}a-Bonet},
title = "{MENYO-20k: A Multi-domain English-Yor\`ub\'a Corpus for Machine Translation}",
journal = {arXiv e-prints},
year = 2021,
month = apr,
pages = {1--12},
archivePrefix = {arXiv},
eprint = {2103.08647},
primaryClass = {cs.CL}
}
Multilingual Sentence Embeddings in/for/and Neural Machine Translation
Cristina España-Bonet
Talk at the Recent Advances in Machine Translation Symposium, 18th March 2021.
[
Slides
Abstract
]
Neural machine translation (NMT) experienced a big boost in quality with the emergence of Transformer models. Almost concurrently, Transformer models were successfully used to obtain contextualised embeddings, and several extensions to the base model have achieved high quality sentence embeddings. In this talk I will outline synergies between sentence embeddings and neural machine translation with a special focus on self-supervised NMT, a new architecture that uses its own internal embeddings for data selection during training. I will also describe how (multilingual) sentence embeddings are being used to improve the performance for low-resourced languages when used in combination with (multilingual) NMT.
2020
Understanding Translationese in Multi-view Embedding Spaces
Koel Dutta Chowdhury, Cristina España-Bonet and Josef van Genabith
Proceedings of the 28th International Conference on Computational Linguistics (COLING 2020), pages 6056-6062, December 2020.
[
Abstract
PDF
BibTeX
arXiv
]
The term translationese refers to systematic differences between translations and text originally authored in the target language of the translation (in the same genre and style). In this paper, we use departures from isomorphism between embedding-based vector spaces from translations and originally authored data to estimate phylogenetic language family relations induced from single target language translation from multiple source languages. We explore multi-view embedding spaces based on words, part-of-speech, semantic tags, and synsets, to capture lexical, morphological and semantic aspects of translationese and to investigate the impact of topic on the data. Our results show that (i) language family relationships can be inferred from the monolingual embedding data, providing evidence for shining-through (source language interference) translationese effects in the data and (ii) that, perhaps surprisingly, even delexicalised embeddings exhibit significant source language interference, indicating that the lexicalised results are due to possible differences in topic between original and translated texts.
@InProceedings{DuttaEtal:COLING:2020,
author = {Dutta Chowdhury, Koel and Espa{\~n}a-Bonet, Cristina and van Genabith, Josef},
title = "Understanding Translationese in Multi-view Embedding Spaces",
booktitle = "Proceedings of the 28th International Conference on Computational Linguistics",
month = dec,
year = "2020",
address = "Barcelona, Catalonia (Online)",
publisher = "International Committee on Computational Linguistics",
url = "https://www.aclweb.org/anthology/2020.coling-main.532",
pages = "6056--6062"
}
Self-Induced Curriculum Learning in Self-Supervised Neural Machine Translation
Dana Ruiter, Josef van Genabith and Cristina España-Bonet
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 2560-2571, November 2020.
[
Abstract
PDF
BibTeX
]
Self-supervised neural machine translation (SSNMT) jointly learns to identify and select suitable training data from comparable (rather than parallel) corpora and to translate, in a way that the two tasks support each other in a virtuous circle. In this study, we provide an in-depth analysis of the sampling choices the SSNMT model makes during training. We show how, without it having been told to do so, the model self-selects samples of increasing (i) complexity and (ii) task-relevance in combination with (iii) performing a denoising curriculum. We observe that the dynamics of the mutual-supervision signals of both system internal representation types are vital for the extraction and translation performance. We show that in terms of the Gunning-Fog readability index, SSNMT starts extracting and learning from Wikipedia data suitable for high school students and quickly moves towards content suitable for first year undergraduate students.
@InProceedings{ruiterEtAl:EMNLP:2020,
author = {Dana Ruiter and Josef van Genabith and Cristina Espa\~na-Bonet},
title = "{Self-Induced Curriculum Learning in Self-Supervised Neural Machine Translation}",
booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)",
month = nov,
year = "2020",
address = "Online",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/2020.emnlp-main.202",
doi = "10.18653/v1/2020.emnlp-main.202",
pages = "2560--2571"
}
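The Gunning-Fog index used in the curriculum analysis above has a simple closed form: 0.4 × (average words per sentence + percentage of "complex" words, i.e. words with three or more syllables). A minimal sketch, where the vowel-group syllable counter is a rough heuristic and not the paper's exact tooling:

```python
import re

def syllables(word: str) -> int:
    """Rough syllable count via vowel groups (heuristic approximation)."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))

def gunning_fog(text: str) -> float:
    """0.4 * (words per sentence + 100 * fraction of complex words)."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+", text)
    complex_words = [w for w in words if syllables(w) >= 3]
    return 0.4 * (len(words) / len(sentences)
                  + 100 * len(complex_words) / len(words))
```

On this scale a score around 10 corresponds roughly to high-school reading level, matching the trajectory the abstract describes.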
Statistical Machine Translation: Main Components
Cristina España-Bonet
Invited talk at the 1r Congreso Internacional de Procesamiento de Lenguaje Natural para Lenguas Indígenas, Morelia, México, 5th November 2020.
Some Aspects of Linguistic Diversity in Europe and Africa
Cristina España-Bonet
Invited talk at the SPARC International Symposium on Mahatma Gandhi and Linguistic Diversity, 23rd September 2020.
Query or Document Translation for Academic Search — What's the real Difference?
Vivien Petras, Andreas Lüschow, Roland Ramthun, Juliane Stiller, Cristina España-Bonet and Sophie Henning
Experimental IR Meets Multilinguality, Multimodality, and Interaction, 11th International Conference of the CLEF Association, CLEF
2020, Thessaloniki, Greece, September 22-25, 2020. Lecture Notes in Computer Science, Vol. 12260, pages 28-42, Springer.
[
Abstract
PDF
BibTeX
]
We compare query and document translation from and to English, French, German and Spanish for multilingual retrieval in an academic search portal: PubPsych. Both query and document translation improve the retrieval performance of the system with document translation providing better results. We show how performance inversely correlates with the amount of available original language documents. The more documents already available in a language, the fewer improvements can be observed. Retrieval performance with English as a source language does not improve with translation as most documents already contained English-language content in our text collection. The large-scale evaluation study is based on a corpus of more than 1 M metadata documents and 50 real queries in English, French, German and Spanish taken from the query log files of the portal.
@InProceedings{petrasEtAl:CLEF:2020,
author = {Vivien Petras and Andreas L\"uschow and Roland Ramthun and Juliane Stiller and Cristina Espa{\~n}a-Bonet and Sophie Henning},
title = "{Query or Document Translation for Academic Search -- What's the real Difference?}",
booktitle = {Experimental {IR} Meets Multilinguality, Multimodality, and Interaction
- 11th International Conference of the {CLEF} Association, {CLEF}
2020, Thessaloniki, Greece, September 22-25, 2020, Proceedings},
series = {Lecture Notes in Computer Science},
volume = {12260},
pages = {28--42},
publisher = {Springer},
year = {2020},
doi = {10.1007/978-3-030-58219-7\_3},
key = {CLEF 2020},
month = sep,
address = {Thessaloniki, Greece},
}
How Human is Machine Translationese? Comparing Human and Machine Translations of Text and Speech
Yuri Bizzoni, Tom S Juzek, Cristina España-Bonet, Koel Dutta Chowdhury, Josef van Genabith and Elke Teich
Proceedings of the 17th International Workshop on Spoken Language Translation (IWSLT), pages 280-290, Seattle, WA, United States, July 2020.
[
Abstract
PDF
BibTeX
arXiv
]
Translationese is a phenomenon present in human translations, simultaneous interpreting, and even machine translations. Some translationese features tend to appear in simultaneous interpreting with higher frequency than in human text translation, but the reasons for this are unclear. This study analyzes translationese patterns in translation, interpreting, and machine translation outputs in order to explore possible reasons. In our analysis we (i) detail two non-invasive ways of detecting translationese and (ii) compare translationese across human and machine translations from text and speech. We find that machine translation shows traces of translationese, but does not reproduce the patterns found in human translation, offering support to the hypothesis that such patterns are due to the model (human vs. machine) rather than to the data (written vs. spoken).
@InProceedings{BizzoniEtal:IWSLT:2020,
author = {Bizzoni, Yuri and Juzek, Tom S and Espa{\~n}a-Bonet, Cristina and Dutta Chowdhury, Koel and van Genabith, Josef and Teich, Elke},
title = "How Human is Machine Translationese? Comparing Human and Machine Translations of Text and Speech",
booktitle = "Proceedings of the 17th International Conference on Spoken Language Translation",
month = jul,
year = "2020",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/2020.iwslt-1.34",
pages = "280--290"
}
Tailoring and Evaluating the Wikipedia for in-Domain Comparable Corpora Extraction
Cristina España-Bonet, Alberto Barrón-Cedeño, Lluís Màrquez
arXiv pre-print 2005.01177, May 2020.
[
Abstract
PDF
BibTeX
arXiv
]
We propose an automatic language-independent graph-based method to build à-la-carte article collections on user-defined domains from the Wikipedia. The core model is based on the exploration of the encyclopaedia's category graph and can produce both monolingual and multilingual comparable collections. We run thorough experiments to assess the quality of the obtained corpora in 10 languages and 743 domains. According to an extensive manual evaluation, our graph-based model outperforms a retrieval-based approach and reaches an average precision of 84% on in-domain articles. As manual evaluations are costly, we introduce the concept of "domainness" and design several automatic metrics to account for the quality of the collections. Our best metric for domainness shows a strong correlation with the human-judged precision, representing a reasonable automatic alternative to assess the quality of domain-specific corpora. We release the WikiTailor toolkit with the implementation of the extraction methods, the evaluation measures and several utilities. WikiTailor makes obtaining multilingual in-domain data from the Wikipedia easy.
@Article{EspanaBonetEtal:2020,
author = {{Espa{\~n}a-Bonet}, Cristina and {Barr\'on-Cede{\~n}o}, Alberto and {M\`arquez}, Llu\'{i}s},
title = "{Tailoring and Evaluating the Wikipedia for in-Domain Comparable Corpora Extraction}",
journal = {arXiv e-prints},
keywords = {Computer Science - Computation and Language, Computer Science - Information Retrieval},
year = 2020,
month = may,
pages = {1--26},
archivePrefix = {arXiv},
eprint = {2005.01177},
primaryClass = {cs.CL}
}
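The category-graph exploration at the core of the method can be pictured as a bounded breadth-first traversal: start at a user-chosen root category, follow subcategory links, and collect the articles filed under the visited categories. The toy graph and the fixed depth cutoff below are illustrative assumptions; WikiTailor's actual traversal and stopping criteria are described in the paper.

```python
from collections import deque

# Toy slice of a category graph: category -> subcategories / articles.
subcats = {"Physics": ["Mechanics", "Optics"], "Mechanics": ["Dynamics"]}
articles = {"Physics": ["Isaac Newton"], "Optics": ["Lens"], "Dynamics": ["Force"]}

def collect_articles(subcats, articles, root, max_depth=2):
    """Breadth-first walk from `root`, gathering articles up to `max_depth`."""
    seen, queue, result = {root}, deque([(root, 0)]), []
    while queue:
        cat, depth = queue.popleft()
        result.extend(articles.get(cat, []))
        if depth < max_depth:
            for child in subcats.get(cat, []):
                if child not in seen:
                    seen.add(child)
                    queue.append((child, depth + 1))
    return result
```

Running the traversal per language edition yields the comparable, in-domain article collections the abstract evaluates.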
Multilingual and Interlingual Semantic Representations for Natural Language Processing: A Brief Introduction
Marta R. Costa-jussà, Cristina España-Bonet, Pascale Fung and Noah A. Smith
Special Issue of Computational Linguistics: Multilingual and Interlingual Semantic Representations for Natural Language Processing, pages 1-8, March 2020
[
Abstract
PDF
BibTeX
]
We introduce the Computational Linguistics special issue on Multilingual and Interlingual Semantic Representations for Natural Language Processing. We situate the special issue's five articles in the context of our fast-changing field, explaining our motivation for this project. We offer a brief summary of the work in the issue, which includes developments on lexical and sentential semantic representations, from symbolic and neural perspectives.
@article{ruizEtal:2020,
title = "Multilingual and Interlingual Semantic Representations for Natural Language Processing: A Brief Introduction",
author = "Costa-juss{\`a}, Marta and Espa{\~n}a-Bonet, Cristina and Fung, Pascale and Smith, Noah A.",
publisher = {MIT Press},
address = {Cambridge, MA, USA},
journal = {Computational Linguistics},
month = mar,
year = "2020",
doi = "10.1162/COLI_a_00373",
pages = "1--8"
}
Massive vs. Curated Embeddings for Low-Resourced Languages: the Case of Yorùbá and Twi
Jesujoba O. Alabi, Kwabena Amponsah-Kaakyire, David I. Adelani and Cristina España-Bonet
Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020), pages 2754-2762 , Marseille, France, May 2020.
[
Abstract
PDF
BibTeX
]
The success of several architectures to learn semantic representations from unannotated text and the availability of these kinds of texts in online multilingual resources such as Wikipedia have facilitated the massive and automatic creation of resources for multiple languages. The evaluation of such resources is usually done for the high-resourced languages, where one has a smorgasbord of tasks and test sets to evaluate on. For low-resourced languages, the evaluation is more difficult and normally ignored, with the hope that the impressive capability of deep learning architectures to learn (multilingual) representations in the high-resourced setting holds in the low-resourced setting too.
In this paper we focus on two African languages, Yorùbá and Twi, and compare the word embeddings obtained in this way with word embeddings obtained from curated corpora and language-dependent processing. We analyse the noise in the publicly available corpora, collect high quality and noisy data for the two languages and quantify the improvements that depend not only on the amount of data but on the quality too. We also use different architectures that learn word representations both from surface forms and characters to further exploit all the available information, which proved to be important for these languages. For the evaluation, we manually translate the wordsim-353 word pairs dataset from English into Yorùbá and Twi.
We extend the analysis to contextual word embeddings and evaluate multilingual BERT on a named entity recognition task. For this, we annotate with named entities the Global Voices corpus for Yorùbá. As output of the work, we provide corpora, embeddings and the test suites for both languages.
@inproceedings{alabiEtal:2020:LREC,
title = "Massive vs. Curated Embeddings for Low-Resourced Languages: the Case of Yor\`ub\'a and Twi",
author = "Jesujoba O. Alabi and Kwabena Amponsah-Kaakyire and David I. Adelani and Cristina Espa{\~n}a-Bonet",
booktitle = "Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020)",
month = may,
year = "2020",
address = "Marseille, France",
publisher = "European Language Resources Association (ELRA)",
url = "https://www.aclweb.org/anthology/2020.lrec-1.335/",
pages = "2754--2762"
}
GeBioToolkit: Automatic Extraction of Gender-Balanced Multilingual Corpus of Wikipedia Biographies
Marta R. Costa-jussà, Pau Li Lin and Cristina España-Bonet
Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020), pages 4081-4088, Marseille, France, May 2020.
[
Abstract
PDF
BibTeX
]
We introduce GeBioToolkit, a tool for extracting multilingual parallel corpora at sentence level, with document and gender information, from Wikipedia biographies. Despite the gender inequalities present in Wikipedia, the toolkit has been designed to extract a corpus balanced in gender. While our toolkit is customizable to any number of languages (and to domains other than biographical entries), in this work we present a corpus of 2,000 sentences in English, Spanish and Catalan, which has been post-edited by native speakers to become a high-quality dataset for machine translation evaluation. While GeBioCorpus aims at being one of the first non-synthetic gender-balanced test datasets, GeBioToolkit aims at paving the path to standardize procedures to produce gender-balanced datasets.
@inproceedings{ruizEtal:2020:LREC,
title = "GeBioToolkit: Automatic Extraction of Gender-Balanced Multilingual Corpus of Wikipedia Biographies",
author = "Costa-juss{\`a}, Marta and Li Lin, Pau and Espa{\~n}a-Bonet, Cristina",
booktitle = "Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020)",
month = may,
year = "2020",
address = "Marseille, France",
publisher = "European Language Resources Association (ELRA)",
url = "https://www.aclweb.org/anthology/2020.lrec-1.502/",
pages = "4081--4088"
}
2019
Analysing Coreference in Transformer Outputs
Ekaterina Lapshinova-Koltunski, Cristina España-Bonet and Josef van Genabith
Proceedings of the Fourth Workshop on Discourse in Machine Translation (DiscoMT 2019), pages 1-12, Hong Kong, November 2019.
[
Abstract
PDF
BibTeX
]
We analyse coreference phenomena in three neural machine translation systems trained with different data settings with or without access to explicit intra- and cross-sentential anaphoric information. We compare system performance on two different genres: news and TED talks. To do this, we manually annotate (the possibly incorrect) coreference chains in the MT outputs and evaluate the coreference chain translations. We define an error typology that aims to go further than pronoun translation adequacy and includes types such as incorrect word selection or missing words. The features of coreference chains in automatic translations are also compared to those of the source texts and human translations. The analysis shows stronger potential translationese effects in machine translated outputs than in human translations.
@inproceedings{lapshinovaEtal:2019:DiscoMT,
title = "Analysing Coreference in Transformer Outputs",
author = "Lapshinova-Koltunski, Ekaterina and Espa{\~n}a-Bonet, Cristina and van Genabith, Josef",
booktitle = "Proceedings of the Fourth Workshop on Discourse in Machine Translation (DiscoMT 2019)",
month = nov,
year = "2019",
address = "Hong Kong",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/D19-6501",
doi = "10.18653/v1/D19-6501",
pages = "1--12"
}
Context-Aware Neural Machine Translation Decoding
Eva Martínez Garcia, Carles Creus and Cristina España-Bonet
Proceedings of the Fourth Workshop on Discourse in Machine Translation (DiscoMT 2019), pages 13-23, Hong Kong, November 2019.
[
Abstract
PDF
BibTeX
]
This work presents a decoding architecture that fuses the information from a neural translation model and the context semantics enclosed in a semantic space language model based on word embeddings. The method extends the beam search decoding process and therefore can be applied to any neural machine translation framework. With this, we sidestep two drawbacks of current document-level systems: (i) we do not modify the training process so there is no increment in training time, and (ii) we do not require document-level annotated data. We analyze the impact of the fusion system approach and its parameters on the final translation quality for English-Spanish. We obtain consistent and statistically significant improvements in terms of BLEU and METEOR and we observe how the fused systems are able to handle synonyms to propose more adequate translations as well as help the system to disambiguate among several translation candidates for a word.
@InProceedings{martinezEtAl:DiscoMT:2019,
title = "Context-Aware Neural Machine Translation Decoding",
author = "Mart{\'\i}nez Garcia, Eva and Creus, Carles and Espa{\~n}a-Bonet, Cristina",
booktitle = "Proceedings of the Fourth Workshop on Discourse in Machine Translation (DiscoMT 2019)",
month = nov,
year = "2019",
address = "Hong Kong, China",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/D19-6502",
doi = "10.18653/v1/D19-6502",
pages = "13--23"
}
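The fusion described in the abstract operates inside beam search; stripped to its core, each hypothesis score can be expressed as a weighted combination of the translation-model log-probability and a context score from the semantic-space language model. A minimal reranking sketch under that assumption (the weight, scores and candidates below are illustrative, not the paper's actual parameterisation):

```python
def fused_score(tm_logprob: float, ctx_score: float, weight: float = 0.3) -> float:
    """Log-linear interpolation of translation-model and context-LM scores."""
    return (1 - weight) * tm_logprob + weight * ctx_score

def rerank(candidates, weight=0.3):
    """candidates: (hypothesis, tm_logprob, ctx_score) triples;
    returns them best fused score first."""
    return sorted(candidates, key=lambda c: fused_score(c[1], c[2], weight),
                  reverse=True)

# A context model that prefers the river sense of "bank" can flip the
# ranking even when the translation model slightly prefers finance.
candidates = [("bank (finance)", -1.2, -3.0), ("bank (river)", -1.5, -0.5)]
best = rerank(candidates)[0][0]
```

In the full system this scoring is applied per expansion step of the beam rather than as a final rerank, but the combination rule is the same shape.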
Self-Supervised Neural Machine Translation
Dana Ruiter, Cristina España-Bonet and Josef van Genabith
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Volume 2: Short Papers, pages 1828-1834, Florence, Italy, August 2019.
[
Abstract
PDF
BibTeX
]
We present a simple new method where an emergent NMT system is used for simultaneously selecting training data and learning internal NMT representations. This is done in a self-supervised way without parallel data, in such a way that both tasks enhance each other during training. The method is language independent, introduces no additional hyper-parameters, and achieves BLEU scores of 29.21 (en2fr) and 27.36 (fr2en) on newstest2014 using English and French Wikipedia data for training.
@InProceedings{ruiterEtAl:ACL:2019,
author = {Dana Ruiter and Cristina Espa\~na-Bonet and Josef van Genabith},
title = "{Self-Supervised Neural Machine Translation}",
booktitle = {Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Volume 2: Short Papers},
key = {ACL 2019},
pages = {1828--1834},
year = {2019},
month = {August},
address = {Florence, Italy},
publisher = {Association for Computational Linguistics}
}
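The "simultaneously selecting training data and learning representations" loop relies on scoring candidate sentence pairs by representation similarity and keeping the suitable ones. A minimal sketch of the selection half, using toy bag-of-words vectors and a plain cosine threshold in place of the system's internal NMT representations and acceptance criterion:

```python
import math

def bow(sentence: str) -> dict:
    """Toy bag-of-words vector (stand-in for learned embeddings)."""
    vec = {}
    for tok in sentence.lower().split():
        vec[tok] = vec.get(tok, 0) + 1
    return vec

def cosine(u: dict, v: dict) -> float:
    dot = sum(u.get(k, 0) * v.get(k, 0) for k in set(u) | set(v))
    nu = math.sqrt(sum(x * x for x in u.values()))
    nv = math.sqrt(sum(x * x for x in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def select_pairs(src_sents, tgt_sents, threshold=0.5):
    """Keep candidate pairs whose similarity clears the threshold."""
    return [(s, t) for s in src_sents for t in tgt_sents
            if cosine(bow(s), bow(t)) >= threshold]
```

In the actual system the vectors come from the emergent NMT model itself, so selection quality improves as translation quality improves, which is the virtuous circle the abstract describes.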
UdS-DFKI Participation at WMT 2019: Low-Resource (en-gu) and Coreference-Aware (en-de) Systems
Cristina España-Bonet, Dana Ruiter and Josef van Genabith
Proceedings of the Fourth Conference on Machine Translation, pages 382-389, Florence, Italy, August 2019.
[
Abstract
PDF
BibTeX
]
This paper describes the UdS-DFKI submission to the WMT2019 news translation task for Gujarati-English (low-resourced pair) and German-English (document-level evaluation). Our systems rely on the on-line extraction of parallel sentences from comparable corpora for the first scenario and on the inclusion of coreference-related information in the training data in the second one.
@InProceedings{espanaEtAl:WMT:2019,
author = {Cristina Espa\~na-Bonet and Dana Ruiter and Josef van Genabith},
title = "{UdS-DFKI Participation at WMT 2019: Low-Resource ($en$--$gu$) and Coreference-Aware ($en$--$de$) Systems}",
booktitle = {Proceedings of the Fourth Conference on Machine Translation},
key = {WMT 2019},
pages = {382--389},
year = {2019},
month = {August},
address = {Florence, Italy},
publisher = {Association for Computational Linguistics}
}
2018
Neural Machine Translation is like a Pig
Cristina España-Bonet
Invited talk at the Deep Learning BCN Symposium, Barcelona, Catalunya, 20th December 2018.
[
Abstract
Slides
]
Neural machine translation systems (NMT) are state-of-the-art for most language pairs, especially for those with a large amount of parallel data available. These systems are expensive to train both in time and resources but, as with a pig, all of their parts can be (re)used afterwards. In this talk I will sketch how and why multilingual word and sentence embeddings obtained from an NMT system can be used for other purposes such as assessing semantic cross-lingual similarities, parallel sentence extraction or cross-lingual information retrieval. Under this perspective, NMT can be seen as an auxiliary task --multilingual by definition-- to obtain multilingual representations in the same way the skip-gram and CBOW tasks were defined to obtain monolingual word embeddings. Following this analogy, I will compare differences between seq2seq and transformer architectures as two variants for the same goal.
Query Translation for Cross-lingual Search in the Academic Search Engine PubPsych (BEST PAPER AWARD)
Cristina España-Bonet, Juliane Stiller, Roland Ramthun, Josef van Genabith and Vivien Petras
Proceedings of the 12th International Conference on Metadata and Semantics Research (MTSR 2018), Limassol, Cyprus, October 2018.
Vol. 846 of the Communications in Computer and Information Science (CCIS) book series, Springer
[
Abstract
PDF
BibTeX
]
We describe a lexical resource-based process for query translation of a domain-specific and multilingual academic search engine in psychology, PubPsych. PubPsych queries are diverse in language with a high amount of informational queries and technical terminology. We present an approach for translating queries into English, German, French, and Spanish. We build a quadrilingual lexicon with aligned terms in the four languages using MeSH, Wikipedia and Apertium as our main resources. Our results show that using the quadlexicon together with some simple translation rules, we can automatically translate 85% of translatable tokens in PubPsych queries with mean adequacy over all the translatable text of 1.4 when measured on a 3-point scale [0,1,2].
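The token-level lookup at the heart of this pipeline can be sketched as follows. This is an illustrative reconstruction, not the PubPsych code; the function name, lexicon layout and lowercasing are assumptions, and the real system additionally applies translation rules and handles multi-word technical terms:

```python
# Hypothetical sketch of lexicon-based query translation: each lexicon entry
# maps a source term to its equivalents in the other languages; tokens with
# no entry are passed through untranslated (non-translatable or OOV tokens).
def translate_query(query, lexicon, target_lang):
    out = []
    for token in query.lower().split():
        entry = lexicon.get(token)
        out.append(entry.get(target_lang, token) if entry else token)
    return " ".join(out)
```

With a toy entry such as `{"depresión": {"en": "depression"}}`, the Spanish query "depresión infantil" would be rendered as "depression infantil", leaving the uncovered token untouched.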
@InProceedings{espanaBonetEtAl:MTSR:2018,
author = {Cristina Espa{\~n}a-Bonet and Juliane Stiller and Roland Ramthun and Josef van Genabith and Vivien Petras},
title = "{Query Translation for Cross-lingual Search in the Academic Search Engine PubPsych}",
editor="Garoufallou, Emmanouel and Sartori, Fabio and Siatri, Rania and Zervas, Marios",
booktitle="Metadata and Semantic Research",
year="2019",
publisher="Springer International Publishing",
address="Cham",
pages="37--49",
isbn="978-3-030-14401-2",
doi="10.1007/978-3-030-14401-2_4"
}
Neural Machine Translation with Context & Document Information
Cristina España-Bonet
Invited talk at the First International Workshop on Discourse Processing, Guangdong University of Foreign Studies, Guangzhou, China, 23rd October 2018.
The role of Artificial Intelligence within Natural Language
Cristina España-Bonet
Talk at the Multilingual Public Services in Europe Workshop, EC, Brussels, Belgium, 17th October 2018.
Multilingual Semantic Networks for Data-driven Interlingua Seq2Seq Systems
Cristina España-Bonet and Josef van Genabith
Proceedings of the LREC 2018 MLP-MomenT Workshop (MLP-Moment 2018), pages 8-13, Miyazaki, Japan, May 2018.
[
Abstract
PDF
Slides
BibTeX
]
Neural machine translation systems are state-of-the-art for most language pairs despite the fact that they are relatively recent and that because of this there is likely room for even further improvements. Here, we explore whether, and if so, to what extent, semantic networks can help improve NMT. In particular, we (i) study the contribution of the nodes of the semantic network, synsets, as factors in multilingual neural translation engines. We show that they improve a state-of-the-art baseline and that they facilitate the translation from languages that have not been seen at all in training (beyond zero-shot translation). Taking this idea to an extreme, we (ii) use synsets as the basic unit to encode the input and turn the source language into a data-driven interlingual language. This transformation boosts the performance of the neural system for unseen languages, achieving an improvement of 4.9/6.3 and 8.2/8.7 points of BLEU/METEOR for fr2en and es2en respectively when no corpora in fr or es have been used. In (i), the enhancement comes about because cross-language synsets help to cluster words by semantics irrespective of their language and to map the unknown words of a new language into the multilingual clusters. In (ii), because with the data-driven interlingua there is no unknown language if it is covered by the semantic network. However, non-content words are not represented in the semantic network, and a higher level of abstraction is still needed in order to go a step further and train these systems with only monolingual corpora, for example.
@InProceedings{espanaVanGenabith:LREC:2018,
author = {Cristina Espa\~na-Bonet and Josef van Genabith},
title = "{Multilingual Semantic Networks for Data-driven Interlingua Seq2Seq Systems}",
booktitle = {Proceedings of the LREC 2018 MLP-MomenT Workshop},
key = {MLP-MomenT 2018},
pages = {8--13},
year = {2018},
month = {May},
address = {Miyazaki, Japan}
}
2017
Going beyond zero-shot MT: combining phonological, morphological and semantic factors. The UdS-DFKI System at IWSLT 2017
Cristina España-Bonet and Josef van Genabith
Proceedings of the 14th International Workshop on Spoken Language Translation (IWSLT), pages 15-22, Tokyo, Japan, December 2017.
[
Abstract
PDF
Poster
BibTeX
]
This paper describes the UdS-DFKI participation in the multilingual task of the IWSLT Evaluation 2017. Our approach is based on factored multilingual neural translation systems following the small data and zero-shot training conditions. Our systems are designed to fully exploit multilinguality by including factors that increase the number of common elements among languages, such as phonetic coarse encodings and synsets, besides shallow part-of-speech tags, stems and lemmas. Document-level information is also considered by including the topic of every document. This approach improves over a baseline without any additional factors for all the language pairs and even allows beyond-zero-shot translation. That is, the translation from unseen languages is possible thanks to the common elements —especially synsets in our models— among languages.
@InProceedings{espanaVanGenabith:IWSLT:2017,
author = {Cristina Espa\~na-Bonet and Josef van Genabith},
title = "{Going beyond zero-shot MT: combining phonological, morphological and semantic factors. The UdS-DFKI System at IWSLT 2017}",
booktitle = {Proceedings of the 14th International Workshop on Spoken Language Translation (IWSLT)},
key = {IWSLT 2017},
pages = {15--22},
year = {2017},
month = {December},
address = {Tokyo, Japan}
}
Multilingual Natural Language Processing
Cristina España-Bonet
Talk at RICOH Institute of ICT, Tokyo, Japan, 11th December 2017.
An Empirical Analysis of NMT-Derived Interlingual Embeddings and their Use in Parallel Sentence Identification
Cristina España-Bonet, Ádám Csaba Varga, Alberto Barrón-Cedeño and Josef van Genabith
IEEE Journal of Selected Topics in Signal Processing, volume 11, number 8, pages 1340-1350, IEEE, December 2017.
[
Abstract
PDF
BibTeX
HTML
]
End-to-end neural machine translation has overtaken statistical machine translation in terms of translation quality for some language pairs, especially those with a large amount of parallel data available. Besides this palpable improvement, neural networks embrace several new properties. A single system can be trained to translate between many languages at almost no additional cost other than training time. Furthermore, internal representations learned by the network serve as a new semantic representation of words -or sentences- which, unlike standard word embeddings, are learned in an essentially bilingual or even multilingual context. In view of these properties, the contribution of the present work is two-fold. First, we systematically study the context vectors, i.e. the output of the encoder, and their prowess as an interlingua representation of a sentence. Their quality and effectiveness are assessed by similarity measures across translations, semantically related, and semantically unrelated sentence pairs. Second, and as an extrinsic evaluation of the first point, we identify parallel sentences in comparable corpora, obtaining an F1=98.2% on data from a shared task when using only context vectors. F1 reaches 98.9% when complementary similarity measures are used.
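The extrinsic use of context vectors can be sketched roughly as follows. This is not the paper's implementation; the pooling choice, function names and threshold are assumptions. The idea is to pool per-token encoder outputs into fixed-size sentence vectors and rank candidate pairs by cosine similarity:

```python
import numpy as np

def sentence_vector(context_vectors):
    """Mean-pool per-token encoder outputs (shape T x d) into one sentence vector.
    Mean pooling is one simple choice among several possible aggregations."""
    return np.mean(context_vectors, axis=0)

def cosine(u, v):
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def parallel_candidates(src_sents, tgt_sents, threshold=0.9):
    """Return (i, j, score) for cross-lingual pairs whose pooled context
    vectors exceed the similarity threshold (a hypothetical cut-off)."""
    pairs = []
    for i, s in enumerate(src_sents):
        for j, t in enumerate(tgt_sents):
            score = cosine(sentence_vector(s), sentence_vector(t))
            if score >= threshold:
                pairs.append((i, j, score))
    return pairs
```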
@article{espana-bonetElAl:2017,
author = {Cristina Espa{\~{n}}a{-}Bonet and
{\'{A}}d{\'{a}}m Csaba Varga and
Alberto Barr{\'{o}}n{-}Cede{\~{n}}o and
Josef van Genabith},
title = {An Empirical Analysis of NMT-Derived Interlingual Embeddings and their
Use in Parallel Sentence Identification},
journal = {IEEE Journal of Selected Topics in Signal Processing},
volume = {11},
number = {8},
month = {December},
pages = {1340--1350},
year = {2017},
doi = {10.1109/JSTSP.2017.2764273}
}
Learning Bilingual Projections of Embeddings for Vocabulary Expansion in Machine Translation
Pranava Swaroop Madhyastha and Cristina España-Bonet
Proceedings of the 2nd Workshop on Representation Learning for NLP (ACL Workshop RepL4NLP-2017), pages 139-145, Vancouver, Canada, August 2017.
[
Abstract
PDF
Poster
BibTeX
]
We propose a simple log-bilinear softmax-based model to deal with vocabulary expansion in machine translation. Our model uses word embeddings trained on significantly large unlabelled monolingual corpora and learns over a fairly small, word-to-word bilingual dictionary. Given an out-of-vocabulary source word, the model generates a probabilistic list of possible translations in the target language using the trained bilingual embeddings. We integrate these translation options into a standard phrase-based statistical machine translation system and obtain consistent improvements in translation quality on the English–Spanish language pair. When tested over an out-of-domain test set, we get a significant improvement of 3.9 BLEU points.
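A minimal sketch of such a log-bilinear softmax model, scoring a target word t for a source word s as exp(sᵀWt) normalized over the target vocabulary, could look like this. It is illustrative only; the function names, plain gradient-descent loop and hyperparameters are assumptions, not the authors' implementation:

```python
import numpy as np

def train_projection(src_emb, tgt_emb, dictionary, lr=0.5, epochs=100, seed=0):
    """Fit W in p(t|s) ∝ exp(s^T W t) over a small word-to-word seed dictionary."""
    dim = len(next(iter(src_emb.values())))
    rng = np.random.default_rng(seed)
    W = rng.normal(scale=0.1, size=(dim, dim))
    words = list(tgt_emb)
    T = np.stack([tgt_emb[w] for w in words])   # |V_t| x d target embedding matrix
    idx = {w: k for k, w in enumerate(words)}
    for _ in range(epochs):
        for s_word, t_word in dictionary:
            s = src_emb[s_word]
            logits = T @ (W.T @ s)              # s^T W t for every target t
            p = np.exp(logits - logits.max())
            p /= p.sum()
            p[idx[t_word]] -= 1.0               # softmax cross-entropy gradient
            W -= lr * np.outer(s, p @ T)        # dL/dW = s (Σ_t (p_t - y_t) t)^T
    return W

def oov_translations(src_vec, W, tgt_emb, k=3):
    """Probabilistic top-k target translations for a (possibly OOV) source vector."""
    words = list(tgt_emb)
    T = np.stack([tgt_emb[w] for w in words])
    logits = T @ (W.T @ src_vec)
    p = np.exp(logits - logits.max())
    p /= p.sum()
    top = np.argsort(-p)[:k]
    return [(words[i], float(p[i])) for i in top]
```

Because the source side only needs a monolingual embedding, an out-of-vocabulary word still gets a ranked translation list as long as its embedding exists.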
@inProceedings{MadhyasthaEspana:2017,
author = {Pranava Swaroop Madhyastha and Cristina Espa{\~{n}}a{-}Bonet},
title = {Learning Bilingual Projections of Embeddings for Vocabulary Expansion in Machine Translation},
booktitle = {Proceedings of the 2nd Workshop on Representation Learning for NLP. ACL Workshop on Representation Learning for NLP (RepL4NLP-2017)},
pages = {139--145},
year = {2017},
month = {August},
address = {Vancouver, Canada},
publisher = {Association for Computational Linguistics},
language = {english},
}
Lump at SemEval-2017 Task 1: Towards an Interlingua Semantic Similarity
Cristina España-Bonet and Alberto Barrón-Cedeño
Proceedings of the 11th International Workshop on Semantic Evaluation (ACL Workshop SemEval-2017), pages 144-149, Vancouver, Canada, August 2017.
[
Abstract
PDF
BibTeX
arXiv
]
This paper describes the Lump team participation in SemEval 2017 Task 1 on Semantic Textual Similarity. Our supervised model relies on features which are multilingual or interlingual in nature. We include lexical similarities, cross-language explicit semantic analysis, internal representations of multilingual neural networks and interlingual word embeddings. Our representations allow us to use large datasets in language pairs with many instances to better classify instances in smaller language pairs, avoiding the necessity of translating into a single language. Hence we can deal with all the languages in the task: Arabic, English, Spanish, and Turkish.
@InProceedings{EspanaBarron:2017,
author = {{Espa{\~n}a-Bonet}, Cristina and {Barr\'on-Cede{\~n}o}, Alberto},
title = "{Lump at SemEval-2017 Task 1: Towards an Interlingua Semantic Similarity}",
booktitle = "{Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)}",
pages = {144--149},
year = {2017},
month = {August},
address = {Vancouver, Canada},
publisher = {Association for Computational Linguistics},
language = {english},
url = {http://www.aclweb.org/anthology/S17-2019}
}
Using Word Embeddings to Enforce Document-Level Lexical Consistency in Machine Translation
Eva Martínez Garcia, Carles Creus, Cristina España-Bonet, Lluís Màrquez
The 20th Annual Conference of the European Association for Machine Translation, Prague, Czech Republic. The Prague Bulletin of Mathematical Linguistics, Vol. 108, pages 85-96, June 2017.
[
Abstract
PDF
BibTeX
arXiv
]
We integrate new mechanisms in a document-level machine translation decoder to improve the lexical consistency of document translations. First, we develop a document-level feature designed to score the lexical consistency of a translation. This feature, which applies to words that have been translated into different forms within the document, uses word embeddings to measure the adequacy of each word translation given its context. Second, we extend the decoder with a new stochastic mechanism that, at translation time, allows us to introduce changes in the translation aimed at improving its lexical consistency. We evaluate our system on English-Spanish document translation, and we conduct automatic and manual assessments of its quality. The automatic evaluation metrics, applied mainly at sentence level, do not reflect significant variations. On the contrary, the manual evaluation shows that the system dealing with lexical consistency is preferred over both a standard sentence-level and a standard document-level phrase-based MT system.
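The consistency feature could be approximated along these lines. This is a hypothetical sketch, not the decoder feature itself; the data layout and names are assumptions. Only source words rendered in more than one target form are scored, each occurrence by the similarity of its chosen form to its context:

```python
import numpy as np
from collections import defaultdict

def cosine(u, v):
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

def lexical_consistency(alignments, embeddings):
    """alignments: (src_word, chosen_tgt_word, context_words) per translated token.
    Source words translated into a single form are skipped; each remaining
    occurrence contributes the cosine between its chosen target form and the
    centroid of its context-word embeddings. Returns the mean score."""
    forms = defaultdict(set)
    for src, tgt, _ in alignments:
        forms[src].add(tgt)
    score, n = 0.0, 0
    for src, tgt, ctx in alignments:
        if len(forms[src]) < 2 or tgt not in embeddings:
            continue
        ctx_vecs = [embeddings[w] for w in ctx if w in embeddings]
        if not ctx_vecs:
            continue
        score += cosine(embeddings[tgt], np.mean(ctx_vecs, axis=0))
        n += 1
    return score / n if n else 0.0
```

A decoder could then prefer hypotheses whose inconsistently translated words still fit their contexts well, which is the intuition behind the feature described above.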
@Article{eamt_martinezetal:2017,
author = {{Mart\'inez}, Eva and {Creus}, Carles and {Espa{\~n}a-Bonet}, Cristina and {M\`arquez}, Llu\'{i}s},
title = {Using Word Embeddings to Enforce Document-Level Lexical Consistency in Machine Translation},
journal = {The 20th Annual Conference of the European Association for Machine Translation.
The Prague Bulletin of Mathematical Linguistics},
pages = {85--96},
volume = {108},
year = {2017},
month = {June},
language = {english}
}
2016
Automatic Speech Recognition with Deep Neural Networks for Impaired Speech
Cristina España-Bonet and José A. R. Fonollosa
Chapter in Advances in Speech and Language Technologies for Iberian Languages, part of the series Lecture Notes in Artificial Intelligence. In A. Abad et al. (Eds.). IberSPEECH 2016, LNAI 10077, Chapter 10, pages 97-107, October 2016.
[
Abstract
PDF
BibTeX
arXiv
]
Automatic Speech Recognition has reached almost human performance in some controlled scenarios. However, recognition of impaired speech is a difficult task for two main reasons: data is (i) scarce and (ii) heterogeneous. In this work we train different architectures on a database of dysarthric speech. A comparison between architectures shows that, even with a small database, hybrid DNN-HMM models outperform classical GMM-HMM models according to word error rate measures. A DNN improves the recognition word error rate by 13% for subjects with dysarthria with respect to the best classical architecture. This improvement is higher than the one given by other deep neural networks such as CNNs, TDNNs and LSTMs. All the experiments have been done with the Kaldi toolkit for speech recognition, for which we have adapted several recipes to deal with dysarthric speech and work on the TORGO database. These recipes are publicly available.
@inBook{EspanaFonollosa:2016,
author = {Espa\~{n}a-Bonet, Cristina and Fonollosa, Jos\'{e} A. R.},
title = {Automatic Speech Recognition with Deep Neural Networks for Impaired Speech},
booktitle = {Advances in Speech and Language Technologies for Iberian Languages},
series = {Lecture Notes in Artificial Intelligence},
month = {October},
year = {2016},
publisher = {Springer International Publishing AG},
editor = {Abad, A. and Ortega, A. and Teixeira, A.J.d.S. and Garcia Mateo, C. and Mart\'{i}nez Hinarejos, C.D.
and Perdig\~{a}o, F. and Batista, F. and Mamede, N.},
pages = {97--107},
chapter = 10,
isbn = {978-3-319-49169-1},
doi = {10.1007/978-3-319-49169-1_10},
url = {http://www.springer.com/us/book/9783319491684}
}
The TALP-UPC Spanish-English WMT Biomedical Task: Bilingual Embeddings and Char-based Neural Language Model Rescoring in a Phrase-based System
Marta Ruiz Costa-jussà, Cristina España-Bonet, Pranava Madhyastha, Carlos Escolano and José A. R. Fonollosa
Proceedings of the First Conference on Machine Translation (WMT 2016), pages 463-468, Berlin, Germany, August 2016.
[
Abstract
PDF
BibTeX
arXiv
]
This paper describes the TALP-UPC system in the Spanish-English WMT 2016 biomedical shared task. Our system is a standard phrase-based system enhanced with vocabulary expansion using bilingual word embeddings and a character-based neural language model with rescoring. The former focuses on resolving out-of-vocabulary words, while the latter enhances the fluency of the system. The two modules progressively improve the final translation as measured by a combination of several lexical metrics.
@InProceedings{costajussaEtal:WMT:2016,
author = {Costa-juss\`{a}, Marta R. and Espa\~{n}a-Bonet, Cristina and Madhyastha, Pranava and
Escolano, Carlos and Fonollosa, Jos\'{e} A. R.},
title = {The TALP--UPC Spanish--English WMT Biomedical Task: Bilingual Embeddings and
Char-based Neural Language Model Rescoring in a Phrase-based System},
booktitle = {Proceedings of the First Conference on Machine Translation},
month = {August},
year = {2016},
address = {Berlin, Germany},
publisher = {Association for Computational Linguistics},
pages = {463--468},
url = {http://www.aclweb.org/anthology/W/W16/W16-2336}
}
Resolving Out-of-Vocabulary Words with Bilingual Embeddings in Machine Translation
Pranava Madhyastha and Cristina España-Bonet
CoRR abs/1608.01910, August 2016.
[
Abstract
PDF
BibTeX
arXiv
]
Out-of-vocabulary words account for a large proportion of errors in machine translation systems, especially when the system is used on a different domain than the one where it was trained. In order to alleviate the problem, we propose to use a log-bilinear softmax-based model for vocabulary expansion, such that given an out-of-vocabulary source word, the model generates a probabilistic list of possible translations in the target language. Our model uses only word embeddings trained on significantly large unlabelled monolingual corpora and trains over a fairly small, word-to-word bilingual dictionary. We input this probabilistic list into a standard phrase-based statistical machine translation system and obtain consistent improvements in translation quality on the English-Spanish language pair. Especially, we get an improvement of 3.9 BLEU points when tested over an out-of-domain test set.
@article{MadhyasthaEspana:2016,
author = {Pranava Swaroop Madhyastha and Cristina Espa{\~{n}}a{-}Bonet},
title = {Resolving Out-of-Vocabulary Words with Bilingual Embeddings in Machine Translation},
journal = {CoRR},
volume = {abs/1608.01910},
year = {2016},
url = {http://arxiv.org/abs/1608.01910}
}
Hybrid Machine Translation Overview
Cristina España-Bonet, Marta Ruiz Costa-jussà
Chapter in Hybrid Approaches to Machine Translation, part of the series Theory and Applications of Natural Language Processing, pages 1-24, Springer, 2016.
[
Abstract
PDF
BibTeX
]
This survey chapter provides an overview of the recent research in hybrid Machine Translation (MT). The main MT paradigms are sketched and their integration at different levels of depth is described, starting with system combination techniques and followed by integration strategies led by rule-based and statistical systems. System combination does not involve any hybrid architecture since it combines translation outputs. It can be done with different granularities that include sentence, sub-sentential and graph levels. When considering a deeper integration, architectures guided by the rule-based approach introduce statistics to enrich resources, modules or the backbone of the system. Architectures guided by the statistical approach include rules in pre-/post-processing or at an inner level, which means including rules or dictionaries in the core system. This chapter overviewing hybrid MT puts in context, introduces, and motivates the subsequent chapters that constitute this book.
@Inbook{EspanaBonetEtal:2016,
author={Espa{\~{n}}a-Bonet, Cristina and Costa-juss{\`a}, Marta R.},
editor={Costa-juss{\`a}, Marta R. and Rapp, Reinhard and Lambert, Patrik
and Eberle, Kurt and Banchs, Rafael E. and Babych, Bogdan},
title="{Hybrid Machine Translation Overview}",
bookTitle="{Hybrid Approaches to Machine Translation}",
year={2016},
publisher={Springer International Publishing},
pages={1--24},
isbn={978-3-319-21311-8},
doi={10.1007/978-3-319-21311-8_1},
url={http://dx.doi.org/10.1007/978-3-319-21311-8_1}}
TweetMT: A Parallel Microblog Corpus
Iñaki San Vicente, Iñaki Alegria, Cristina España-Bonet, Pablo Gamallo, Hugo Gonçalo Oliveira, Eva Martínez Garcia, Antonio Toral, Arkaitz Zubiaga and Nora Aranberri
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), pages 2936-2941, Portoroz, Slovenia, May 2016.
[
Abstract
PDF
BibTeX
arXiv
]
We introduce TweetMT, a parallel corpus of tweets in four language pairs that combine five languages (Spanish from/to Basque, Catalan, Galician and Portuguese), all of which have an official status in the Iberian Peninsula. The corpus has been created by combining automatic collection and crowdsourcing approaches, and it is publicly available. It is intended for the development and testing of microtext machine translation systems. In this paper we describe the methodology followed to build the corpus, and present the results of the shared task in which it was tested.
@InProceedings{LRECSanVicente:2016,
author = {{San Vicente}, I\~naki and {Alegr\'ia}, I{\~n}aki and {Espa{\~n}a-Bonet}, Cristina and {Gamallo}, Pablo and
{Gon\c{c}alo Oliveira}, Hugo and {Mart\'inez Garc\'ia}, Eva and
{Toral}, Antonio and {Zubiaga}, Arkaitz and {Aranberri}, Nora},
title = {TweetMT: A Parallel Microblog Corpus},
booktitle = {Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)},
pages = {2936--2941},
year = {2016},
month = {May},
date = {23--28},
location = {Portoroz, Slovenia},
editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Marko Grobelnik and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
publisher = {European Language Resources Association (ELRA)},
isbn = {978-2-9517408-9-1},
language = {english}
}
Resolving Out-of-Vocabulary Words with Bilingual Word Embeddings in Machine Translation
Cristina España-Bonet
Invited talk at Saarland University, DFKI, Saarbrücken, April 29th, 2016.
[
Abstract
Slides
]
Data-driven machine translation systems are able to translate words that have been seen in the training parallel corpora. However, translating out-of-vocabulary words (OOV) is still a major challenge, even for the best performing systems. In this talk, I will show a method that takes advantage of distributional semantic representations of words —previously estimated on large monolingual corpora—, to obtain a probabilistic distribution of translation options for a given OOV. The monolingual embeddings are projected into a bilingual low-dimensional space by learning a log-linear model over a small parallel dictionary. Within the translation setting, the probabilistic distribution interacts with other components (e.g., a language model), which allows for selecting the best translation option among all the possibilities, even if a word has not been seen in the parallel corpus. Our model achieves significant improvements in terms of translation quality, especially for out-of-domain data, in which out-of-vocabulary content words are expected. I will show here how and when our method boosts the performance of a translation system, and present our recent participation with this approach in the Biomedical Translation Task in WMT16.
2015
WikiParable - Data Categorisation Platform (Version 1.0)
Cristina España-Bonet
Technical Report, Universitat Politècnica de Catalunya, Computer Science Department, November 2015.
[
Abstract
PDF
BibTeX
arXiv
]
This document describes WikiParable, an on-line platform designed for data categorisation. Its purpose is twofold and the tool can be used both to annotate data and to evaluate automatic categorisations. As a main use case and aim of the implementation, the interface has been used within the TACARDI project to annotate Wikipedia articles in different domains and languages.
@TechReport{WikiParableV1.0,
author = {{Espa{\~n}a-Bonet}, Cristina},
title = {WikiParable -- Data Categorisation Platform (Version 1.0) },
year = {2015},
month = {November},
date = {16},
institution = {Universitat Polit\`ecnica de Catalunya, Computer Science Department},
url = {http://hdl.handle.net/2117/79539},
language = {english}
}
Journey through Natural Language Processing
Cristina España-Bonet
Poster at Google NLP PhD Summit 2015, Zurich, Switzerland, September 2015.
[
Abstract
PDF
BibTeX
arXiv
]
Summary of some of the work I have been involved in over the last three years.
@Misc{CEBjourney,
author = {{Espa{\~n}a-Bonet}, Cristina},
title = {Journey through Natural Language Processing},
howpublished = {Poster},
year = {2015},
month = {September},
date = {23},
address = {Zurich, Switzerland},
language = {english}
}
Overview of TweetMT: A Shared Task on Machine Translation of Tweets at SEPLN 2015
Iñaki Alegria, Nora Aranberri, Cristina España-Bonet, Pablo Gamallo, Hugo Gonçalo Oliveira, Eva Martínez Garcia, Iñaki San Vicente, Antonio Toral, Arkaitz Zubiaga
Proceedings of the Tweet Translation Workshop, at "XXXI Congreso de la Sociedad Española de Procesamiento de lenguaje natural" and CEUR Workshop Proceedings, volume 1445, pages 8-19, Alacant, Spain, September 2015.
[
Abstract
PDF
BibTeX
arXiv
Slides
]
This article presents an overview of the shared task that took place as part of the TweetMT workshop held at SEPLN 2015. The task consisted of translating collections of tweets from and to several languages. The article outlines the data collection and annotation process, the development and evaluation of the shared task, as well as the results achieved by the participants.
@InProceedings{tweetMT_overview,
author = {{Alegr\'ia}, I{\~n}aki and {Aranberri}, Nora and {Espa{\~n}a-Bonet}, Cristina and {Gamallo}, Pablo and
{Gon\c{c}alo Oliveira}, Hugo and {Mart\'inez Garc\'ia}, Eva and {San Vicente}, I\~naki and
{Toral}, Antonio and {Zubiaga}, Arkaitz},
title = {Overview of TweetMT: A Shared Task on Machine Translation of Tweets at SEPLN 2015},
booktitle = {Proceedings of the Tweet Translation Workshop, at "XXXI Congreso de la Sociedad Espa{\~n}ola de
Procesamiento de lenguaje natural" and CEUR Workshop Proceedings.},
pages = {8--19},
volume = {1445},
year = {2015},
month = {September},
date = {15},
address = {Alacant, Spain},
language = {english}
}
The UPC TweetMT participation: Translating Formal Tweets using Context Information
Eva Martínez Garcia, Cristina España-Bonet, Lluís Màrquez
Proceedings of the Tweet Translation Workshop, at "XXXI Congreso de la Sociedad Española de Procesamiento de lenguaje natural" and CEUR Workshop Proceedings, volume 1445, pages 25-32, Alacant, Spain, September 2015.
[
Abstract
PDF
BibTeX
arXiv
]
In this paper, we describe the UPC systems participating in the TweetMT shared task. We developed two main systems that were applied to the Spanish–Catalan language pair: a state-of-the-art phrase-based statistical machine translation system and a context-aware system. In the second approach, we define "context" for a tweet as the tweets produced by the same user on the same day, and we study the impact of this kind of information on the final translations when using a document-level decoder. A variant of this approach also considers semantic information from bilingual embeddings.
@InProceedings{tweetMT_martinezetal15,
author = {{Mart\'inez}, Eva and {Espa{\~n}a-Bonet}, Cristina and {M\`arquez}, Llu\'{i}s},
title = {The UPC TweetMT participation: Translating Formal Tweets using Context Information},
booktitle = {Proceedings of the Tweet Translation Workshop, at "XXXI Congreso de la Sociedad Espa{\~n}ola de
Procesamiento de lenguaje natural" and CEUR Workshop Proceedings.},
pages = {25--32},
volume = {1445},
year = {2015},
month = {September},
date = {15},
address = {Alacant, Spain},
language = {english}
}
A Factory of Comparable Corpora from Wikipedia
Alberto Barrón-Cedeño, Cristina España-Bonet, Josu Boldoba, Lluís Màrquez
Proceedings of the 8th Workshop on Building and Using Comparable Corpora (BUCC), pages 3-13, Beijing, China, July 2015.
[
Abstract
PDF
BibTeX
arXiv
]
Multiple approaches to grabbing comparable data from the Web have been developed to date. Nevertheless, coming up with a high-quality comparable corpus on a specific topic is not straightforward. We present a model for the automatic extraction of comparable texts in multiple languages and on specific topics from Wikipedia. In order to prove the value of the model, we automatically extract parallel sentences from the comparable collections and use them to train statistical machine translation engines for specific domains. Our experiments on the English-Spanish pair in the domains of Computer Science, Science, and Sports show that our in-domain translator performs significantly better than a generic one when translating in-domain Wikipedia articles. Moreover, we show that these corpora can help when translating out-of-domain texts.
@InProceedings{Barronetal:2015,
author = {{Barr\'on-Cede{\~n}o}, Alberto and {Espa{\~n}a-Bonet}, Cristina and
{Boldoba}, Josu and {M\`arquez}, Llu\'{i}s},
title = "{A Factory of Comparable Corpora from Wikipedia}",
booktitle = "{Proceedings of the 8th Workshop on Building and Using Comparable Corpora (BUCC)}",
pages = {3--13},
year = {2015},
month = {July},
date = {30},
address = {Beijing, China},
language = {english},
url = {http://www.aclweb.org/anthology/W15-3402}
}
Document-Level Machine Translation with Word Vector Models
Eva Martínez Garcia, Cristina España-Bonet, Lluís Màrquez
Proceedings of the 18th Annual Conference of the European Association for Machine Translation (EAMT), pages 59-66, Antalya, Turkey, May 2015.
[
Abstract
PDF
BibTeX
arXiv
]
In this paper we apply distributional semantic information to document-level machine translation. We train monolingual and bilingual word vector models on large corpora and we evaluate them first in a cross-lingual lexical substitution task and then on the final translation task. For translation, we incorporate the semantic information in a statistical document-level decoder (Docent), by enforcing translation choices that are semantically similar to the context. As expected, the bilingual word vector models are more appropriate for the purpose of translation. The final document-level translator incorporating the semantic model outperforms the basic Docent (without semantics) and also performs slightly better than a standard sentence-level SMT system in terms of ULC (the average of a set of standard automatic evaluation metrics for MT). Finally, we also present some manual analysis of the translations of some concrete documents.
@InProceedings{eamt15_martinezetal15,
author = {{Mart\'inez}, E. and {Espa{\~n}a-Bonet}, C. and {M\`arquez}, L.},
title = {Document-Level Machine Translation with Word Vector Models},
booktitle = {Proceedings of the 18th Annual Conference of the European Association for Machine Translation (EAMT)},
pages = {59--66},
year = {2015},
month = {May},
date = {13},
address = {Antalya, Turkey},
language = {english}
}
A broad stroke on Machine Translation Evaluation
Cristina España-Bonet
Invited talk at the Faculty of Informatics (UPV/EHU), Donosti, March 13, 2015.
[
Abstract
Slides
]
This broad stroke on Machine Translation Evaluation overviews current approaches and methodologies. MT evaluation is put in context and we argue why it must be considered a delicate topic. The most common manual and automatic evaluation measures are described and new approaches sketched. Finally, several tools for MT evaluation are introduced, paying special attention to the Asiya Toolkit.
2014
Word's Vector Representations meet Machine Translation
Eva Martínez Garcia, Cristina España-Bonet, Jörg Tiedemann, Lluís Màrquez
Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation, pages 132-134, October 25, 2014, Doha, Qatar.
[
Abstract
PDF
BibTeX
arXiv
]
Distributed vector representations of words are useful in various NLP tasks.
We briefly review the CBOW approach and propose a bilingual application of
this architecture with the aim to improve consistency and coherence of Machine
Translation. The primary goal of the bilingual extension is to handle ambiguous
words for which the different senses are conflated in the monolingual setup.
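One common way to realise such a bilingual extension, sketched here with hypothetical function and variable names (not taken from the paper), is to conflate word-aligned source-target pairs into composite `src|tgt` tokens before CBOW training, so that an ambiguous source word yields one vector per translation:

```python
def bilingual_tokens(src_tokens, tgt_tokens, alignment):
    """Conflate word-aligned pairs into composite src|tgt tokens.

    Ambiguous source words receive a distinct composite token per
    translation, so a CBOW model trained on these sequences learns
    one vector per sense instead of conflating them.
    """
    tgt = dict(alignment)  # source position -> target position
    out = []
    for i, w in enumerate(src_tokens):
        if i in tgt:
            out.append(f"{w}|{tgt_tokens[tgt[i]]}")
        else:
            out.append(w)  # unaligned words stay monolingual
    return out

# 'bank' aligns to different Spanish words depending on its sense:
s1 = bilingual_tokens(["the", "bank", "closed"],
                      ["el", "banco", "cerró"], [(0, 0), (1, 1), (2, 2)])
s2 = bilingual_tokens(["the", "bank", "eroded"],
                      ["la", "orilla", "erosionó"], [(0, 0), (1, 1), (2, 2)])
```

Training an off-the-shelf CBOW model on sentences preprocessed this way then gives sense-separated vectors for words like "bank" above.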
@InProceedings{sst8_martinezetal14,
author = {{Mart\'inez}, E. and {Espa{\~n}a-Bonet}, C. and {Tiedemann}, J. and {M\`arquez}, L.},
title = {Word's Vector Representations meet Machine Translation},
booktitle = {Proceedings of the eighth Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-8)},
pages = {132--134},
year = {2014},
month = {October}
date = {25},
address = {Doha, Qatar},
language = {english}
}
A hybrid machine translation architecture guided by syntax
Gorka Labaka, Cristina España-Bonet, Lluís Màrquez, Kepa Sarasola
Machine Translation Journal, Vol. 28, Issue 2, pages 91-125, October, 2014.
[
Abstract
PDF
BibTeX
arXiv
]
This article presents a hybrid architecture which combines rule-based machine translation (RBMT) with phrase-based statistical machine translation (SMT). The hybrid translation system is guided by the rule-based engine. Before the transfer step, a varied set of partial candidate translations is calculated with the SMT system and used to enrich the tree-based representation with more translation alternatives. The final translation is constructed by choosing the most probable combination among the available fragments using monotone statistical decoding following the order provided by the rule-based system. We apply the hybrid model to a pair of distantly related languages, Spanish and Basque, and perform extensive experimentation on two different corpora. According to our empirical evaluation, the hybrid approach outperforms the best individual system across a varied set of automatic translation evaluation metrics. Following some output analysis to better understand the behaviour of the hybrid system, we explore the possibility of adding alternative parse trees and extra features to the hybrid decoder. Finally, we present a twofold manual evaluation of the translation systems studied in this paper, consisting of (i) a pairwise output comparison and (ii) an individual task-oriented evaluation using HTER. Interestingly, the manual evaluation shows some contradictory results with respect to the automatic evaluation; humans tend to prefer the translations from the RBMT system over the statistical and hybrid translations.
@article{labakaetal14,
author = {Labaka, Gorka and Espa{\~n}a-Bonet, Cristina and M\`arquez, Llu\'is and Sarasola, Kepa},
title = {A hybrid machine translation architecture guided by syntax},
journal = {Machine Translation},
doi = {10.1007/s10590-014-9153-0},
volume = 28,
number = 2,
pages = {91--125},
year = {2014},
month = {October},
issn = {0922-6567},
url = {http://dx.doi.org/10.1007/s10590-014-9153-0},
publisher = {Springer Netherlands}
}
Document-Level Machine Translation as a Re-translation Process
Eva Martínez Garcia, Cristina España-Bonet, Lluís Màrquez
Procesamiento del Lenguaje Natural, 53, 103-110. September, 2014
[
Abstract
PDF
BibTeX
arXiv
]
Most of the current Machine Translation systems are designed to translate a document sentence by sentence, ignoring discourse information and producing incoherencies in the final translations. In this paper we present some document-level-oriented post-processes to improve translations' coherence and consistency. Incoherencies are detected and new partial translations are proposed. The work focuses on studying two phenomena: words with inconsistent translations throughout a text and also, gender and number agreement among words. Since we deal with specific phenomena, an automatic evaluation does not reflect significant variations in the translations. However, improvements are observed through a manual evaluation.
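The first phenomenon studied, inconsistent translations of the same word across a document, can be sketched with a minimal consistency check. This assumes word-aligned sentence pairs are available; all names here are illustrative, not from the paper:

```python
from collections import defaultdict

def inconsistent_translations(doc_links):
    """Flag source words translated differently within one document.

    doc_links: (source_word, translated_word) alignment links
    gathered from every sentence of the document.
    Returns {source_word: set_of_translations} for words with more
    than one distinct translation: candidates for re-translation.
    """
    seen = defaultdict(set)
    for src, tgt in doc_links:
        seen[src.lower()].add(tgt.lower())
    return {s: t for s, t in seen.items() if len(t) > 1}

links = [("window", "ventana"), ("file", "archivo"),
         ("window", "ventana"), ("file", "fichero")]
flags = inconsistent_translations(links)
# 'file' was rendered both as 'archivo' and 'fichero'
```

In the post-process described above, flagged words would then trigger new partial translations that enforce a single rendering.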
@article{martinez14,
author = {{Mart\'inez}, E. and {Espa{\~n}a-Bonet}, C. and {M\`arquez}, L.},
title = {Document-Level Machine Translation as a Re-translation Process},
journal = {Procesamiento del Lenguaje Natural},
volume = 53,
pages = {103--110},
year = {2014},
month = {September}
}
Statistical Machine Translation and Automatic Evaluation
Cristina España-Bonet and Meritxell Gonzàlez
Tutorial at the 9th edition of the Language Resources and Evaluation Conference, Reykjavik, May 2014.
[
Abstract
Slides Part I
Slides Part II
BibTeX
]
The tutorial is divided into two main parts. The main objective of the first part is to get to know the fundamentals behind the three modules of a statistical system: the language model, the translation model and the decoding or search for the best translation.
The presentation, although theoretical, is focused on understanding how standard software such as SRILM [Stolcke, 2002] and Moses [Koehn et al., 2007] works and the logic behind it, so that it is easy to understand the extensions and modifications available.
The second part of the tutorial is devoted to how these systems, and machine translation systems in general, are evaluated automatically. Machine translation evaluation is a delicate topic. Here we put the evaluation into context, describe in detail the standard metrics and overview other existing possibilities and paradigms such as linguistically motivated measures and confidence estimation.
Both parts end with a video showing how to build a phrase-based statistical machine translation system in practice (Part I) and how to evaluate translation systems in depth (Part II).
Webpage: http://slifer.lsi.upc.edu/lrec-mttutorial
@Unpublished{tutorialLREC14,
author = {{Espa{\~n}a-Bonet}, C. and {Gonz\`alez}, M.},
title = {Statistical Machine Translation and Automatic Evaluation},
note = {Tutorial at the Ninth International Conference on Language Resources and Evaluation (LREC'14)},
url = {http://slifer.lsi.upc.edu/lrec-mttutorial},
year = {2014},
month = {may},
date = {26--31},
address = {Reykjavik, Iceland},
language = {english}}
2013
Wikicardi: Hacia la extracción de oraciones paralelas de Wikipedia
Josu Boldoba, Alberto Barrón-Cedeño, Cristina España-Bonet
Research Report LSI-14-3-R
[
Abstract
PDF
BibTeX
arXiv
]
One of the goals of the Tacardi project (TIN2012-38523-C02-00) is to extract parallel sentences from comparable corpora in order to enrich and adapt machine translation systems. In this research we use a subset of Wikipedia as a comparable corpus. This report describes our progress on the extraction of parallel fragments from Wikipedia. First, we discuss how we have defined the three domains of interest (science, computer science and sport) within the encyclopedia, and how we have extracted the texts and other data needed to characterise the articles in the different languages. We then briefly discuss the models we will use to identify parallel sentences and give only a sample of preliminary results. The data obtained so far suggest that it will be possible to extract parallel sentences for the domains of interest in the short term, although we do not yet have an estimate of their volume.
@TechReport{boldobaLSI143R,
author = {{Boldoba}, J. and {Barr\'on-Cede{\~n}o}, A. and {Espa{\~n}a-Bonet}, C.},
title = {Wikicardi: Hacia la extracci\'on de oraciones paralelas de Wikipedia},
institution = {LSI, UPC},
year = {2014},
month = {January},
type = {Research Report},
number = {LSI-14-3-R}
}
Experiments on Document Level Machine Translation
Eva Martínez Garcia, Lluís Màrquez, Cristina España-Bonet
Research Report LSI-14-11-R
[
Abstract
PDF
BibTeX
arXiv
]
@TechReport{martinezLSI1411R,
author = {{Mart\'inez}, E. and {M\`arquez}, L. and {Espa{\~n}a-Bonet}, C.},
title = {Experiments on Document Level Machine Translation},
institution = {LSI, UPC},
year = {2014},
month = {January},
type = {Research Report},
number = {LSI-14-11-R}
}
MT Techniques in a Retrieval System of Semantically Enriched Patents
Meritxell Gonzàlez, Maria Mateva, Ramona Enache, Cristina España-Bonet, Lluís Màrquez, Borislav Popov, Aarne Ranta
Proceedings of the Machine Translation Summit XIV, Nice, France, September 2-6, 2013.
[
Abstract
PDF
BibTeX
arXiv
]
This paper focuses on how automatic translation techniques integrated in a
patent retrieval system increase its capabilities and make possible extended
features and functionalities. We describe 1) a novel methodology for natural language
to SPARQL translation based on a grammar–ontology interoperability automation
and a query grammar for the patents domain; 2) a devised strategy for statistical-based
translation of patents that allows semantic annotations to be transferred to the target
language; 3) a built-in knowledge representation infrastructure that uses multilingual
semantic annotations; and 4) an online application that offers a multilingual
search interface over structural knowledge databases (domain ontologies) and multilingual
documents (biomedical patents) that have been automatically translated.
@InProceedings{MTSpropotype,
author = {{Gonz\`alez}, M. and {Mateva}, M. and {Enache}, R. and {Espa{\~n}a-Bonet}, C. and {M\`arquez}, L. and {Popov}, B. and {Ranta}, A.},
title = {MT Techniques in a Retrieval System of Semantically Enriched Patents},
booktitle = {Proceedings of the Machine Translation Summit XIV},
pages = {-},
year = {2013},
month = {sep},
date = {2},
address = {Nice, France},
language = {english}
}
2012
Deep evaluation of hybrid architectures: Use of different metrics in MERT weight optimization
Cristina España-Bonet, Gorka Labaka, Arantza Díaz de Ilarraza, Lluís Màrquez, Kepa Sarasola
Proceedings of the Free/Open-Source Rule-Based Machine Translation Workshop, Gothenburg, 14-15 June 2012.
[
Abstract
PDF
Slides
BibTeX
arXiv
]
The process of developing hybrid MT systems is usually guided by
an evaluation method used to compare different combinations of basic subsystems. This work presents a deep
evaluation experiment of a hybrid architecture, which combines rule-based and statistical
translation approaches. Differences between the results obtained from automatic and human
evaluations corroborate the inappropriateness of pure lexical automatic evaluation metrics
to compare the outputs of systems that use very different translation approaches. An examination
of sentences with controversial results suggested that linguistic well-formedness
should be considered in the evaluation of output translations. Following this idea, we have
experimented with a new simple automatic evaluation metric, which combines lexical and
PoS information. This measure showed higher agreement with human assessments than
BLEU in a previous study (Labaka et al., 2011). In this paper we have extended its usage throughout
the system development cycle, focusing on its ability to improve parameter optimization.
Results are not totally conclusive. Manual evaluation reflects a slight improvement,
compared to BLEU, when using the proposed measure in system optimization. However,
the improvement is too small to draw any clear conclusion. We believe that we should
first focus on integrating more linguistically representative features in the development of the
hybrid system, and then go deeper into the development of automatic evaluation metrics.
@InProceedings{SMatxinTeval2,
author = {{Espa{\~n}a-Bonet}, C. and {Labaka}, G. and {D\'iaz de Ilarraza}, A. and {M\`arquez}, L.
and {Sarasola}, K.},
title = {Deep evaluation of hybrid architectures: Use of different metrics in MERT weight optimization},
booktitle = {Proceedings of the Free/Open-Source Rule-Based Machine Translation Workshop},
pages = {65--76},
year = {2012},
month = {jun},
date = {14--15},
address = {Gothenburg},
language = {english}
}
A Hybrid System for Patent Translation
Ramona Enache, Cristina España-Bonet, Aarne Ranta, Lluís Màrquez
Proceedings of the 16th Annual Conference of the European Association for Machine Translation (EAMT12), Trento, Italy, May 28-30, 2012.
[
Abstract
PDF
BibTeX
arXiv
]
This work presents an HMT system for
patent translation. The system exploits the
high coverage of SMT and the high precision of an RBMT system based on GF to
deal with specific issues of the language.
The translator is specifically developed to
translate patents and it is evaluated on the
English-French language pair. Although
the number of issues tackled by the grammar is not yet very large, both
manual and automatic evaluations consistently show a preference for the hybrid
system over the two individual translators.
@InProceedings{enacheEtal12,
author = {{Enache}, R. and {Espa{\~n}a-Bonet}, C. and {Ranta}, A. and {M\`arquez}, L.},
title = {A Hybrid System for Patent Translation},
booktitle = {Proceedings of the 16th Annual Conference of the European Association for Machine Translation (EAMT12)},
pages = {269--276},
year = {2012},
month = {may},
date = {28--30},
address = {Trento, Italy},
language = {english}
}
Context-Aware Machine Translation for Software Localization
Víctor Muntés, Patricia Paladini, Cristina España-Bonet, Lluís Màrquez
Proceedings of the 16th Annual Conference of the European Association for Machine Translation (EAMT12), Trento, Italy, May 28-30, 2012.
[
Abstract
PDF
BibTeX
arXiv
]
Software localization requires translating
short text strings appearing in user interfaces (UI) into several languages. These
strings are usually unrelated to the other
strings in the UI. Due to the lack of semantic context, many ambiguity problems
cannot be solved during translation. However, UIs are composed of several visual
components to which text strings are associated. Although this association might
be very valuable for word disambiguation,
it has not been exploited. In this paper,
we present the problem of lack of context awareness for UI localization,
providing real examples and identifying the main
research challenges.
@InProceedings{muntesEtal12,
author = {{Munt\'es}, V. and {Paladini}, P. and {Espa{\~n}a-Bonet}, C. and {M\`arquez}, L.},
title = {Context-Aware Machine Translation for Software Localization},
booktitle = {Proceedings of the 16th Annual Conference of the European Association for Machine Translation (EAMT12)},
pages = {77--80},
year = {2012},
month = {may},
date = {28--30},
address = {Trento, Italy},
language = {english}
}
Full Machine Translation for Factoid Question Answering
Cristina España-Bonet, Pere R. Comas
Proceedings of the EACL Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT), Avignon, France, April 23, 2012.
[
Abstract
PDF
Slides
BibTeX
arXiv
]
In this paper we present an SMT-based approach to Question Answering (QA). QA
is the task of extracting exact answers in
response to natural language questions. In
our approach, the answer is a translation of
the question obtained with an SMT system.
We use the n-best translations of a given
question to find similar sentences in the
document collection that contain the real
answer. Although it is not the first time that
SMT inspires a QA system, it is the first
approach that uses a full Machine Translation system for generating answers. Our
approach is validated with the datasets of the
TREC QA evaluation.
@InProceedings{espanaComas12,
author = {{Espa{\~n}a-Bonet}, C. and {Comas}, P.R.},
title = {Full Machine Translation for Factoid Question Answering},
booktitle = {Proceedings of the EACL Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT)},
pages = {20--29},
year = {2012},
month = {apr},
date = {23},
address = {Avignon, France},
language = {english}
}
The Patents Retrieval Prototype in the MOLTO project
Milen Chechev, Meritxell Gonzàlez, Lluís Màrquez, Cristina España-Bonet
Proceedings of the World Wide Web 2012, Lyon, France, April 16, 2012.
[
Abstract
PDF
BibTeX
arXiv
]
This paper describes the patents retrieval prototype developed within the MOLTO project. The prototype aims to
provide a multilingual natural language interface for querying the content of patent documents. The developed system
is focused on the biomedical and pharmaceutical domain
and includes the translation of the patent claims and abstracts into English, French and German. Aiming at the
best retrieval results of the patent information and text
content, patent documents are preprocessed and semantically annotated. Then, the annotations are stored and
indexed in an OWLIM semantic repository, which contains a
patent specific ontology and others from the specific domain.
The prototype, accessible online at http://molto-patents.ontotext.com,
presents a multilingual natural language interface to query the retrieval system. In MOLTO, the
multilingualism of the queries is addressed by means of the GF
Tool, which provides an easy way to build and maintain
controlled language grammars for interlingual translation in
limited domains. The abstract representation obtained from
the GF is used to retrieve both the matched RDF instances
and the list of patents semantically related to the user's
search criteria. The online interface allows browsing the
retrieved patents and shows on the text the semantic annotations that explain why any particular patent
has matched the user's criteria.
@InProceedings{www12patents,
author = {{Chechev}, M. and {Gonz\`alez}, M. and {M\`arquez}, L. and {Espa{\~n}a-Bonet}, C.},
title = {The Patents Retrieval Prototype in the MOLTO project},
booktitle = {Proceedings of the World Wide Web 2012},
pages = {4--8},
year = {2012},
month = {apr},
date = {16},
address = {Lyon, France},
language = {english}
}
2011
Deep evaluation of hybrid architectures: simple metrics correlated with
human judgments
Gorka Labaka, Arantza Díaz de Ilarraza, Cristina España-Bonet, Lluís Màrquez, Kepa Sarasola
Proceedings of the International Workshop on Using Linguistic Information for Hybrid Machine Translation (LIHMT), Barcelona, November 18th, 2011.
[
Abstract
PDF
Slides
BibTeX
arXiv
]
The process of developing hybrid MT systems
is guided by the evaluation method used to compare different combinations of basic subsystems. This work presents a deep evaluation experiment of a hybrid architecture that tries to get the best of both worlds, rule-based and statistical. In a first evaluation, human assessments were used to compare only the single statistical system and the hybrid one; the rule-based system was not compared by hand because the results of automatic evaluation showed a clear disadvantage. However, a second and wider evaluation experiment surprisingly showed that, according to human evaluation, the best system was the rule-based one, which achieved the worst results under automatic evaluation. An examination of sentences with controversial results suggested that linguistic well-formedness of the output should be considered in evaluation. After experimenting with 6 possible metrics, we conclude that a simple arithmetic mean of BLEU and BLEU calculated on the parts of speech of words is clearly a more human-conformant metric than lexical metrics alone.
@InProceedings{SMatxinTeval,
author = {{Labaka}, G. and {D\'iaz de Ilarraza}, A. and {Espa{\~n}a-Bonet}, C. and {M\`arquez}, L.
and {Sarasola}, K.},
title = {Deep evaluation of hybrid architectures: simple metrics correlated with human judgments},
booktitle = {Proceedings of the International Workshop on Using Linguistic Information for
Hybrid Machine Translation},
pages = {50--57},
year = {2011},
month = {nov},
date = {18},
address = {Barcelona},
language = {english}
}
Descobrim l'Univers
Cristina España-Bonet
Invited talk at Tertúlies de Literatura Científica, UVic, Vic, October 25th 2011.
[
Abstract
Dossier 1
Dossier 2
Slides
Link video
arXiv
]
"Descobrim l'Univers" addresses three aspects of cosmology: the beginning of the Universe, some astrophysical objects and phenomena, and the expansion and dimensionality of the Universe. The talk focuses mainly on this last point. Starting from the explanations in the dossier provided beforehand, we move towards an understanding of our present Universe, a universe that, surprisingly, is expanding, growing ever larger, and doing so at an accelerating rate. The importance of this discovery is underlined by the fact that this year's Nobel Prize in Physics was awarded to three researchers leading the projects that announced it.
Additional information can be found on the workshop website http://tlc.uvic.cat/2011/10/28/activitat-25102011-dra-cristina-espana-upc/.
Patent translation within the MOLTO project
Cristina España-Bonet, Ramona Enache, Adam Slaski, Aarne Ranta,
Lluís Màrquez, Meritxell Gonzàlez
Proceedings of the 4th Workshop on Patent Translation, MT Summit XIII, Xiamen, China, September 23, 2011.
[
Abstract
PDF
Slides
BibTeX
arXiv
]
MOLTO is an FP7 European
project whose goal is to translate texts between multiple
languages in real time with high quality. Patent translation is a case study where
research focuses on simultaneously obtaining large coverage without losing quality
in the translation. This is achieved by hybridising a grammar-based multilingual
translation system, GF, with a specialised statistical machine translation system.
Moreover, both individual systems by themselves already represent a step forward in the
translation of patents in the biomedical domain, for which the systems have been trained.
@InProceedings{moltoPatents11,
author = {{Espa{\~n}a-Bonet}, C. and {Enache}, R. and {Slaski}, A. and {Ranta}, A.
and {M\`arquez}, L. and {Gonz\`alez}, M.},
title = {Patent translation within the MOLTO project},
booktitle = {Proceedings of the 4th Workshop on Patent Translation, MT Summit XIII},
pages = {70--78},
year = {2011},
month = {sep},
date = {23},
address = {Xiamen, China},
language = {english}
}
Hybrid Machine Translation Guided by a Rule-Based System
Cristina España-Bonet, Gorka Labaka, Arantza Díaz de Ilarraza, Lluís Màrquez, Kepa Sarasola
Proceedings of the 13th Machine Translation Summit, Xiamen, China, September 19-23, 2011.
[
Abstract
PDF
Slides
BibTeX
arXiv
]
This paper presents a machine translation architecture which hybridizes Matxin, a rule-based system, with regular phrase-based Statistical
Machine Translation. In short, the hybrid translation process is guided by the rule-based engine and,
before transference, a set of partial candidate translations provided by SMT subsystems is used to
enrich the tree-based representation. The final hybrid translation is created by choosing the most
probable combination among the available fragments with a statistical decoder in a monotonic way.
We have applied the hybrid model to a pair of distant languages, Spanish and Basque, and according
to our evaluation (both automatic and manual) the hybrid approach significantly outperforms the best SMT system on out-of-domain data.
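The monotone decoding step can be illustrated with a toy dynamic program: each slot, in the fixed order given by the rule-based engine, offers several candidate fragments, and the decoder picks the combination maximising fragment scores plus a bigram language model over fragment boundaries. All names, scores and data here are invented for illustration:

```python
def monotone_decode(slots, lm):
    """Pick one candidate fragment per slot, in the fixed order
    provided by the rule-based engine, maximising fragment scores
    plus a boundary bigram LM (a toy stand-in for a real decoder).

    slots: list of lists of (tokens, log_prob) candidates.
    lm: dict (word_a, word_b) -> log_prob; default is a penalty.
    """
    def lm_score(a, b):
        return lm.get((a, b), -5.0)

    # Viterbi-style search: state = candidate chosen for previous slot.
    best = [(lp, [toks]) for toks, lp in slots[0]]
    for cands in slots[1:]:
        nxt = []
        for toks, lp in cands:
            score, path = max(
                (s + lp + lm_score(p[-1][-1], toks[0]), p) for s, p in best)
            nxt.append((score, path + [toks]))
        best = nxt
    score, path = max(best)
    return [w for frag in path for w in frag], score

slots = [[(["the", "house"], -1.0), (["the", "home"], -1.5)],
         [(["is", "red"], -1.0)]]
lm = {("house", "is"): -0.5, ("home", "is"): -0.1}
words, score = monotone_decode(slots, lm)
# the better fragment score of "the house" outweighs the LM's
# slight preference for the "home is" boundary
```

The real system scores tree fragments with full translation-model features rather than a single log-probability, but the monotone combination search has this shape.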
@InProceedings{SMatxinT2,
author = {{Espa{\~n}a-Bonet}, C. and {Labaka}, G. and {D\'iaz de Ilarraza}, A. and {M\`arquez}, L.
and {Sarasola}, K.},
title = {Hybrid Machine Translation Guided by a Rule-Based System},
booktitle = {Proceedings of the 13th Machine Translation Summit},
pages = {554--561},
year = {2011},
month = {sep},
date = {19-23},
address = {Xiamen, China},
language = {english}
}
Introduction to SMT and its standard tools
Cristina España-Bonet
GF Summer School, Barcelona, August 2011.
[
Abstract
Slides
]
This tutorial is intended to provide an
introduction to Statistical Machine Translation. The statistical paradigm is one of the
predominant ones within machine translation. This is possibly due to the
simplicity of building a basic system with free software, the large
community behind it and, of course, the good results that it achieves.
The main objective of the session is to get to know the fundamentals
behind the three modules of a statistical system: the language model,
the translation model and the decoding or search for the best
translation. The presentation, although theoretical, is focused on
understanding how software such as SRILM and Moses works and the
logic behind it, so that it is easy to understand the extensions and modifications available.
We also devote a small portion of time to see how these systems, and
machine translation systems in general, are evaluated automatically.
Machine translation evaluation is a delicate topic. Here we will put
the evaluation into context, describe in detail the standard metrics
and overview the existing possibilities.
Finally, in a second part, the standard software will be introduced
and, if there is time, a toy SMT system will be built. Otherwise the
main steps for building it will be given.
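The fundamentals covered in the session, choosing the translation e that maximises log P(f|e) + log P(e), can be sketched with a toy add-one-smoothed bigram language model. The data and names below are hypothetical, not from the tutorial materials:

```python
import math
from collections import Counter

def bigram_lm(corpus):
    """Add-one-smoothed bigram language model over a toy corpus."""
    big, uni, vocab = Counter(), Counter(), set()
    for sent in corpus:
        toks = ["<s>"] + sent + ["</s>"]
        vocab.update(toks)
        for a, b in zip(toks, toks[1:]):
            big[(a, b)] += 1
            uni[a] += 1
    V = len(vocab)

    def logp(sent):
        toks = ["<s>"] + sent + ["</s>"]
        return sum(math.log((big[(a, b)] + 1) / (uni[a] + V))
                   for a, b in zip(toks, toks[1:]))
    return logp

corpus = [["the", "cat", "sleeps"], ["the", "dog", "sleeps"]]
lm = bigram_lm(corpus)
# two hypothetical candidate translations with equal translation-model
# scores; the noisy-channel argmax lets the LM pick the fluent order
tm = {("the", "cat", "sleeps"): math.log(0.4),
      ("cat", "the", "sleeps"): math.log(0.4)}
best = max(tm, key=lambda e: tm[e] + lm(list(e)))
```

This is exactly the division of labour between the translation model (adequacy) and the language model (fluency) that the decoder's search exploits.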
2010
El Projecte MOLTO: Multi Lingual On-Line Translation
Cristina España-Bonet
Invited talk at the workshop La Indústria de la Traducció entre Llengües Romàniques, UPV, València, September 2010.
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
The final goal of MOLTO is to develop a set of tools for translating texts between several languages in real time and with high quality. In these tools each language is conceived as an independent module and can therefore be added directly on top of the base system. Within the project, prototypes will be built to cover most of the 23 official EU languages.
As its main technique, MOLTO uses domain-specific semantic grammars and ontology-based interlinguas. These components are implemented in Grammatical Framework (GF), a grammar formalism in which several languages are related through a common abstract syntax. GF has been applied in several small and medium-sized domains, typically covering up to ten languages, but MOLTO will scale this up in terms of productivity and applicability.
Part of this scaling-up will be devoted to increasing the size of the domains and the number of languages. It is also important to make the technology accessible to domain experts without GF experience and to minimise the effort needed to build a translator. Ideally, this can be achieved simply by extending a lexicon and writing a set of example sentences.
The most research-intensive parts of MOLTO are the interoperability between ontology standards (OWL) and GF grammars, and the extension of rule-based translation with statistical methods. The OWL-GF interoperability will enable multilingual natural-language interaction with machine-readable knowledge. The statistical methods will add robustness to the system, and new methods will have to be developed to combine GF grammars with statistical translation to the benefit of both.
After the three years of the project, MOLTO technology will be delivered as open-source libraries that can be plugged into standard translation tools and web pages and thus integrated into standard workflows. Along the way, web demonstrators will be built and the methodology will be applied to three case studies: mathematics exercises in 15 languages, patent data in at least 3 languages, and museum object descriptions in 15 languages.
Additional information can be found on the official website http://www.molto-project.eu/.
Robust Estimation of Feature Weights in Statistical Machine Translation
Cristina España-Bonet, Lluís Màrquez
Proceedings of the 14th Annual Conference of the European Association for Machine Translation (EAMT), Saint-Raphaël, France, May 2010.
[
Abstract
Postscript
PDF
Poster
BibTeX
arXiv
]
Weights of the various components in a
standard Statistical Machine Translation model are usually estimated via Minimum Error Rate Training. With this, one finds their optimum value on a development set with the expectation that these optimal weights generalise well to other test sets. However, this is not always the case when domains differ. This work uses a perceptron algorithm to learn more robust weights to be used on out-of-domain corpora without the need for specialised data. For an Arabic-to-English translation system, the generalisation of weights represents an improvement of more than 2 points of BLEU with respect to the MERT baseline using the same information.
@InProceedings{espanaMarquez,
author = {{Espa{\~n}a-Bonet}, C. and {M\`arquez}, L.},
title = {Robust Estimation of Feature Weights in Statistical Machine Translation},
booktitle = {Proceedings of the 14th Annual Conference of the European Association for Machine Translation (EAMT'10)},
year = {2010},
month = {may},
date = {27-28},
address = {Saint-Rapha\"{e}l, France},
language = {english}
}
Language Technology Challenges of a 'small' Language (Catalan)
M. Melero, G. Boleda, M. Cuadros, C. España-Bonet, L. Padró, M. Quixal, C. Rodríguez, R. Saurí
Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC), Valletta, Malta, May 2010.
[
Abstract
Postscript
PDF
Poster
BibTeX
arXiv
]
In this paper, we present a brief snapshot of the state of affairs in computational processing of Catalan and the initiatives that are starting to take place in an effort to bring the field a step forward, by making a better and more efficient use of the already existing resources and tools, by bridging the gap between research and market, and by establishing periodical meeting points for the community. In particular, we present the results of the First Workshop on the Computational Processing of Catalan, which succeeded in putting together a fair representation of the research in the area, and received attention from both the industry and the administration. Aside from facilitating communication among researchers and between developers and users, the Workshop provided the organizers with valuable information about existing resources, tools, developers and providers. This information has allowed us to go a step further by setting up a "harvesting" procedure which will hopefully build the seed of a portal-catalogue-observatory of language resources and technologies in Catalan.
@InProceedings{MELERO10.628,
author = {Maite Melero and Gemma Boleda and Montse Cuadros and Cristina Espa{\~n}a-Bonet and Llu\'is Padr\'o
and Mart\'i Quixal and Carlos Rodr\'iguez and Roser Saur\'i},
title = {Language Technology Challenges of a 'Small' Language (Catalan)},
booktitle = {Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10)},
year = {2010},
month = {may},
date = {19-21},
address = {Valletta, Malta},
editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odjik
and Stelios Piperidis and Mike Rosner and Daniel Tapias},
publisher = {European Language Resources Association (ELRA)},
isbn = {2-9517408-6-7},
language = {english}
}
Statistical Machine Translation - A practical tutorial
Cristina España-Bonet
Tutorial at MOLTO kick-off meeting, Barcelona, March 2010.
[
Abstract
PDF (to show)
PDF (to print)
arXiv
]
Tutorial for beginners in SMT. It is intended
to show the fundamentals in less than 90 minutes and includes some guidelines for constructing an SMT
baseline.
Robust Estimation of Feature Weights in SMT
Cristina España-Bonet, Lluís Màrquez
Talk at OpenMT2 kick-off meeting, Ulia, Donostia, January 2010.
[
Abstract
Postscript
PDF
arXiv
]
Weights of the various components in a standard Statistical Machine Translation model are usually estimated via Minimum Error Rate Training. With this, one finds their optimum value on a development set with the expectation that these optimal weights generalise well to other test sets. However, this is not always the case when domains differ. Our work uses a perceptron algorithm to learn more robust weights to be used on out-of-domain corpora without the need for specialised data. For an Arabic-to-English translation system, the generalisation of the weights represents an
improvement of more than 2 points of BLEU with respect to the MERT baseline using exactly the same information.
2009
Discriminative Phrase-Based Models for Arabic Machine Translation
Cristina España-Bonet, Jesús Giménez, Lluís Màrquez
ACM Transactions on Asian Language Information Processing (TALIP), Vol. 8, No. 4, pages 1-20. December, 2009.
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
A design for an Arabic-to-English translation system is presented. The core of the system implements a standard Phrase-Based Statistical Machine Translation architecture, but it is extended by incorporating a local discriminative phrase selection model to address the semantic ambiguity of Arabic. Local classifiers are trained using linguistic information and context to translate a phrase, and this significantly increases the accuracy of phrase selection with respect to the most frequent translation traditionally considered. These classifiers are integrated into the translation system so that the global task benefits from the discriminative learning. As a result, we obtain significant improvements in the full translation task at the lexical, syntactic and semantic levels, as measured by a heterogeneous set of automatic evaluation metrics.
@article{talip09,
author = {{Espa{\~n}a-Bonet}, C. and {Gim\'enez}, J. and {M\`arquez}, L.},
title = {Discriminative Phrase-Based Models for Arabic Machine Translation},
journal = {ACM Transactions on Asian Language Information Processing (TALIP)},
year = 2009,
month = dec,
volume = 8,
number = 4,
pages = {1--20},
articleno = 15,
doi = {10.1145/1644879.1644882},
publisher = {ACM},
}
CoCo, a web interface for corpora compilation
C. España-Bonet, M. Vila, H. Rodríguez, M.A. Martí
Procesamiento del Lenguaje Natural, 43, 367-368. September, 2009.
[
Abstract
Postscript
PDF
Poster
BibTeX
arXiv
]
CoCo is a collaborative web interface for the compilation of linguistic resources. In this demo we are presenting one of its possible applications: paraphrase acquisition.
@ARTICLE{seplncoco2009,
author = {{Espa{\~n}a-Bonet}, C. and {Vila}, M. and {Mart\'i}, M.~A. and {Rodr\'iguez}, H.},
title = {CoCo, a web interface for corpora compilation},
journal = {Procesamiento del Lenguaje Natural},
volume = 43,
pages = {367--368},
year = 2009,
month = sep
}
Conclusiones de la primera Jornada del Procesamiento Computacional del Catalán
G. Boleda, M. Cuadros, C. España-Bonet, M. Melero, L. Padró, M. Quixal, C. Rodríguez
Procesamiento del Lenguaje Natural, 43, 387-388. September, 2009.
[
Abstract
Postscript
PDF
Poster
BibTeX
arXiv
]
Starting from the observation that the Catalan Natural Language and Speech Processing research community needed greater cohesion, a workshop (Jornada del Processament Computacional del Català, JPCC) was organised and held at the Palau Robert in Barcelona in March 2009. The goals of the workshop were (1) to improve communication and collaboration among the different research groups, companies and institutions that develop computational tools and resources for Catalan, (2) to find ways to exploit existing resources efficiently, and (3) to give visibility to research on the computational processing of Catalan.
@ARTICLE{seplnjpc2009,
author = {{Boleda}, G. and {Cuadros}, M. and {Espa{\~n}a-Bonet}, C. and {Melero}, M. and {Padr\'o}, L.
and {Quixal}, M. and {Rodr\'iguez}, C.},
title = {Conclusiones de la primera Jornada del Procesamiento Computacional del Catal\'an},
journal = {Procesamiento del Lenguaje Natural},
volume = 43,
pages = {387--388},
year = 2009,
month = sep
}
Sobre la I Jornada del Processament Computacional del català
G. Boleda, M. Cuadros, C. España-Bonet, M. Melero, L. Padró, M. Quixal, C. Rodríguez
Llengua i Ús, vol 45, 23-32, 2009.
[
Abstract
Postscript
PDF
BibTeX
arXiv
]
Computational language processing covers any activity related to the creation, management and use of linguistic technology and resources. On the scientific side, this activity is central to disciplines such as corpus linguistics, language engineering and written or spoken natural language processing. In everyday life, such processing is embedded in a wide and growing range of applications: automatic call-answering systems, machine translation, etc.
Most of these applications require language-specific tools and resources. For languages with a large market, such as English or Spanish, the offer of products and services based on language technology is varied and commonplace. For languages such as Catalan, it is harder to find products and services that come with this technology "out of the box".
To reflect the current state of language technologies applied to Catalan, to bring the members of this community into contact, and to promote initiatives that strengthen them, the first Jornada del Processament Computacional del Català was held at the Palau Robert in Barcelona in March 2009. The workshop aimed to be both a meeting point and a showcase for the research groups in the area, and to open the debate on how to organise the community so as to promote the use and development of Catalan both in language technology and in the products and services that depend on it. This article summarises the content and conclusions of the workshop.
@ARTICLE{lsc09,
author = {{Boleda}, G. and {Cuadros}, M. and {Espa{\~n}a-Bonet}, C. and {Melero}, M. and {Padr\'o}, L.
and {Quixal}, M. and {Rodr\'iguez}, C.},
title = "Sobre la I Jornada del Processament Computacional del catal\`a",
journal = "Llengua i \'Us",
volume = 45,
pages = {23--32},
year = 2009
}
El català i les tecnologies de la llengua
G. Boleda, M. Cuadros, C. España-Bonet, M. Melero, L. Padró, M. Quixal, C. Rodríguez
Llengua, Societat i Comunicació, vol 7, 20-26, 2009.
[
Abstract
Postscript
PDF
BibTeX
arXiv
]
(See Introduction)
@ARTICLE{lsc09b,
author = {{Boleda}, G. and {Cuadros}, M. and {Espa{\~n}a-Bonet}, C. and {Melero}, M. and {Padr\'o}, L.
and {Quixal}, M. and {Rodr\'iguez}, C.},
title = "El catal\`a i les tecnologies de la llengua",
journal = "Llengua, Societat i Comunicaci\'o",
volume = 7,
pages = {20--26},
year = 2009,
month = jul
}
Type Ia SNe along redshift: the R(SiII) ratio and the expansion velocities in intermediate z supernovae
G. Altavilla, P. Ruiz-Lapuente, A. Balastegui, J. Mendez, M. Irwin, C. España-Bonet, R.S. Ellis, G. Folatelli, A. Goobar, W. Hillebrandt, R.M. McMahon, S. Nobili, V. Stanishev, N.A. Walton
The Astrophysical Journal, vol 695, 135-148, 2009
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
We present a study of intermediate-z SNe Ia using the empirical physical diagrams which permit the investigation of those SNe explosions. This information can be very useful to reduce systematic uncertainties in the Hubble diagram of SNe Ia up to high z. The study of the expansion velocities and the measurement of the ratio R(SiII) allow subtyping of SNe Ia as done in nearby samples. The evolution of this ratio as seen in the diagram R(SiII)-(t), together with R(SiII)_max versus (B-V)_0, indicates that the properties at intermediate z are consistent with those of nearby SNe Ia. At intermediate z, the expansion velocities of Ca II and Si II are found to be similar to those of the nearby sample. This is found in a sample of six SNe Ia in the range 0.033≤z≤0.329 discovered within the International Time Programme of SNe Ia for Cosmology and Physics in the spring run of 2002. The programme ran under "Omega and Lambda from Supernovae and the Physics of Supernova Explosions" within the International Time Programme at the telescopes of the European Northern Observatory (ENO) at La Palma (Canary Islands, Spain). Two SNe Ia at intermediate z were of the cool FAINT type, one being a highly reddened SN1986G-like object. The R(SiII) ratio, as well as subclassification of the SNe Ia beyond templates, helps to place SNe Ia in their sequence of brightness and to distinguish between reddened and intrinsically red supernovae. This test can be done with very high z SNe Ia and will help to reduce systematic uncertainties due to extinction by dust. It should make it possible to map the high-z sample onto the nearby one.
@ARTICLE{midzsne2009,
author = {{Altavilla}, G. and {Ruiz-Lapuente}, P. and {Balastegui}, A. and {Mendez}, J. and {Irwin}, M. and
{Espa{\~n}a-Bonet}, C. and {Ellis}, R.~S. and {Folatelli}, G. and {Goobar}, A. and {Hillebrandt}, W.
and {McMahon}, R.~M. and {Nobili}, S. and {Stanishev}, V. and {Walton}, N.~A.},
title = "{Type Ia SNe along redshift: the R(Si II) ratio and the expansion velocities in intermediate z supernovae}",
journal = {Astrophysical Journal},
eprint = {arXiv:astro-ph/0610143},
year = 2009,
month = apr,
volume = 695,
pages = {135--148},
doi = {10.1088/0004-637X/695/1/135},
}
Discriminative learning within Arabic Statistical Machine Translation
Cristina España-Bonet, Jesús Giménez, Lluís Màrquez
Research Report LSI-09-3-R
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
Written Arabic is especially ambiguous due to the lack of diacritisation in texts, and this makes translation harder for automatic systems that do not take the context of phrases into account. Here, we use a standard Phrase-Based Statistical Machine Translation architecture to build an Arabic-to-English translation system, but we extend it by incorporating a local discriminative phrase selection model which addresses this semantic ambiguity. Local classifiers are trained using both linguistic information and context to translate a phrase, and this significantly increases the accuracy of phrase selection with respect to the most frequent translation traditionally considered. These classifiers are integrated into the translation system so that the global task benefits from the discriminative learning. As a result, we obtain improvements in the full translation of Arabic documents at the lexical, syntactic and semantic levels, as measured by a heterogeneous set of automatic metrics.
@TechReport{cespanaLSI093R,
author = {{Espa{\~n}a-Bonet}, C. and {Gim\'enez}, J. and {M\`arquez}, L.},
title = {Discriminative learning within Arabic Statistical Machine Translation},
institution = {LSI, UPC},
year = {2009},
month = {January},
type = {Research Report},
number = {LSI-09-3-R}
}
2008
The UPC-LSI Discriminative Phrase Selection System: NIST MT Evaluation 2008
Cristina España-Bonet, Jesús Giménez, Lluís Màrquez
Proceedings of the 2008 NIST Open Machine Translation Evaluation Workshop
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
This document describes the system developed by the Empirical MT Group at the Technical University of Catalonia, LSI Department, for the Arabic-to-English task at the 2008 NIST MT Evaluation Campaign. Our system explores the application of discriminative learning to the problem of phrase selection in Statistical Machine Translation. Instead of relying on Maximum Likelihood estimates for the construction of translation models, we use local classifiers which are able to take further advantage of contextual information. Local predictions are softly integrated into a global log-linear phrase-based statistical MT system as an additional feature. Automatic evaluation results according to a heterogeneous set of metrics operating at different linguistic levels are presented. These show a low level of agreement between metrics. Improvements over the baseline are either nonexistent or not significant, except for the case of semantic metrics based on discourse representations and several syntactic metrics based on constituent and dependency parsing.
@InProceedings{nistmt08,
author = {{Espa{\~n}a-Bonet}, C. and {Gim\'enez}, J. and {M\`arquez}, L.},
title = {The UPC-LSI Discriminative Phrase Selection System: NIST MT Evaluation 2008},
year = {2008},
organization = {NIST Open Machine Translation Evaluation Workshop}
}
A proposal for an Arabic-to-English SMT
Cristina España-Bonet
Master Thesis, Universitat de Barcelona and Universitat Politècnica de Catalunya (Artificial Intelligence Program)
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
Snippet of the Introduction:
The aim of this work is to apply MT techniques to translate from Arabic to English in the context of the 2008 NIST Machine Translation Open evaluation. For the core of the system we choose an SMT architecture. With a standard SMT system we check the improvements obtained by adding linguistic information, that is, by maximising the probability not only of the sequence of words, but of its lemmas, parts of speech and chunks as well. We increase the amount of linguistic knowledge, but we also increase the sparsity in the corpus, because the combination of features enlarges the vocabulary. We explore several approaches to these combinations.
As a second method, we use machine learning (ML) techniques to select the most adequate translation phrases and combine them with the output of the SMT system. We treat the translation task as a classification problem and use the linguistic information and the context of each word as features to train the classifiers. This methodology is used in Word Sense Disambiguation and should help to select the correct translation of a phrase according to its context. We analyse the results of this subtask and quantify the impact on the final results. The output of this phase is inserted into the SMT system by enlarging the translation table with every sense of a phrase and by including a new probability score, which accounts for the result of the classifier. We compare the results with and without this additional information. This combination of SMT and ML, MLT, is our final proposal for the Arabic-to-English SMT system.
@MastersThesis{crisSMTdea,
author = {{Espa{\~n}a-Bonet}, C.},
title = {A proposal for an Arabic-to-English SMT},
school = {Universitat de Barcelona and Universitat Polit\`ecnica de Catalunya},
year = 2008,
month = feb
}
Exploring the evolution of dark energy and its equation of state
Cristina España-Bonet
Ph.D. Thesis, Universitat de Barcelona (Astronomy and Astrophysics Program)
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
Abstract: To be included
@PhdThesis{crisTesi,
author = {{Espa{\~n}a-Bonet}, C.},
title = {Exploring the evolution of dark energy and its equation of state},
school = {Departament d'Astronomia i Meteorologia, Universitat de Barcelona},
year = 2008,
month = feb
}
Tracing the equation of state and the density of cosmological constant along z
Cristina España-Bonet, Pilar Ruiz-Lapuente
Journal of Cosmology and Astro-Particle Physics, vol. 02, pages 18+, 2008
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
We investigate the equation of state w(z) in a non-parametric form using the latest compilations of the luminosity distance from SNe Ia at high z. We combine the inverse problem approach with a Monte Carlo method to scan the space of priors. In the light of the latest high redshift supernova data sets, we reconstruct w(z). A comparison between a sample including the latest results at z>1 and a sample without those results shows the improvement achieved through observations of very high z supernovae. We present the prospects for measuring the variation of dark energy density along z by this method.
@ARTICLE{2008JCAP...02..018E,
author = {{Espa{\~n}a-Bonet}, C. and {Ruiz-Lapuente}, P.},
title = "{Tracing the equation of state and the density of the cosmological constant along z}",
journal = {Journal of Cosmology and Astro-Particle Physics},
archivePrefix = "arXiv",
eprint = {0805.1929},
year = 2008,
month = feb,
volume = 2,
pages = {18-+},
doi = {10.1088/1475-7516/2008/02/018},
adsurl = {http://adsabs.harvard.edu/abs/2008JCAP...02..018E},
adsnote = {Provided by the SAO/NASA Astrophysics Data System}
}
2006
Type Ia SNe along redshift: the R(Si II) ratio and the expansion velocities in intermediate z supernovae
G. Altavilla, P. Ruiz-Lapuente, A. Balastegui, J. Mendez, M. Irwin, C. España-Bonet, K. Schamaneche, C. Balland, R.S. Ellis, S. Fabbro, G. Folatelli, A. Goobar, W. Hillebrandt, R.M. McMahon, M. Mouchet, A. Mourao, S. Nobili, R. Pain, V. Stanishev, N.A. Walton
Submitted to The Astrophysical Journal
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
We study intermediate-z SNe Ia using the empirical physical diagrams which make it possible to learn about those SNe explosions. This information can be very useful to reduce systematic uncertainties in the Hubble diagram of SNe Ia up to high z. The study of the expansion velocities and the measurement of the ratio R(SiII) allow us to subtype those SNe Ia as done for nearby samples. The evolution of this ratio as seen in the diagram R(SiII)-(t), together with R(SiII)_max versus (B-V)_0, indicates that the properties at intermediate z are consistent with those of local SNe. At intermediate z, the expansion velocities of Ca II and Si II are similar to those of the nearby counterparts. This is found in a sample of 6 SNe Ia in the range 0.033≤z≤0.329 discovered within the International Time Programme (ITP) of Cosmology and Physics with SNe Ia during the spring of 2002. Those supernovae were identified using the 4.2m William Herschel Telescope. Two SNe Ia at intermediate z were of the cool FAINT type, one being a highly reddened SN1986G-like object. The R(SiII) ratio, as well as subclassification of the SNe Ia beyond templates, helps to place SNe Ia in their sequence of brightness and to distinguish between reddened and intrinsically red supernovae. This test can be done with very high z SNe Ia and will help to reduce systematic uncertainties due to extinction by dust. It should make it possible to map the high-z sample onto the nearby one.
@ARTICLE{2006astro.ph.10143A,
author = {{Altavilla}, G. and {Ruiz-Lapuente}, P. and {Balastegui}, A. and {Mendez}, J. and {Irwin}, M. and
{Espa{\~n}a-Bonet}, C. and {Schamaneche}, K. and {Balland}, C. and {Ellis}, R.~S. and {Fabbro}, S. and
{Folatelli}, G. and {Goobar}, A. and {Hillebrandt}, W. and {McMahon}, R.~M. and {Mouchet}, M. and
{Mourao}, A. and {Nobili}, S. and {Pain}, R. and {Stanishev}, V. and {Walton}, N.~A.},
title = "{Type Ia SNe along redshift: the R(Si II) ratio and the expansion velocities in intermediate z supernovae}",
journal = {ArXiv Astrophysics e-prints},
eprint = {arXiv:astro-ph/0610143},
keywords = {Astrophysics},
year = 2006,
month = oct,
adsurl = {http://adsabs.harvard.edu/abs/2006astro.ph.10143A},
adsnote = {Provided by the SAO/NASA Astrophysics Data System}
}
2004
Dark Energy as an Inverse Problem
Cristina España-Bonet, Pilar Ruiz-Lapuente
Poster at JENAM The many scales in the Universe, IAA, Granada, September 2004
[
Abstract
Postscript
JPG
Slides
BibTeX
arXiv
]
In order to improve the information on dark energy, it is important not only to have a large amount of good-quality data, but also to know where these data are most profitable and then to exploit all the statistical methods to extract the information. We apply Inverse Problem Theory here to determine the parameters appearing in the equation of state and the functional form itself. This method also determines which distribution of high-redshift data would be best for studying the equation of state of dark energy, i.e., which distribution yields the best quality of the inversion. Supernova magnitudes are used alone and together with other sources such as radio galaxies and compact radio sources.
Viabilitat d'una Constant Cosmològica variable. Contrast amb SNeIa.
Cristina España-Bonet
Master Thesis (DEA), Universitat de Barcelona (Astronomy and Astrophysics Program)
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
This work analyses in detail the behaviour of the cosmological constant from the point of view of a quantum field theory. Once its evolution at low energies is obtained, it allows us to compare the predictions of the model with other forms of dark energy, using high-redshift SNe Ia as the tool for this comparison. From the change of their magnitude with redshift it is verified that this family of models is perfectly compatible with the observations and, therefore, the possibility that the cosmological constant evolves with time cannot be ruled out. The consideration of different scenarios has made it possible to fit parameters such as the mass of the light neutrinos (mν ~ 0.01 eV), to test the compatibility of the standard model of particle physics with astrophysical data, and to determine parameters related to the physics that may occur at the Planck epoch. Beyond the results found with current data, several simulations of future data sets such as those of the SNAP project have helped to assess the testability of the model. Thus, the projects planned to obtain high-redshift SNe Ia for the study of dark energy will be sufficient, in most cases, to verify whether or not the cosmological constant evolves.
@MastersThesis{crisAstroDEA,
author = {{Espa{\~n}a-Bonet}, C.},
title = {Viabilitat d'una Constant Cosmol{\`o}gica variable. Contrast amb SNeIa.},
school = {Dept. Astronomia i Meteorologia, Universitat de Barcelona},
year = 2004,
month = sep
}
Testing the running of the cosmological constant with Type Ia Supernovae at high z
Cristina España-Bonet, Pilar Ruiz-Lapuente, Ilya L. Shapiro, Joan Solà
Journal of Cosmology and Astro-Particle Physics, vol. 02, pages 6+, 2004
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
Within the Quantum Field Theory context, the idea of a cosmological constant (CC) evolving with time looks quite natural, as it just reflects the change of the vacuum energy with the typical energy of the universe. In the particular frame of Ref. [30], a running CC at low energies may arise from generic quantum effects near the Planck scale, M_P, provided there is a smooth decoupling of all massive particles below M_P. In this work we further develop the cosmological consequences of a running CC by addressing the accelerated evolution of the universe within that model. The rate of change of the CC stays slow, without fine-tuning, and is comparable to H^2 M_P^2. It can be described by a single parameter, ν, that can be determined from already planned experiments using SNe Ia at high z. The range of allowed values for ν follows mainly from nucleosynthesis restrictions. Present samples of SNe Ia cannot yet distinguish between a constant CC and a running one. The numerical simulations presented in this work show that SNAP can probe the predicted variation of the CC, either ruling out this idea or confirming the evolution expected hereafter.
@ARTICLE{2004JCAP...02..006E,
author = {{Espa{\~n}a-Bonet}, C. and {Ruiz-Lapuente}, P. and {Shapiro}, I.~L. and {Sol{\`a}}, J.},
title = "{Testing the running of the cosmological constant with type Ia supernovae at high z}",
journal = {Journal of Cosmology and Astro-Particle Physics},
eprint = {arXiv:hep-ph/0311171},
year = 2004,
month = feb,
volume = 2,
pages = {6-+},
adsurl = {http://adsabs.harvard.edu/abs/2004JCAP...02..006E},
adsnote = {Provided by the SAO/NASA Astrophysics Data System}
}
2003
Variable Cosmological Constant as a Planck scale effect
Ilya L. Shapiro, Joan Solà, Cristina España-Bonet, Pilar Ruiz-Lapuente
Physics Letters B, 574, pag 149-155, 2003
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
We construct a semiclassical FLRW cosmological model assuming a running cosmological constant (CC). It turns out that the CC becomes variable at arbitrarily low energies due to the remnant quantum effects of the heaviest particles, e.g. Planck scale physics. These effects are universal in the sense that they lead to a low-energy structure common to a large class of high-energy theories. Remarkably, the uncertainty concerning the unknown high-energy dynamics is accumulated in a single parameter ν, such that the model has essential predictive power. Future Type Ia supernova experiments (like SNAP) can verify whether this framework is correct. For the flat FLRW case and a moderate value ν~0.01, we predict an increase of 10-20% in the value of ΩΛ at redshifts z=1-1.5, perfectly reachable by SNAP.
@ARTICLE{2003PhLB..574..149S,
author = {{Shapiro}, I.~L. and {Sol{\`a}}, J. and {Espa{\~n}a-Bonet}, C. and {Ruiz-Lapuente}, P.},
title = "{Variable cosmological constant as a Planck scale effect}",
journal = {Physics Letters B},
eprint = {arXiv:astro-ph/0303306},
year = 2003,
month = nov,
volume = 574,
pages = {149-155},
doi = {10.1016/S0370-2693(03)01376-5},
adsurl = {http://adsabs.harvard.edu/abs/2003PhLB..574..149S},
adsnote = {Provided by the SAO/NASA Astrophysics Data System}
}
Supernovae and Cosmology
Cristina España-Bonet
Talk given at Dpt. Estructura i Constituents de la Matèria (Universitat de Barcelona), Dpt. Física i Enginyeria Nuclear (Universitat Politècnica de Catalunya) and Institut de Física d'Altes Energies.
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
Type Ia supernovae (SNe Ia) are the only standard candles known at high redshifts. This has made them one of the main tools for cosmology. This talk explains how the determination of their luminosity distance makes it possible to discriminate among different cosmological models (with particular attention to how the now widely accepted conclusion was reached that we live in an accelerating universe dominated by the cosmological constant) and reviews the current state of this research.
2002
Present-day running of the cosmological constant
Cristina España-Bonet, Pilar Ruiz-Lapuente
Poster at the Winter School Dark matter and dark energy in the Universe, IAC, Tenerife, November 2002
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
A particle physics account of the cosmological constant is given through two
different approaches. Both of them use the equations of quantum field theory
and so the cosmological "constant" has its own renormalization group
equation (RGE). The obtained running is then introduced into the theoretical
expression for the magnitude-redshift relation, so that a minimization of
the residuals with the observational data from supernovae (SN) allows us to
fit some parameters. Among the latter are the lightest neutrino masses, for
which the best value is mν = 0.004-0.005 eV (with the possible
presence of a sterile light field). Future applications of the type of
analysis presented here are finally pointed out.
Present-day running of the cosmological constant
Cristina España-Bonet, Pilar Ruiz-Lapuente
Proceedings from On the nature of dark energy, IAP, Paris, July 2002
[
Abstract
Postscript
PDF
Slides
BibTeX
arXiv
]
A particle physics account of the cosmological constant is given through two
different approaches. Both of them use the equations of quantum field theory
and so the cosmological "constant" has its own renormalization group
equation (RGE). The obtained running is then introduced into the theoretical
expression for the magnitude-redshift relation, so that a minimization of
the residuals with the observational data from supernovae (SN) allows us to
fit some parameters. Among the latter are the lightest neutrino masses, for
which the best value is mν = 0.004-0.005 eV (with the possible
presence of a sterile light field). Future applications of the type of
analysis presented here are finally pointed out.