Publications

2020

  1. Nils Rethmeier, Vageesh Kumar Saxena, Isabelle Augenstein (2020) TX-Ray: Quantifying and Explaining Model-Knowledge Transfer in (Un-)Supervised NLP, UAI 2020 http://www.auai.org/uai2020/proceedings/197_main_paper.pdf
  2. Hongfei Xu, Deyi Xiong et al. (2020). Efficient Context-Aware Neural Machine Translation with Layer-Wise Weighting and Input-Aware Gating. In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI-20, pp. 3933–3940.
  3. Marta R. Costa-jussà, Pau Li Lin, and Cristina España-Bonet (2020). GeBioToolkit: Automatic Extraction of Gender-Balanced Multilingual Corpus of Wikipedia Biographies. In 12th International Conference on Language Resources and Evaluation (LREC-2020). European Language Resources Association (ELRA), pp. 4081–4088.
  4. Ekaterina Loginova, Stalin Varanasi and Günter Neumann (2020) Towards End-to-End Multilingual Question Answering . In Journal Information Systems Frontiers <https://www.springer.com/journal/10796>, https://doi.org/10.1007/s10796-020-09996-1, February, 2020.
  5. Saadullah Amin, Katherine Dunfield, Anna Vechkaeva and Günter Neumann (2020) A Data-driven Approach for Noise Reduction in Distantly Supervised Biomedical Relation Extraction . In Proceedings of BioNLP-2020 <https://aclweb.org/aclwiki/BioNLP_Workshop> at ACL-2020 <https://acl2020.org/>.
  6. Anna Vechkaeva and Günter Neumann (2020) Latent Feature Generation with Adversarial Learning for Aphasia Classification . In Proceedings of RaPID-2020 <https://spraakbanken.gu.se/en/rapid-2020> at LREC-2020 <https://lrec2020.lrec-conf.org/en/>.
  7. Katherine Dunfield and Günter Neumann (2020) Automatic Quantitative Prediction of Severity in Fluent Aphasia Using Sentence Representation Similarity. In Proceedings of RaPID-2020 <https://spraakbanken.gu.se/en/rapid-2020> at LREC-2020 <https://lrec2020.lrec-conf.org/en/>.
  8. Eleni Metheniti and Günter Neumann (2020) Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus. In LREC – International Conference on Language Resources and Evaluation (LREC-2020) May 1-4 LREC 5/2020. <https://lrec2020.lrec-conf.org/en/>
  9. Saadullah Amin, Stalin Varanasi, Katherine Dunfield and Günter Neumann (2020) LowFER: Low-rank Bilinear Pooling for Link Prediction. Proceedings of the 37th International Conference on Machine Learning (ICML-2020), 2020 <https://icml.cc/Conferences/2020>.
  10. Stalin Varanasi, Saadullah Amin, and Günter Neumann. CopyBERT: A Unified Approach to Question Generation with Self-Attention. Proceedings of the 2nd Workshop on Natural Language Processing for Conversational AI. 2020.
  11. Jesujoba Alabi, Kwabena Amponsah-Kaakyire, David I. Adelani, Cristina España i Bonet (2020). “Massive vs. Curated Embeddings for Low-Resourced Languages: the Case of Yorùbá and Twi”. In: 12th International Conference on Language Resources and Evaluation (LREC-2020), May 12-17, Marseille, France. European Language Resources Association (ELRA), Paris.
  12. Jingyi Zhang and Josef van Genabith Translation Quality Estimation by Jointly Learning to Score and Rank, EMNLP 2020
  13. Dana Ruiter, Josef van Genabith and Cristina España-Bonet Self-Induced Curriculum Learning in Self-Supervised Neural Machine Translation, EMNLP 2020
  14. Hongfei Xu, Qiuhui Liu, Josef van Genabith, Deyi Xiong and Jingyi Zhang. “Lipschitz Constrained Parameter Initialization for Deep Transformers”, ACL 2020, short paper
  15. Hongfei Xu, Josef van Genabith, Deyi Xiong and Qiuhui Liu. “Dynamically Adjusting Transformer Batch Size by Monitoring Gradient Direction Change”, ACL 2020, short paper
  16. Hongfei Xu, Josef van Genabith, Deyi Xiong, Qiuhui Liu and Jingyi Zhang. “Learning Source Phrase Representations for Neural Machine Translation”, ACL 2020, long paper
  17.  Marta R. Costa-jussà,, Cristina España-Bonet, Pascale Fung, and Noah A. Smith (2020). “Multilingual and Interlingual Semantic Representations for Natural Language Processing: A Brief Introduction”. In: Computational Linguistics Special Issue: Multilingual and Interlingual Semantic Representations for Natural Language Processing, pp. 1-8.
  18. Christine Schäfer (2020). “Evaluation of Transfer Learning Approaches for Cross-Lingual Question Answering”. Master Thesis at Saarland University, 2020.
  19. Christoph Alt, Aleksandra Gabryszak and Leonhard Hennig. “TACRED Revisited: A Thorough Evaluation of the TACRED Relation Extraction Task”, ACL 2020, long paper
  20. Christoph Alt, Aleksandra Gabryszak and Leonhard Hennig. “Probing Linguistic Features of Sentence-Level Representations in Relation Extraction”, ACL 2020, long paper
  21. Santanu Pal, Hongfei Xu, Nico Herbig, Sudip Kumar Naskar, Antonio Krüger and Josef van Genabith The Transference Architecture for Automatic Post-Editing. ArXiv e-prints, 2019, pp.1-10
  22. Marc Hübner; Christoph Alt; Robert Schwarzenberg; and Leonhard Hennig. Defx at SemEval-2020 Task 6: Joint Extraction of Concepts and Relations for Definition Extraction. Proceedings of the Fourteenth Workshop on Semantic Evaluation, Barcelona (online), 2020, p. 704–709.


    2019

  23. Eva Martínez Garcia, Carles Creus and Cristina España-Bonet (2019). “Context-Aware Neural Machine Translation Decoding”. In: 4th Workshop on Discourse in Machine Translation (DiscoMT-2019), located at EMNLP-IJCNLP 2019, November 3, Hong Kong. ACL pp. 13-23.
  24. Robert Schwarzenberg, Marc Hübner, David Harbecke, Christoph Alt, and Leonhard Hennig (2019). “Layerwise Relevance Visualization in Convolutional Text Graph Classifiers”. In: Workshop on Graph-Based Natural Language Processing (EMNLP-2019), November 3-7, Hong Kong. ACL.
  25. Ekaterina Lapshinova-Koltunski, Cristina España-Bone, and Josef van Genabith (2019). Analysing Coreference in Transformer Outputs. In: Fourth Workshop on Discourse in Machine Translation (DiscoMT-2019), November 3, Hong Kong. ACL. ACL, pp. 1–12.
  26. Lisa Raithel; Robert Schwarzenberg. Cross-lingual Neural Vector Conceptualization, NLPCC 2019 Workshop on Explainable Artificial Intelligence (XAI-2019), Dunhuang, China. Lecture Notes in Artificial Intelligence, Springer, 2019
  27. Eleftherios Avramidis; Vivien Macketanz; Ursula Strohriegel; Hans Uszkoreit. Linguistic Evaluation of German-English Machine Translation using a Test Suite. Proceedings of the Fourth Conference on Machine Translation (WMT-2019), at ACL 2019. Florence, Italy, 2019
  28. Christoph Alt; Marc Hübner; Leonhard Hennig. Fine-tuning Pre-trained Transformer Language Models to Distantly Supervised Relation Extraction. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL-2019). Florence, Italy, 2019.
  29. Cristina España-Bonet; Dana Ruiter; Josef van Genabith. UdS-DFKI Participation at WMT 2019: Low-Resource and Coreference-Aware Systems. Fourth Conference on Machine Translation (WMT19), at ACL-2019. Florence, Italy, 2019.
  30. Dana Ruiter; Cristina España-Bonet; Josef van Genabith. Self-Supervised Neural Machine Translation. 57th Annual Meeting of the Association for Computational Linguistics. Florence, Italy, 2019
  31. Jingyi Zhang; Josef van Genabith. DFKI-NMT Submission to the WMT19 News Translation Task. Fourth Conference on Machine Translation (WMT19), at ACL-2019. Florence, Italy, 2019
  32. Dominik Stammbach; Stalin Varanasi; Günter Neumann. DOMLIN at SemEval-2019 Task 8: Automated Fact Checking exploiting Ratings in Community Question Answering Forums. The International Workshop on Semantic Evaluation – Proceedings of the Thirteenth Workshop (SemEval 2019), at NAACL-HLT. Minneapolis, USA, 2019.
  33. Robert Schwarzenberg; Lisa Raithel; David Harbecke. Neural Vector Conceptualization for Word Vector Space Interpretation, NAACL-HLT 2019 Workshop on Evaluating Vector Space Representations for NLP (RepEval), Minneapolis, USA, 2019
  34. Robert Schwarzenberg, David Harbecke; Vivien Macketanz; Eleftherios Avramidis; Sebastian Möller. Train, Sort, Explain: Learning to Diagnose Translation Models. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT): Demonstrations. Minneapolis, USA, 2019
  35. Christoph Alt; Marc Hübner; Leonhard Hennig. Improving Relation Extraction by Pre-trained Language Representations. Proceedings of Conference on Automated Knowledge Base Construction. Amherst MA., USA, 2019
  36. Alejandro Figueroa; Carlos Gómez-Pantoja; Günter Neumann. Integrating heterogeneous sources for predicting temporal anchors across Yahoo! Answers, Information Fusion 50, pp. 112-150, Elsevier, 2019
  37. Dominik Stammbach and Günter Neumann (2019) Team DOMLIN: Exploiting Evidence Enhancement for the FEVER Shared Task.  <>In Proceedings of the Second Workshop on Fact Extraction and VERification <https://www.aclweb.org/anthology/volumes/D19-66/> (FEVER), EMNLP workshop, 2019.
  38. Santanu Pal, Hongfei Xu, Nico Herbig, Antonio Krüger, Josef van Genabith. The Transference Architecture for English-German Automatic Post-Editing. Fourth Conference on Machine Translation (WMT-2019), August 1-2, Florence, Italy.
  39. Santanu Pal, Marcos Zampieri, Josef van Genabith. UDS–DFKI Submission to the WMT2019 Czech–Polish Similar Language Translation Shared Task. Proceedings of the Fourth Conference on Machine Translation (Volume 3: Shared Task Papers, Day 2), August 2019, Florence, Italy, Association for Computational Linguistics, 221–225
  40. Loïc Barraul, Ondřej Bojar, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Philipp Koehn, Shervin Malmasi, Christof Monz, Mathias Müller, Santanu Pal, Matt Post, Marcos Zampieri. Findings of the 2019 Conference on Machine Translation (WMT19). Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1), August 2019, Florence, Italy, Association for Computational Linguistics, 1–61
  41. Santanu Pal, Hongfei Xu, Nico Herbig, Antonio Krüger, Josef van Genabith. USAAR-DFKI — The Transference Architecture for English–German Automatic Post-Editing. Proceedings of the Fourth Conference on Machine Translation (Volume 3: Shared Task Papers, Day 2), August 2019, Florence, Italy, Association for Computational Linguistics, 126–133
  42. Nils Rethmeier and Barbara Plank. “MoRTy: Unsupervised Learning of Task-specialized Word Embeddings by Autoencoding.” In Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019), pp. 49-54. 2019.
  43. Hongfei Xu; Quihui Liu; Josef van Genabith. UdS Submission for the WMT 2019 Automatic Post-Editing Task, Proceedings of the Fourth Conference on Machine Translation, Florence, Italy, 2019.
  44. Daniel Kondratyuk. 2019.75 languages, 1 model: Parsing universal dependencies universally. CoRR, abs/1904.02099.

    2018

  45. Santanu Pal; Nico Herbig; Antonio Krüger; Josef van Genabith. A Transformer- Based Multi-Source Automatic Post-Editing System. Proceedings of the Third Conference on Machine Translation (WMT-2018) at EMNLP. Brussels, Belgium, 2018
  46. Nils Rethmeier; Marc Hübner; Leonhard Hennig. Learning Comment Controversy Prediction in Web Discussions Using Incidentally Supervised Multi-Task CNNs. Proceedings of the 9th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, at EMNLP. Brussels, Belgium, 2018
  47. Prasenjit Basu; Santanu Pal; Sudip Kumar Naskar. Keep It or Not: Word Level Quality Estimation for Post-Editing. Proceedings of the Third Conference on Machine Translation (WMT-2018), Volume 2: Shared Task Papers, at EMNLP. Brussels, Belgium, 2018
  48. Vivien Macketanz; Eleftherios Avramidis; Aljoscha Burchardt; Hans Uszkoreit. Fine-grained evaluation of German-English Machine Translation based on a Test Suite. Proceedings of the Third Conference on Machine Translation (WMT-2018), at EMNLP. Brussels, Belgium, 2018
  49. Gennady Agre; Josef van Genabith; Thierry Declerck (eds.). Proceedings of 18th International Conference AIMSA 2018. Varna, Bulgaria, September 12-14, 2018.
  50. Ekaterina Loginova; Günter Neumann. An Interactive Web-Interface for Visualizing the Inner Workings of the Question Answering LSTM. Proceedings of EMNLP-2018, System Demonstration. Brussels, Belgium, 2018
  51. David Harbecke; Robert Schwarzenberg; Christoph Alt. Learning Explanations From Language Data. Proceedings of the EMNLP Workshop on Interpreting and Analysing Neural Networks for NLP (BlackboxNLP). Brussels, Belgium, 2018
  52. Nicola Ferro; Norbert Fuhr; Gregory Grefenstette; Joseph A. Konstan; Pablo Castells; Elizabeth M. Daly; Thierry Declerck; Michael D. Ekstrand; Werner Geyer; Julio Gonzalo; Tsvi Kuflik; Krister Linden; Bernardo Magnini; Jian-Yun Nie; Raffaele Perego; Bracha Shapira; Ian Soboroff; Nava Tintarev; Karin Verspoor; Martijn C. Willemsen; Justin Zobel. The Dagstuhl Perspectives Workshop on Performance Modeling and Prediction. Claudia Hauff; Craig Macdonald (eds). SIGIR Forum 52(1), pp. 91-101. ACM 2018
  53. Ekaterina Loginova; Stalin Varanasi; Günter Neumann. Towards Multilingual Neural Question Answering. 1st International Workshop on Artificial Intelligence for Question Answering (AI*QA-2018), at 22nd European Conference on Advances in Databases and Information Systems (ADBIS 2018). Budapest, Hungary, 2018
  54. Khyathi Raghavi Chandu; Ekaterina Loginova; Vishal Gupta; Josef van Genabith; Günter Neumann; Manoj Chinnakotla; Eric Nyberg; Alan Black. Code-Mixed Question Answering Challenge: Crowd-sourcing Data and Techniques, Third Workshop on Computational Approaches to Linguistic Code-switching, at ACL-2018. Melbourne, Australia, 2018
  55. Tyler Renslow; Günter Neumann. LIGHTREL at SemEval-2018 Task 7: Lightweight and Fast Relation Classification. Semeval-2018 Task 7: Semantic relation extraction and classification in scientific papers, at NAACL. New Orleans, LA, USA, 2018
  56. Cristina España-Bonet; Josef van Genabith. Multilingual Semantic Networks for Data-driven Interlingua Seq2Seq Systems. Jinhua Du; Mihael Arcan; Qun Liu; Hitoshi Isahara (eds.). Proceedings of the LREC 2018 Workshop “MLP-MomenT”. Miyazaki, Japan, 2018
  57. John P. McCrae; Christian Chiarcos; Thierry Declerck; Jorge Gracia; Bettina Klimek (eds.). Towards a Linked Lexical Data Cloud based on OntoLex-Lemon, Proceedings of the 6th Workshop on Linked Data in Linguistics (LDL-2018) at LREC. Miyazaki. Japan, 2018
  58. Dagmar Gromann; Thierry Declerck. Comparing Pretrained Multilingual Word Embeddings on an Ontology Alignment Task, Proceedings of the LREC 2018, Miyazaki, Japan, 2018
  59. Dagmar Gromann; Luis Espinosa Anke; Thierry Declerck. Special Issue on Semantic Deep Learning. Semantic Web Journal. IOS Press. 2018.
  60. Thierry Declerck. Towards a Linked Lexical Data Cloud based on OntoLex-Lemon. In Proceedings of the 6th Workshop on Linked Data in Linguistics (LDL-2018) at LREC. Miyazaki. Japan, 2018
  61. Thierry Declerck; Kseniya Egorova; Eileen Schnur. An Integrated Formal Representation for Terminological and Lexical Data included in Classification Schemes. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). Miyazaki. Japan, 2018