Prof. Dr. Michael Strube

Group Leader Natural Language Processing

Contact

Phone: +49 6221 – 533 – 243

Group leader “Natural Language Processing” at HITS
“Honorarprofessor” at the Computational Linguistics Department at Heidelberg University

Research Interests

Linguistics:

Text and Dialogue
Pragmatics

Computational Linguistics:

Anaphora and Coreference Resolution
Generation of Referring Expressions
Modeling Local (and maybe also Global) Coherence
Discourse and Dialogue Structure (though I don’t believe in it)

Natural Language Processing:

Automatic Summarization
Concept Disambiguation, Entity Linking, Cross-document Coreference Resolution
Information Extraction
Knowledge Acquisition, Ontology Learning
Natural Language Generation Systems

Curriculum Vitae

2017-2018 Scientific Director at HITS
2017/18 Program Co-Chair Workshops on Ethics in NLP at ACL
2015 PC Co-Chair of the ACL’s flagship conference ACL-IJCNLP ’15 in Beijing, China, July 26-31, 2015
Since 2010 “Honorarprofessor” in the Computational Linguistics Department at the University of Heidelberg
Since 2003 Member of EML Research, now HITS
2000-2003 Member of the EML European Media Laboratory
1997-1999 PostDoc at the Institute for Research in Cognitive Science at the University of Pennsylvania, Philadelphia, US
1996 PhD at the University of Freiburg, Germany

2023

Zhao W, Strube M, Eger S (2023). DiscoScore: Evaluating text generation with BERT and discourse coherence, Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, Dubrovnik, Croatia, May 2023, pp. 3865-3883

2022

Yu J, Khosla S, Manuvinakurike R, Levin L, Ng V, Poesio M, Strube M, Rosé C (2022). Proceedings of the CODI-CRAC 2022 Shared Task on Anaphora, Bridging, and Discourse Deixis in Dialogue, Gyeongju, Republic of Korea, October 2022
Yu J, Khosla S, Manuvinakurike R, Levin L, Ng V, Poesio M, Strube M, Rosé C (2022). The CODI-CRAC 2022 shared task on anaphora, bridging, and discourse deixis in dialogue, Proceedings of the CODI-CRAC 2022 Shared Task on Anaphora, Bridging, and Discourse Deixis in Dialogue, Gyeongju, Republic of Korea, October 2022, pp. 1–14
Braud C, Hardmeier C, Li JJ, Loaciga S, Strube M, Zeldes A (2022). Proceedings of the 3rd Workshop on Computational Approaches to Discourse, Gyeongju, Republic of Korea, October 2022
Chai H, Moosavi NS, Gurevych I, Strube M (2022). Evaluating coreference resolvers on community-based question answering: From rule-based to state of the art, Proceedings of the Fifth Workshop on Computational Models of Reference, Anaphora and Coreference, Gyeongju, Republic of Korea, 16–17 October 2022, pp. 61–73
Liang S, Kades K, Fink M, Full P, Weber T, Kleesiek J, Strube M, Maier-Hein K (2022). Fine-tuning BERT Models for Summarizing German Radiology Findings, Proceedings of the 4th Clinical Natural Language Processing Workshop, Seattle, Washington, July 2022
Chai H, Strube M (2022). Incorporating Centering Theory into Neural Coreference Resolution, Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Seattle, Washington, July 2022
Jeon S, Strube M (2022). Entity-based Neural Local Coherence Modeling, In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL). Dublin, Ireland, May 2022

2021

López F, Pozzetti B, Trettel S, Strube M, Wienhard A (2021). Vector-valued Distance and Gyrocalculus on the Space of Symmetric Positive Definite Matrices, In Proceedings of the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021), Online, December 6-12, 2021.
Fatima M, Strube M (2021). A Novel Wikipedia based Dataset for Monolingual and Cross-lingual Summarization, In Proceedings of the Third Workshop on New Frontiers in Summarization. Punta Cana, Dominican Republic, November 10, 2021.
Jeon S, Strube M (2021). Countering the Influence of Essay Length in Neural Essay Scoring, In Proceedings of the Second Workshop on Sustainable NLP. Punta Cana, Dominican Republic, November 10, 2021.
Khosla S, Yu J, Manuvinakurike R, Ng V, Poesio M, Strube M, Rosé C (2021). The CODI-CRAC 2021 Shared Task on Anaphora, Bridging, and Discourse Deixis in Dialogue, In Proceedings of the CODI-CRAC 2021 Shared Task on Anaphora, Bridging, and Discourse Deixis in Dialogue. Punta Cana, Dominican Republic, November 10, 2021.
Braud C, Hardmeier C, Li JJ, Louis A, Strube M, Zeldes A (2021). Proceedings of the 2nd Workshop on Computational Approaches to Discourse, Punta Cana, Dominican Republic and Online
Strube M (2021). Computerwissenschaften und The Circle — The Circle und Computerwissenschaften, In Kempter, Klaus und Martina Engelbrecht (Eds.): Krise(n) der Moderne. Über Literatur und Zeitdiagnostik. Universitätsverlag Winter, Heidelberg, Germany. 451-460.
López F, Pozzetti B, Trettel S, Strube M, Wienhard A (2021). Symmetric Spaces for Graph Embeddings: A Finsler-Riemannian Approach, In Proceedings of the 38th International Conference on Machine Learning, vol. 139 of Proceedings of Machine Learning Research, pp. 7090–7101, Eds: Meila, Marina and Zhang, Tong, PMLR

2020

Jeon S, Strube M (2020). Incremental Neural Lexical Coherence Modeling, In Proceedings of the 28th International Conference on Computational Linguistics (COLING), Online, December 2020, pp. 6752–6758
Braud C, Hardmeier C, Li JJ, Louis A, Strube M (2020). Proceedings of the First Workshop on Computational Approaches to Discourse, Online
Mathews K, Strube M (2020). A large harvested corpus of location metonymy, In Proceedings of the 12th International Conference on Language Resources and Evaluation, Marseille, France, 11–16 May 2020
Jeon S, Strube M (2020). Centering-based Neural Coherence Modeling with Hierarchical Discourse Segments, In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Online, November 2020, pp. 7458-7472
López F, Strube M (2020). A Fully Hyperbolic Neural Model for Hierarchical Multi-class Classification, In Proceedings of Findings of the Association for Computational Linguistics: EMNLP 2020, Online, November 2020, pp. 460-475
Müller M, Ghosh S, Rey M, Wittig U, Müller W, Strube M (2020). Reconstructing Manual Information Extraction with DB-to-Document Backprojection: Experiments in the Life Science Domain, In Proceedings of the First Workshop on Scholarly Document Processing, Online, November 2020, pp. 81-90.
Chai H, Zhao W, Eger S, Strube M (2020). Evaluation of Coreference Resolution Systems Under Adversarial Attacks, In Proceedings of the First Workshop on Computational Approaches to Discourse, Online, November 2020, pp. 154-159.

2019

Sekulić I, Strube M (2019). Adapting deep learning methods for mental health prediction on social media, In Proceedings of the 5th Workshop on Noisy User-generated Text, Hong Kong, 4 November 2019, pp. 322-327
Zhu Y, Heinzerling B, Vulic I, Strube M, Reichart R, Korhonen A (2019). On the importance of subword information for morphological tasks in truly low-resource languages, In Proceedings of the 23rd Conference on Computational Natural Language Learning, Hong Kong, 3-4 November 2019, pp. 216-226
López F, Heinzerling B, Strube M (2019). Fine-Grained Entity Typing in Hyperbolic Space, In Proceedings of The Fourth Workshop on Representation Learning for NLP (Rep4NLP) @ ACL 2019, Florence, Italy, 2 August 2019, pp. 169-180
Heinzerling B, Strube M (2019). Sequence Tagging with Contextual and Non-Contextual Subword Representations: A Multilingual Evaluation, In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 28 July – 2 August 2019, pp. 273-291
Moosavi NS, Born L, Poesio M, Strube M (2019). Using Automatically Extracted Minimum Spans to Disentangle Coreference Evaluation from Boundary Detection, In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 28 July – 2 August 2019, pp. 4168-4178

2018

Hou Y, Markert K, Strube M (2018). Unrestricted bridging resolution, Computational Linguistics 44(2):237-284
Heinzerling B, Strube M (2018). BPEmb: Tokenization-free pre-trained subword embeddings in 275 languages, In Proceedings of the 11th International Conference on Language Resources and Evaluation, Miyazaki, Japan, 7–12 May 2018
Kirilin A, Strube M (2018). Exploiting a speaker’s credibility to detect fake news, In Workshop on Data Science, Journalism and Media, London, UK, 20 August 2018
Mesgar M, Strube M (2018). A neural local coherence model for text quality assessment, In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, 31 October – 4 November 2018, pp. 4328-4339
Müller M, Strube M (2018). Transparent, efficient, and robust word embedding access with WOMBAT, In Proceedings of the 27th International Conference on Computational Linguistics: System Demonstrations, Santa Fé, New Mexico, 20–26 August 2018, pp. 53-57
Moosavi NS, Strube M (2018). Using linguistic features to improve generalization in neural coreference resolvers, In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, 31 October – 4 November 2018, pp. 193-203
Suter J, Strube M (2018). Extending and exploiting the entity graph for analysis, classification and visualization of German texts, In Proceedings of the 14th Conference on Natural Language Processing (KONVENS), Vienna, Austria, 17–19 September 2018, pp. 136-140
Alfano M, Hovy D, Mitchell M, Strube M (2018). Proceedings of the 2nd ACL Workshop on Ethics in Natural Language Processing, New Orleans, Louisiana, 5 June 2018, http://aclweb.org/anthology/W18-0800.pdf

2017

Born L, Mesgar M, Strube M (2017). Using a graph-based coherence model in document-Level machine translation, In Proceedings of the 3rd Workshop on Discourse in Machine Translation, Copenhagen, Denmark, 8 September 2017, pp. 26-35
Heinzerling B, Strube M, Lin C (2017). Trust, but verify! Better entity linking through automatic verification, In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Valencia, Spain, 3–7 April 2017, pp. 828-838
Heinzerling B, Moosavi NS, Strube M (2017). Revisiting selectional preferences for coreference resolution, In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark, 7–11 September 2017, pp. 1343-1350
Hovy D, Spruit S, Mitchell M, Bender EM, Strube M, Wallach H (2017). Proceedings of the First ACL Workshop on Ethics in Natural Language Processing, Valencia, Spain, 4 April 2017, http://www.aclweb.org/anthology/W17-16.pdf
Judea A, Strube M (2017). Event argument identification on dependency graphs with bidirectional LSTMs, In Proceedings of the 8th International Joint Conference on Natural Language Processing, Taipei, Taiwan, 27 November – 1 December 2017, pp. 822-831
Kurohashi S, Strube M (2017). Proceedings of the IJCNLP 2017, Tutorial Abstracts, Taipei, Taiwan, 27 November 2017, http://www.aclweb.org/anthology/I17-5.pdf
Moosavi NS, Strube M (2017). Use generalized representations, but do not forget surface features, In Proceedings of the 2nd Workshop on Coreference Resolution Beyond OntoNotes, Valencia, Spain, 4 April 2017, pp. 1-7
Moosavi NS, Strube M (2017). Lexical features in coreference resolution: To be used with caution, In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (vol. 2: Short Papers), Vancouver, B.C., Canada, 30 July – 4 August 2017

2016

Resch B, Summa A, Zeile P, Strube M (2016). Citizen-centric urban planning through extracting emotion information from Twitter in an interdisciplinary space-time-linguistics algorithm, Urban Planning 1(2):114
Heinzerling B, Judea A, Strube M (2016). HITS at TAC KBP 2015: Entity discovery and linking, and event nugget detection, In Proceedings of the Text Analysis Conference, National Institute of Standards and Technology, Gaithersburg, Maryland, USA, 16–17 November 2015
Judea A, Strube M (2016). Incremental global event extraction, In Proceedings of the 26th International Conference on Computational Linguistics, Osaka, Japan, 11–16 December 2016, pp. 2279-2289
Mesgar M, Strube M (2016). Lexical coherence graph modeling using word embeddings, In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, Cal., 12–17 June 2016, pp. 1414-1423
Moosavi NS, Strube M (2016). Search space pruning: A simple solution for better coreference resolvers, In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, Cal., 12–17 June 2016, pp. 1005-1011
Moosavi NS, Strube M (2016). Which coreference evaluation metric do you trust? A proposal for a link-based entity aware metric, In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (vol. 1: Long Papers), Berlin, Germany, 7–12 August 2016, pp. 632-642
Parveen D, Mesgar M, Strube M (2016). Generating coherent summaries of scientific articles using coherence patterns, In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, Austin, Tex., 1–5 November 2016, pp. 772-783
Remse M, Mesgar M, Strube M (2016). Feature-rich error detection in scientific writing using logistic regression, In Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications, San Diego, Cal., 16 June 2016, pp. 162-171
Summa A, Resch B, Strube M (2016). Microblog emotion classification by computing similarity in text, time, and space, In Proceedings of the Workshop on Computational Modeling of People’s Opinions, Personality, and Emotions in Social Media, Osaka, Japan, 12 December 2016, pp. 153-162
