LREC Paper Accepted

The paper “BPEmb: Tokenization-free Pre-trained Subword Embeddings in 275 Languages” by Benjamin Heinzerling and Michael Strube has been accepted for LREC 2018, which will take place in Miyazaki, Japan, from May 7th to 12th, 2018. The paper comes with an innovative natural language resource, which can be downloaded from this GitHub page. BTW: The European…


Second Workshop ‘Ethics in NLP’

Michael Strube will co-organize the second workshop on Ethics in NLP, which will take place at NAACL-HLT 2018 in New Orleans, Louisiana, USA. Michael was also one of the main organizers of the first workshop, which took place in 2017.

Michael Strube at NAACL-HLT 2018

Michael Strube, NLP group leader, will serve as one of the area chairs for Information Extraction at the next NAACL-HLT conference. The conference, which is held by the North American chapter of the ACL, will take place in New Orleans, Louisiana, from June 1st to 8th, 2018.

Digitale Mündigkeit und Bürgerrechte

This content is only available in German.   Wissenschaftsministerin Theresia Bauer und netzpolitischer Sprecher Konstantin von Notz (Grüne) zum Informationsbesuch am HITS Zu einem Informationsbesuch über die Themen „Digitalisierung“ und „Künstliche Intelligenz“ kam jetzt Wissenschaftsministerin Theresia Bauer ans HITS, gemeinsam mit MdB Dr. Konstantin von Notz, dem netzpolitischen Sprecher der Grünen-Fraktion im Deutschen Bundestag und…


TPDL Paper Accepted

The paper “Semantic Author Name Disambiguation with Word Embeddings” by Mark-Christoph Müller has been accepted as a Long Paper at TPDL 2017 (International Conference on Theory and Practice of Digital Libraries). The conference will take place from September 18 to 21, 2017, in Thessaloniki, Greece. The paper is a result of the project SCAD –…


The Dark Side of Natural Language Processing

At the first “Ethics in Natural Language Processing” workshop in Valencia, scientists discussed the opportunities and dangers of automatic speech analysis. According to HITS researcher Michael Strube, “Exceedingly few people know how well we can analyze unstructured data.” Smartphones are a part of our everyday lives both at work and at home. We write emails,…


Publication in Scientometrics Journal

A study entitled “Data Sets for Author Name Disambiguation: An Empirical Analysis and a New Resource” has just been published in the Springer Scientometrics journal. The authors of the study, which was produced in the Leibniz-funded project “Scalable Author Disambiguation for Bibliographic Databases”, are Mark-Christoph Müller from the HITS NLP group, Florian Reitz from dblp…


Another Successful Defense

Sebastian Martschat (second from the right, with his examination committee) has successfully defended his PhD. thesis on “Structured Representations for Coreference Resolution”. Sebastian is a former HITS doctoral student who now works as a research assistant at Heidelberg University.

New Scientific Director: Michael Strube

As of January 2017, Prof. Dr. Michael Strube is the new Scientific Director (“Institutssprecher”) of HITS. As the position of Scientific Director rotates through the group leaders, this is a planned change in the HITS management. Rebecca Wade, who was Scientific Director for the last two years, stepped down from the position at the end…