Skip to main content



30 June 2022: Congratulations to Paul on passing his confirmation of status!

23 June 2022: Janet gave a talk on “Categorization and generalization in morphology.” at the Oxford Berlin Workshop on Morphology. She also gave a flash talk about NLP at the WiE  event celebrating International Women in Engineering Day.

22-23 June 2022: Felix and Janet were both involved in the OMI Conference on NLP in Economics and Finance. Felix co-taught a tutorial on “Using state-of-the-art language models for economic and financial modelling, and Janet opened tihe meeting with a talk on “The Successes and Failures of NLP in forecasting”. A great conference, that also included speakers from Bloomberg, Deepmind, Amazon, and numerous universities.

20 June 2022: Welcome to Li Zhang! Li Zhang finally got her visa and arrived from China to join Isabelle as PDRA on the "Exaggeration and Fragmentation..." project.

14-16 June 2022: Janet participated in a symposium held in York in honour of a grande dame of language acquisition research, Marilyn Vihman. Her talk was entiitled  “On the foundations of systematicity in the lexicon”.

31 May 2022: Congratulations to Alex and Mia for submitting excellent 4YP papers and doing a fine job in their vivas.

8 April 2022: 100% success with NAACL submissions!

  • Our members have had the following papers accepted for the North American Association for Computational Linguistics meeting in Seattle.
    • Drinkall, Zohren &  Pierrehumbert. “Forecasting COVID-19 Caseloads Using Unsupervised Embedding Clusters of Social Media Posts.” (main session).
    • Röttger, Vidgen, Hovy, Pierrehumbert “Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks” (main session)
    • Kirk, Vidgen, Röttger, Thrush & Hale “Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-based Hate” (main session).
    • Hofmann, Pierrehumbert, Schütze, “Modeling Ideological Agenda Setting and Framing in Polarized Online Groups with Graph Neural Networks and Structured Sparsity “ (Findings).
    • Röttger, Seelawi, Nozza, Talat, Vidgen. "Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models."

4 April 2022: Welcome to Isabelle

  • Isabelle Lorge has joined us a PDRA in text-mining and experimental semantics. She has a Ph.D in Linguistics from Cambridge and experience in the high-tech sector working for Arabesque AI on analyzing news and social media for stock market prediction.

1 April 2022: Paul is off to Milan

  • Paul starts an extended visit to Dirk Hovy’s group at Bocconi University in Milan.

16 March 2022: ICWSM success.

  • Hofmann, Schütze and Pierrehumbert,  “The Reddit Politosphere: A Large-Scale Text and Network Resource of Online Political Discourse”  has been accepted for International AAAI Conference on Web and Social Media (ICWSM 2022).

4 March 2022: Congratulations to Alex

  • Alex Goldie has been offered and has accepted a DPhil place in the highly competitive Oxford AIMS  (Autonomous Intelligent Machines and Systems) CDT Program.

1 March 2022: Valentin to Deepmind

  • Valentin Hofmann starts his internship in DeepMind’s Language Team.

1 March 2022: Janet joins PNAS Editorial Board

  • Janet joins the Editorial Board of PNAS (Proceedings of the National Academy of Sciences), where she will handle submissions in cognitive science and natural language processing. Together with Nature and Science, PNAS is one of the very top interdisciplinary scientific journals.

24 February 2022: Two papers accepted for ACL 2022

  • Hofmann, Schütze and Pierrehumbert, “An Embarrassingly Simple Method to Mitigate Undesirable Properties of Pretrained Language Model Tokenizers"
  • Weissweiler, Hofmann, Sabet & Schütze “CaMEL: Case marker extraction without labels”

9 February 2022: MPP 2022

  • Janet delivered a plenary address entitled “Using orthographic data to explore the mental lexicon” at MPP 2022 (Morphology in Phonetics and Perception), held virtually over Gather in Dusseldorf.

November 2021: Paper at EMNLP 2021 (Findings)

  • Temporal adaptation of BERT and performance on downstream document classification: Insights from social media

August 2021: Three papers at ACL 2021

  • Dynamic contextualized word embeddings
  • HateCheck: Functional tests for hate speech detection models
  • Superbizarre is not superb: Derivational morphology improves BERT’s interpretation of complex words

June 2021: Paper at SwissText 2021

  • Predicting COVID-19 cases using Reddit posts and other online resources

November 2020: Paper at EMNLP 2020

  • DagoBERT: Generating derivational morphology with a pretrained language model

July 2020: Two papers at ACL 2020

  • A graph auto-encoder model of derivational morphology
  • Predicting the growth of morphological families from social and linguistic factors