Chief Research Scientist

Pierre Lison

Show description information Hide description information
  • Chief Research Scientist at Norsk Regnesentral
  • Adjunct Associate Professor at the University of Oslo.

About

My main research interests are in natural language processing (NLP) and machine learning, most specifically the training, alignment, and evaluation of Large Language Models (LLMs) along with their deployment in various applications. Over my research career, I’ve worked on topics such as spoken dialogue systems, large-scale information extraction, data privacy, neural machine translation and human-robot interaction. I’m particularly interested in research topics at the intersection between NLP and other fields, from natural to social sciences. I’m also involved in several innovation-focused R&D projects looking at the use of LLMs and machine learning to solve practical problems in the public or private sector. 

Background

I’m originally from Belgium and graduated from the University of Louvain in 2006 in Computer Science & Engineering. Becoming increasingly interested by the intersection of computer science and linguistics, I moved to Saarbrücken (Germany) to pursue a master degree in Language Science & Technology. I graduated in 2008 and went on to work as a researcher at the German Research Centre for Artificial Intelligence (DFKI), where I was involved in several EU-funded projects on the development of dialogue systems for human-robot interaction. In 2011, I relocated to Norway to embark on a PhD in the Language Technology Group of the University of Oslo. In 2014, I defended my PhD thesis on probabilistic approaches to dialogue management and worked for two more years as a Postdoctoral Research Fellow within the same group on dialogue modelling for statistical machine translation. 

In 2016, I joined the Norwegian Computing Center (Norsk Regnesentral) as research scientist, where I work on various R&D projects related to natural language processing and machine learning. Two of my most recent projects are CLEANUP, which sought to develop new, data-driven methods for removing personal information from text data and GraphDial, which focused on dialogue management and investigated the use of knowledge graphs to represent the dialogue state of rich conversational domains. Other research projects I am / have been involved with include SAFERS (speech analytics for emergency response services), DialMT (dialogue modelling for machine translation), AICOM (linguistic analysis of human-LLM interactions), Oslo Analytics and most recently CyberRisk (cyber-threat intelligence and risk management).

In addition to my main position as Chief Research Scientist at NR, I also have an adjunct position as Associate Professor in the Language Technology Group of the University of Oslo, where I am involved in several courses in machine learning and natural language processing. I am also a former member of the Young Academy of Norway.

Projects

  • Machine learning
  • Language technology
  • Digital security and privacy

Semi-Automated Cyber Risk Management 

The image shows an illustrated robot against a brown background. Rudimentary illustration.
  • Machine learning

How do we understand machines that talk to us?

Publications

  • 154 publications found
  • Publisher

Re-identification of De-identified Documents with Autoregressive Infilling pp. 1192 1209 , doi: https://doi.org/10.18653/v1/2025.acl-long.60 , 2025. Scientific chapter / article / conference article

Evaluating the disclosure risk of anonymized documents via a machine learning-based re-identification attack Data mining and knowledge discovery, vol. 38, pp. 4040 4075 , (ISSN 1384-5810 1573-756X ), doi: https://doi.org/10.1007/s10618-024-01066-3 , 2024. Scientific article

Nå kan KI-generert tekst vannmerkes 2024. Article feature

Identifying Token-Level Dialectal Features in Social Media , 2023. Scientific chapter / article / conference article

Generation of Replacement Options in Text Sanitization pp. 292 300 , , 2023. Scientific chapter / article / conference article

Pierre Lison; Samia Touileb; Chat GPT egner seg dårlig til eksamenssensuren Morgenbladet, (ISSN 0805-3847 0806-2617 ), 2023. Article feature

Retrieval-Augmented Neural Response Generation Using Logical Reasoning and Relevance Scoring SemDial Proceedings, (ISSN 2308-2275 ), , 2023. Scientific article

Pierre Lison; Venn med kunstig intelligens 2023. Media interview

Pierre Lison; Kunstig Intelligens, en fare for menneskeheten? 2023. Media interview

Michael Riegler; Pierre Lison; Ingrid Lossius Falkum; Kunstig intelligens-modeller er ikke miniatyr­versjoner av den menneskelige hjernen Forskersonen.no, 2023. Article feature

Pierre Lison; Samia Touileb; Chat GPT egner seg dårlig til eksamenssensuren Morgenbladet, 2023. Article feature

Publisher Norsk Regnesentral

The GDPR and Unstructured Data: Is Anonymisation Possible? International Data Privacy Law (IDPL), vol. 12, pp. 184 206 , (ISSN 2044-3994 2044-4001 ), doi: https://doi.org/10.1093/idpl/ipac008 , 2022. Scientific article

Dis, c'est quoi l'intelligence artificielle? (ISSN 9782507057299 ), 2022. Popular science book

Publisher Renaissance Du Livre

Bootstrapping Text Anonymization Models with Distant Supervision pp. 4477 4487 , , 2022. Scientific chapter / article / conference article

Dialogue Management as Graph Transformations pp. 219 227 , doi: https://doi.org/10.1007/978-981-19-5538-9_15 , 2022. Scientific chapter / article / conference article

Automatic Evaluation of Disclosure Risks of Text Anonymization Methods pp. 157 171 , doi: https://doi.org/10.1007/978-3-031-13945-1_12 , 2022. Scientific chapter / article / conference article

The text anonymization benchmark (TAB): A dedicated corpus and evaluation framework for text anonymization Computational Linguistics, vol. 48, pp. 1053 1101 , (ISSN 0891-2017 1530-9312 ), doi: https://doi.org/10.1162/coli_a_00458 , 2022. Scientific article

Neural Text Sanitization with Explicit Measures of Privacy Risk pp. 217 229 , , 2022. Scientific chapter / article / conference article

Hva er universell utforming? 2022. Media participation

skweak: Weak Supervision Made Easy for NLP pp. 337 346 , doi: https://doi.org/10.18653/v1/2021.acl-demo.40 , 2021. Scientific chapter / article / conference article

Assessing the Quality of Human-Generated Summaries with Weakly Supervised Learning pp. 112 123 , , 2021. Scientific chapter / article / conference article

Publisher RobotDial workshop

Nicholas Thomas Walker; Torbjørn Dahl; Pierre Lison; Dialogue Management as Graph Transformations 2021. Scientific lecture

Vi må snakke om Bitcoin , 2021. Article feature

Pierre Lison; Jørgen Bølstad; Anders Kvellestad; Hva skal vi med logaritmer i grafer? Morgenbladet, (ISSN 0805-3847 0806-2617 ), 2021. Article feature

Welcome to Norway! , 2021. Article feature

Pierre Lison; Jeremy Barnes; Aliaksandr Hubin; Samia Touileb; Named Entity Recognition without Labelled Data: A Weak Supervision Approach (ISSN 978-1-950737-48-2 ), 2020. Scientific anthology / conference series

Publisher Association for Computational Linguistics

Named Entity Recognition without Labelled Data: A Weak Supervision Approach pp. 1518 1533 , , 2020. Scientific chapter / article / conference article

Kan kunstig intelligens "forstå" språk? Aftenposten (morgenutg. : trykt utg.), (ISSN 0804-3116 0807-2027 ), , 2020. Science for the public article

For enkelt om kunstig intelligens: – Diskriminerende og fordomsfull AI er ikke alltid lett å løse Forskning.no, (ISSN 1891-635X 1891-6341 ), , 2020. Reader opinion

Hva skjedde med «Don’t be evil»? , 2020. Article feature

Dialogue Modelling: Small data, Big data 2019. Scientific lecture

OpenSubtitles 2018: Statistical rescoring of sentence alignments in large, noisy parallel corpora pp. 1742 1748 , , 2018. Scientific chapter / article / conference article

Detecting Machine-translated Documents in Large Parallel Corpora pp. 25 32 , , 2018. Scientific chapter / article / conference article

Publisher Norsk Regnesentral

Incremental Processing for Neural Conversational Models SemDial Proceedings, pp. 162 163 , (ISSN 2308-2275 ), , 2017. Scientific article

Automatic Detection of Malware-Generated Domains with Recurrent Neural Models Norsk Informasjonssikkerhetskonferanse (NISK), (ISSN 1893-6563 1894-7735 ), , 2017. Scientific article

Redefining Context Windows for Word Embedding Models: An Experimental Study pp. 284 288 , , 2017. Scientific chapter / article / conference article

Neural Reputation Models learned from Passive DNS data pp. 3662 3671 , doi: https://doi.org/10.1109/BigData.2017.8258361 , 2017. Scientific chapter / article / conference article

Automatic Turn Segmentation of Movie and TV Subtitles pp. 245 252 , doi: https://doi.org/10.1109/SLT.2016.7846272 , 2016. Scientific chapter / article / conference article

Svetlana Stoyanchev; Pierre Lison; Srinivas Bangalore; Rapid Prototyping of Form-driven Dialogue Systems Using an Open-source Framework pp. 216 219 , doi: https://doi.org/10.18653/v1/w16-3626 , 2016. Scientific chapter / article / conference article

Pierre Lison; Casey Kennington; OpenDial: A Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules pp. 67 72 , doi: https://doi.org/10.18653/v1/p16-4012 , 2016. Scientific chapter / article / conference article

Paolo Dragone; Pierre Lison; Classification and Resolution of Non-Sentential Utterances in Dialogue Italian Journal of Computational Linguistics (IJCoL), pp. 45 62 24 , , 2016. Scientific article

Pierre Lison; A short introduction to statistical machine translation , 2016. Lecture popular

Pierre Lison; Jörg Tiedemann; OpenSubtitles2016: Extracting Large Parallel Corpora from Movie and TV Subtitles pp. 923 929 , , 2016. Scientific chapter / article / conference article

Paolo Dragone; Pierre Lison; An Active Learning Approach to the Classification of Non-Sentential Utterances pp. 115 119 , doi: https://doi.org/10.4000/books.aaccademia.1464 , 2015. Scientific chapter / article / conference article

Pierre Lison; Structured Probabilistic Modelling for Dialogue Management , 2015. Scientific lecture

Pierre Lison; Casey Kennington; Developing Spoken Dialogue Systems with the OpenDial toolkit , 2015. Poster

Paolo Dragone; Pierre Lison; Non-sentential utterances in dialogue: experiments in classification and interpretation SemDial Proceedings, pp. 170 172 , , 2015. Scientific article

Pierre Lison; Casey Kennington; Developing Spoken Dialogue Systems with the OpenDial Toolkit SemDial Proceedings, pp. 194 196 , , 2015. Scientific article

Pierre Lison; A hybrid approach to dialogue management based on probabilistic rules Computer Speech and Language, pp. 232 255 , doi: https://doi.org/10.1016/j.csl.2015.01.001 , 2015. Scientific article

Publisher Elsevier

Pierre Lison; Structured Probabilistic Modelling for Dialogue Management 2014. Scientific lecture

Pierre Lison; Raveesh Meena; Spoken Dialogue Systems: A New Frontier in Human-Computer Interaction ACM Crossroads, 2014. Science for the public article

Michal Kajetan Kosek; Pierre Lison; An Intelligent Tutoring System for Learning Chinese with a Cognitive Model of the Learner pp. 179 184 , doi: https://doi.org/10.14705/rpnet.2014.000214 , 2014. Scientific chapter / article / conference article

Pierre Lison; Structured Probabilistic Modelling for Dialogue Management , 2014. Doctor dissertat

Publisher Universitetet i Oslo

Pierre Lison; Model-based Bayesian Reinforcement Learning for Dialogue Management Interspeech, , 2013. Scientific article

Publisher International Speech Communication Association

Pierre Lison; Kan man snakke med en robot? , 2013. Lecture popular

Pierre Lison; Dr. Utenlansk , 2013. Media interview

Probabilistic Dialogue Models with Prior Domain Knowledge pp. 179 188 , , 2012. Scientific chapter / article / conference article

Publisher YRRSDS committee

Pierre Lison; Marta Recasens; Proceedings of the Student Research Workshop at the 13th Conference of the European Chapter of the Association for Computational Linguistics (ISSN 9781937284190 ), 2012. Scientific anthology / conference series

Publisher Association for Computational Linguistics

Social Robotics , 2012. Lecture

An Introduction to Machine Learning , 2012. Scientific lecture

Declarative Design of Spoken Dialogue Systems with Probabilistic Rules SemDial Proceedings, (ISSN 2308-2275 ), , 2012. Scientific article

Pierre Lison; Kan roboter lære seg selv å snakke med mennesker? , 2012. Media interview

Towards Dialogue Management in Relational Domains , 2012. Scientific chapter / article / conference article

Multi-Policy Dialogue Management pp. 294 300 , , 2011. Scientific chapter / article / conference article

Belief modelling for situation awareness in human-robot interaction 2010. Scientific chapter / article / conference article

Self-Understanding and Self-Extension: A Systems and Representational Approach IEEE Transactions on Autonomous Mental Development, vol. 2, pp. 282 303 , (ISSN 1943-0604 ), 2010. Scientific article

Continual processing of situated dialogue in human-robot collaborative activities 2010. Scientific chapter / article / conference article

Towards Relational POMDPs for Adaptive Dialogue Management 2010. Scientific chapter / article / conference article

Situated Dialogue Processing for Human-Robot Interaction 2010. Scientific chapter / article / conference article

Publisher Diplomica Verlag

Policy activation for open-ended dialogue management 2010. Scientific chapter / article / conference article

A salience-driven approach to speech recognition for human-robot interaction 2010. Scientific chapter / article / conference article

Robust processing of situated spoken dialogue 2009. Scientific chapter / article / conference article

Robust processing of situated spoken dialogue 2009. Scientific chapter / article / conference article

Efficient parsing of spoken inputs for human-robot interaction 2009. Scientific chapter / article / conference article

Salience-driven contextual priming of speech recognition for human-robot interaction 2008. Scientific chapter / article / conference article