{"id":7791,"date":"2020-08-12T12:28:00","date_gmt":"2020-08-12T10:28:00","guid":{"rendered":"https:\/\/nr.no\/?post_type=bc_project&#038;p=7791"},"modified":"2025-04-02T08:33:24","modified_gmt":"2025-04-02T06:33:24","slug":"the-cleanup-project","status":"publish","type":"bc_project","link":"https:\/\/nr.no\/en\/projects\/the-cleanup-project\/","title":{"rendered":"Automatically anonymising text documents (CLEANUP)"},"content":{"rendered":"\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-28f84493 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:66.66%\">\n<p class=\"has-sizing-large\"><br><strong>The goal of CLEANUP is to develop new machine learning methods to automatically anonymise text documents with personal data, such as electronic health records, court decisions or chat-based interactions with customers.<\/strong><\/p>\n\n\n\n<p class=\"has-sizing-medium\">The main idea of the project is to combine approaches from natural language processing and privacy to design a new generation of anonymisation techniques.<\/p>\n\n\n\n<p class=\"has-sizing-medium\">The purpose is to modify text documents in a way that prevents the disclosure of personal information, while preserving the internal context and semantic content of the documents.<\/p>\n\n\n\n<p>One of the methods we are testing is text sanitization, the task of redacting a document to mask all occurrences of (direct or indirect) personal identifiers, with the goal of concealing the identity of the individual(s) referred to in it.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\" id=\"partners\">Partners<\/h4>\n\n\n\n<p class=\"has-sizing-medium\">The project brings together researchers from machine learning, natural language processing, data protection, statistical modelling, health informatics and IT law.<\/p>\n\n\n\n<p class=\"has-sizing-medium\">In addition, partners from the Norwegian public 
and private sectors (which cover insurance, welfare, health services and legal publishing) contribute to the project with computer and domain knowledge.<\/p>\n\n\n\n<p class=\"has-sizing-medium\"><\/p>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:33.33%\">\n<h3 class=\"wp-block-heading\">To learn more about this project, please contact:<\/h3>\n\n\n\t\t<div id=\"post-type-multi-block_e303df2522775e3afeca5d9eff4485e7\" class=\"wp-block-post-type-multi type-manual style-card-bc_employee t2-grid\">\n\t\t\t\t\t\t\t<div class=\"t2-grid-item-col-12\">\n\t\t\t\t\t\t<a href=\"https:\/\/nr.no\/en\/employees\/pierre-lison\/\" class='card-employee'>\n\t\t\t\t\t<figure>\n\t\t\t\t<img decoding=\"async\" src=\"https:\/\/nr.no\/content\/uploads\/sites\/2\/2024\/05\/pierre-lison-24.jpg\" alt=\"\">\n\t\t\t<\/figure>\n\t\t\t\t<div class=\"card-employee__content\">\n\t\t\t<p class=\"card-employee__name\">Pierre Lison<\/p>\n\t\t\t\t\t\t\t<p class=\"card-employee__position\">Chief Research Scientist<\/p>\n\t\t\t\t\t\t<svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" viewBox=\"0 0 24 24\" height=\"24\" width=\"24\" class=\"t2-icon t2-icon-arrowforward\" aria-hidden=\"true\" focusable=\"false\"><path d=\"M15.9 4.259a1.438 1.438 0 0 1-.147.037c-.139.031-.339.201-.421.359-.084.161-.084.529-.001.685.035.066 1.361 1.416 2.947 3l2.882 2.88-10.19.02c-8.543.017-10.206.029-10.29.075-.282.155-.413.372-.413.685 0 .313.131.53.413.685.084.046 1.747.058 10.29.075l10.19.02-2.882 2.88c-1.586 1.584-2.912 2.934-2.947 3-.077.145-.085.521-.013.66a.849.849 0 0 0 .342.35c.156.082.526.081.68-.001.066-.035 1.735-1.681 3.709-3.656 2.526-2.53 3.606-3.637 3.65-3.742A.892.892 0 0 0 23.76 12a.892.892 0 0 0-.061-.271c-.044-.105-1.124-1.212-3.65-3.742-1.974-1.975-3.634-3.616-3.689-3.645-.105-.055-.392-.107-.46-.083\"\/><\/svg>\n\t\t<\/div>\n\t<\/a>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\n\n\n<div class=\"wp-block-group 
has-nr-dark-yellow-background-color has-background\">\n<p class=\"has-sizing-medium\">Project: CleanUp-project<\/p>\n\n\n\n<p class=\"has-sizing-medium\">Partners: &nbsp;The Faculty of Law and the Department of Informatics at University of Oslo, the Norwegian University of Science and Technology, University of Rovira i Virgili, DNB, Norwegian Labour and Welfare Administration, Gjensidige, Lovdata, Norsk Helsearkiv <\/p>\n\n\n\n<p class=\"has-sizing-medium\">Funding: Research Council of Norway&nbsp;<\/p>\n\n\n\n<p class=\"has-sizing-medium\">Period: 2020 \u2013 2024<\/p>\n<\/div>\n\n\n\n<p class=\"has-sizing-medium\"><\/p>\n\n\n\n<p class=\"has-sizing-medium has-nr-light-grey-background-color has-background\"><a aria-label=\"Project website for partners (opens in a new tab)\" href=\"http:\/\/cleanup.nr.no\" target=\"_blank\" rel=\"noreferrer noopener\" class=\"ek-link\">Project website for partners<\/a><\/p>\n<\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-28f84493 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:66.66%\"><\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:33.33%\"><\/div>\n<\/div>\n\n\n\n<p 
class=\"has-sizing-medium\"><\/p>\n","protected":false},"featured_media":30435,"template":"","meta":{"_acf_changed":false,"_trash_the_other_posts":false,"editor_notices":[],"footnotes":""},"class_list":["post-7791","bc_project","type-bc_project","status-publish","has-post-thumbnail"],"acf":[],"_links":{"self":[{"href":"https:\/\/nr.no\/en\/wp-json\/wp\/v2\/bc_project\/7791","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/nr.no\/en\/wp-json\/wp\/v2\/bc_project"}],"about":[{"href":"https:\/\/nr.no\/en\/wp-json\/wp\/v2\/types\/bc_project"}],"version-history":[{"count":5,"href":"https:\/\/nr.no\/en\/wp-json\/wp\/v2\/bc_project\/7791\/revisions"}],"predecessor-version":[{"id":34416,"href":"https:\/\/nr.no\/en\/wp-json\/wp\/v2\/bc_project\/7791\/revisions\/34416"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/nr.no\/en\/wp-json\/wp\/v2\/media\/30435"}],"wp:attachment":[{"href":"https:\/\/nr.no\/en\/wp-json\/wp\/v2\/media?parent=7791"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}