Transactions on
Data Privacy
Foundations and Technologies
http://www.tdp.cat

Articles in Press

Accepted articles here

Latest Issues

Year 2025

Volume 18 Issue 3
Volume 18 Issue 2
Volume 18 Issue 1

Year 2024

Volume 17 Issue 3
Volume 17 Issue 2
Volume 17 Issue 1

Year 2023

Volume 16 Issue 3
Volume 16 Issue 2
Volume 16 Issue 1

Year 2022

Volume 15 Issue 3
Volume 15 Issue 2
Volume 15 Issue 1

Year 2021

Volume 14 Issue 3
Volume 14 Issue 2
Volume 14 Issue 1

Year 2020

Volume 13 Issue 3
Volume 13 Issue 2
Volume 13 Issue 1

Year 2019

Volume 12 Issue 3
Volume 12 Issue 2
Volume 12 Issue 1

Year 2018

Volume 11 Issue 3
Volume 11 Issue 2
Volume 11 Issue 1

Year 2017

Volume 10 Issue 3
Volume 10 Issue 2
Volume 10 Issue 1

Year 2016

Volume 9 Issue 3
Volume 9 Issue 2
Volume 9 Issue 1

Year 2015

Volume 8 Issue 3
Volume 8 Issue 2
Volume 8 Issue 1

Year 2014

Volume 7 Issue 3
Volume 7 Issue 2
Volume 7 Issue 1

Year 2013

Volume 6 Issue 3
Volume 6 Issue 2
Volume 6 Issue 1

Year 2012

Volume 5 Issue 3
Volume 5 Issue 2
Volume 5 Issue 1

Year 2011

Volume 4 Issue 3
Volume 4 Issue 2
Volume 4 Issue 1

Year 2010

Volume 3 Issue 3
Volume 3 Issue 2
Volume 3 Issue 1

Year 2009

Volume 2 Issue 3
Volume 2 Issue 2
Volume 2 Issue 1

Year 2008

Volume 1 Issue 3
Volume 1 Issue 2
Volume 1 Issue 1

Volume 16 Issue 3

Data Sanitization for t-Closeness over Multiple Numerical Sensitive Attributes

Rajiv Bagai^(a),(*), Eric Weber^(b), Vikas Thammanna Gowda^(a)

Transactions on Data Privacy 16:3 (2023) 191 - 210

(a) School of Computing, Wichita State University, Wichita, KS 67260-0083, USA.

(b) NetApp Inc., Wichita, KS 67226, USA.

e-mail:rajiv.bagai @wichita.edu; ;

Abstract

A popular technique for preserving privacy of individuals contained in any released data is to first sanitize the data according to the t-closeness principle. This principle requires partitioning rows of the original data into equivalence classes, in a way that the distribution of sensitive values in any class is sufficiently close, within a given threshold t, to their distribution in the original data. Most existing methods for constructing t-close equivalence classes consider just one sensitive attribute in the data, which is insufficient as many real-life datasets contain multiple sensitive attributes; partitioning attempts for multiple sensitive attributes have thus far been unsatisfactory. We present a method for generating t-close equivalence classes in the presence of multiple numerical sensitive attributes, where each such attribute has its own privacy threshold. The equivalence classes are generated in a way that minimizes information loss caused later by generalizing quasi identifier values within each class. While finding an optimal solution for this problem is known to be NP-hard, we show that our approach results in an acceptable solution in polynomial time.

^* Corresponding author.

ISSN: 1888-5063; ISSN (Digital): 2013-1631; Web Site: http://www.tdp.cat/
Contact: Transactions on Data Privacy; Vicenç Torra; Umeå University; 90187 Umeå (Sweden); e-mail:tdp@tdp.cat
Note: TDP's web site does not use cookies. TDP does not keep information neither on IP addresses nor browsers. For the privacy policy access here.

TDP

Vicenç Torra, Last modified: 10 : 38 July 04 2023.