20 20

Transactions on
Data Privacy
Foundations and Technologies

http://www.tdp.cat


Articles in Press

Accepted articles here

Latest Issues

Year 2026

Volume 19 Issue 2
Volume 19 Issue 1

Year 2025

Volume 18 Issue 3
Volume 18 Issue 2
Volume 18 Issue 1

Year 2024

Volume 17 Issue 3
Volume 17 Issue 2
Volume 17 Issue 1

Year 2023

Volume 16 Issue 3
Volume 16 Issue 2
Volume 16 Issue 1

Year 2022

Volume 15 Issue 3
Volume 15 Issue 2
Volume 15 Issue 1

Year 2021

Volume 14 Issue 3
Volume 14 Issue 2
Volume 14 Issue 1

Year 2020

Volume 13 Issue 3
Volume 13 Issue 2
Volume 13 Issue 1

Year 2019

Volume 12 Issue 3
Volume 12 Issue 2
Volume 12 Issue 1

Year 2018

Volume 11 Issue 3
Volume 11 Issue 2
Volume 11 Issue 1

Year 2017

Volume 10 Issue 3
Volume 10 Issue 2
Volume 10 Issue 1

Year 2016

Volume 9 Issue 3
Volume 9 Issue 2
Volume 9 Issue 1

Year 2015

Volume 8 Issue 3
Volume 8 Issue 2
Volume 8 Issue 1

Year 2014

Volume 7 Issue 3
Volume 7 Issue 2
Volume 7 Issue 1

Year 2013

Volume 6 Issue 3
Volume 6 Issue 2
Volume 6 Issue 1

Year 2012

Volume 5 Issue 3
Volume 5 Issue 2
Volume 5 Issue 1

Year 2011

Volume 4 Issue 3
Volume 4 Issue 2
Volume 4 Issue 1

Year 2010

Volume 3 Issue 3
Volume 3 Issue 2
Volume 3 Issue 1

Year 2009

Volume 2 Issue 3
Volume 2 Issue 2
Volume 2 Issue 1

Year 2008

Volume 1 Issue 3
Volume 1 Issue 2
Volume 1 Issue 1


Volume 19 Issue 2


Issues in Estimating Reidentification Risk Using Log-Linear Models in Complex Survey Samples

Lin Li(a),(*), Jianzhu Li(b), Tom Krenzke(c)

Transactions on Data Privacy 19:2 (2026) 81 - 111

Abstract, PDF

(a) Westat, 7501 Wisconsin Avenue, Bethesda, MD 20814, USA.

(b) FINRA, 1735 K St NW, Washington, DC 20006, USA.

(c) Westat, 7501 Wisconsin Avenue, Bethesda, MD 20814, USA.

e-mail:linli @westat.com; jianzhulee @hotmail.com; tomkrenzke @westat.com


Abstract

In this paper, we discuss some practical issues encountered when estimating record-level and file-level disclosure risk measures of re-identification in survey microdata under complex survey designs. We use the probabilistic modelling approach based on the Poisson Distribution and log-linear modelling proposed in Skinner and Shlomo (2008) to estimate disclosure risk in survey microdata files. We examine the robustness of their GOF criteria to violations of model assumptions, particularly in the context of complex survey designs and differential survey weights, using a case study and simulations. We also provide guidance for variable selection with insights on how to proceed with the disclosure risk assessment and provide meaningful results. For the case study, we use the complex survey dataset from the Survey of Doctorate Recipients conducted by the National Center for Science and Engineering Statistics. The results of evaluating the disclosure risk estimates under different approaches of adjusting the probabilistic modelling to account for the complex survey data lead to guidance for a sensitivity analysis that helps to provide better estimates of record-level and file-level risk of re-identification in survey microdata.

* Corresponding author.


ISSN: 1888-5063; ISSN (Digital): 2013-1631; Web Site: http://www.tdp.cat/
Contact: Transactions on Data Privacy; Vicenç Torra; Umeå University; 90187 Umeå (Sweden); e-mail:tdp@tdp.cat
Note: TDP's web site does not use cookies. TDP does not keep information neither on IP addresses nor browsers. For the privacy policy access here.

 


Vicenç Torra, Last modified: 09 : 27 February 02 2026.