Disclosure limitation through additive noise data masking: Analysis of skewed sensitive data

Research output: Contribution to journalConference articlepeer-review

Abstract

A widely used method for confidentiality protection in statistical databases is to add zero mean noise to sensitive attribute values. Most studies assume that the attributes are normally distributed. Using an exponential random variable as an example, this article investigates the effect of additive noise data masking for attributes with skewed distributions. Examples of exponentially distributed sensitive attributes used for statistical analysis include the time between testing HIV positive and the manifestation of symptoms for AIDS and the time between consecutive arrests for repeat offenders. We analyze the issues of data quality and confidentiality protection. Our results indicate that skewed attributes are, in some sense, better protected than normally distributed attributes under additive noise data masking.

Original languageEnglish
Pages (from-to)581-586
Number of pages6
JournalProceedings of the Hawaii International Conference on System Sciences
Volume3
StatePublished - 1997
Externally publishedYes
EventProceedings of the 1997 30th Annual Hawaii International Conference on System Sciences. Part 1 (of 6) - Wailea, HI, USA
Duration: Jan 7 1997Jan 10 1997

ASJC Scopus Subject Areas

  • General Computer Science

Fingerprint

Dive into the research topics of 'Disclosure limitation through additive noise data masking: Analysis of skewed sensitive data'. Together they form a unique fingerprint.

Cite this