Abstract
A widely used method for confidentiality protection in statistical databases is to add zero mean noise to sensitive attribute values. Most studies assume that the attributes are normally distributed. Using an exponential random variable as an example, this article investigates the effect of additive noise data masking for attributes with skewed distributions. Examples of exponentially distributed sensitive attributes used for statistical analysis include the time between testing HIV positive and the manifestation of symptoms for AIDS and the time between consecutive arrests for repeat offenders. We analyze the issues of data quality and confidentiality protection. Our results indicate that skewed attributes are, in some sense, better protected than normally distributed attributes under additive noise data masking.
| Original language | English |
|---|---|
| Pages (from-to) | 581-586 |
| Number of pages | 6 |
| Journal | Proceedings of the Hawaii International Conference on System Sciences |
| Volume | 3 |
| State | Published - 1997 |
| Externally published | Yes |
| Event | Proceedings of the 1997 30th Annual Hawaii International Conference on System Sciences. Part 1 (of 6) - Wailea, HI, USA Duration: Jan 7 1997 → Jan 10 1997 |
ASJC Scopus Subject Areas
- General Computer Science
Fingerprint
Dive into the research topics of 'Disclosure limitation through additive noise data masking: Analysis of skewed sensitive data'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS