Data Anonymization through Collaborative Multi-view Microaggregation

Journal of Intelligent Systems 30 (1):327-345 (2020)

Abstract

Interest in data anonymization is growing rapidly, motivated by governments' willingness to open their data. The main challenge of data anonymization is to balance data utility against disclosure risk. One of the best-known frameworks for data anonymization is k-anonymity, which considers a dataset anonymous if and only if, for each element of the dataset, there exist at least k − 1 other elements identical to it. In this paper, we propose two techniques that achieve k-anonymity through microaggregation: k-CMVM and Constrained-CMVM. Both use topological collaborative clustering to obtain k-anonymous data. The first determines the k levels automatically, and the second defines them by exploration. We also improve the results of these two approaches by using pLVQ2 as a weighted vector quantization method. The four proposed methods are shown to be efficient using two data utility measures, separability utility and structural utility. The experimental results show very promising performance.
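
To make the k-anonymity condition and the idea of microaggregation concrete, here is a minimal Python sketch. It is not the paper's k-CMVM or Constrained-CMVM method: the function names `is_k_anonymous` and `naive_microaggregate` are hypothetical, and the grouping step is a deliberately simple stand-in (sort the records, cut them into blocks of at least k, replace each record by its block centroid) for the topological collaborative clustering the abstract describes. It assumes records are numeric tuples of equal length.

```python
from collections import Counter
from typing import List, Sequence, Tuple

Record = Tuple[float, ...]


def is_k_anonymous(records: Sequence[Record], k: int) -> bool:
    """Return True if every record is identical to at least k - 1 others."""
    counts = Counter(records)
    return all(count >= k for count in counts.values())


def naive_microaggregate(records: Sequence[Record], k: int) -> List[Record]:
    """Replace each record by the centroid of a block of at least k records.

    Illustrative grouping only: records are sorted lexicographically and
    cut into consecutive blocks of size k. This is an assumption made for
    the sketch, not the clustering used in the paper.
    """
    if k < 1 or len(records) < k:
        raise ValueError("need k >= 1 and at least k records")

    # Indices of the records in sorted order, so output keeps input positions.
    order = sorted(range(len(records)), key=lambda i: records[i])
    blocks = [order[i:i + k] for i in range(0, len(order), k)]

    # Fold a short trailing block into the previous one so all blocks have >= k.
    if len(blocks) > 1 and len(blocks[-1]) < k:
        blocks[-2].extend(blocks.pop())

    out: List[Record] = [()] * len(records)
    for block in blocks:
        dim = len(records[block[0]])
        centroid = tuple(
            sum(records[i][d] for i in block) / len(block) for d in range(dim)
        )
        for i in block:
            out[i] = centroid
    return out


if __name__ == "__main__":
    data = [(1.0, 10.0), (2.0, 20.0), (3.0, 30.0), (10.0, 100.0), (11.0, 110.0)]
    masked = naive_microaggregate(data, k=2)
    print(is_k_anonymous(data, 2), is_k_anonymous(masked, 2))
```

Because every block holds at least k records and all of them receive the same centroid, the masked output satisfies the k-anonymity condition by construction. The utility cost depends entirely on how homogeneous the blocks are, which is where a better clustering step matters.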

Similar books and articles

Sharing private data through personalized search. Kei Karasawa - 2009 - Identity in the Information Society 2 (3):205-220.
Big Data: Ethical Considerations. G. Owen Schaefer, Markus K. Labude & Harisan Unais Nasir - 2018 - In David Boonin (ed.), The Palgrave Handbook of Philosophy and Public Policy. Springer Verlag. pp. 593-607.
Towards a Taxonomy of the Model-Ladenness of Data. Alisa Bokulich - forthcoming - PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association.
Data models and the acquisition and manipulation of data. Todd Harris - 2003 - Philosophy of Science 70 (5):1508-1517.
Mental Models in Data Interpretation. Clark A. Chinn & William F. Brewer - 1996 - Philosophy of Science 63 (5):S211-S219.

Analytics

Added to PP
2020-10-03
