Data Anonymization through Collaborative Multi-view Microaggregation

Abdelouahid Lyhyaoui; Nicoleta Rogovschi; Younès Bennani; Sarah Zouinina

Journal of Intelligent Systems 30 (1):327-345 (2020)

Abstract

Interest in data anonymization is growing rapidly, motivated by governments' desire to open their data. The main challenge of data anonymization is to balance data utility against disclosure risk. One of the best-known frameworks for data anonymization is k-anonymity, which considers a dataset anonymous if and only if, for each element of the dataset, there exist at least k − 1 other elements identical to it. In this paper, we propose two techniques to achieve k-anonymity through microaggregation: k-CMVM and Constrained-CMVM. Both use topological collaborative clustering to obtain k-anonymous data. The first determines the k levels automatically, and the second defines them by exploration. We also improve the results of these two approaches by using pLVQ2 as a weighted vector quantization method. The four proposed methods are shown to be efficient using two data utility measures, the separability utility and the structural utility. The experimental results show very promising performance.
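The k-anonymity condition and the general idea of microaggregation described in the abstract can be illustrated with a minimal Python sketch. It is not the k-CMVM or Constrained-CMVM method: the group labels are assumed to be given, whereas the paper obtains them through topological collaborative clustering, which is not reproduced here. The function names `is_k_anonymous` and `microaggregate` are hypothetical and chosen only for illustration.

```python
from collections import Counter


def is_k_anonymous(records, k):
    """Return True if every record has at least k - 1 identical copies,
    i.e. each distinct record occurs at least k times."""
    counts = Counter(tuple(r) for r in records)
    return all(c >= k for c in counts.values())


def microaggregate(records, labels):
    """Replace each numeric record by the mean of its group.

    `labels[i]` is the group of `records[i]`. The grouping is an
    assumption of this sketch; any clustering could supply it.
    """
    sizes = Counter(labels)
    sums = {}
    for rec, lab in zip(records, labels):
        acc = sums.setdefault(lab, [0.0] * len(rec))
        for i, value in enumerate(rec):
            acc[i] += value
    centroids = {
        lab: tuple(s / sizes[lab] for s in acc) for lab, acc in sums.items()
    }
    return [centroids[lab] for lab in labels]


# Toy data: four records split into two groups of two.
records = [(1.0, 2.0), (3.0, 4.0), (10.0, 10.0), (12.0, 14.0)]
labels = [0, 0, 1, 1]
anonymized = microaggregate(records, labels)
```

In this toy case the original records are all distinct, so they are not 2-anonymous. After each record is replaced by its group mean, every released record appears twice, so the result is 2-anonymous. In general, such a release is k-anonymous whenever every group contains at least k records.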

DOI

10.1515/jisys-2020-0026
