Parallel Differentially Private K-Means Implementation Using COMPSs Framework

Authors Sukgamon Sukpisit, Srdjan Skrbic, Dusan Jakovetic
Title Parallel Differentially Private K-Means Implementation Using COMPSs Framework
Abstract K-means is one of the most important clustering algorithms, but the clustering process introduces a risk of privacy disclosure. One approach to this problem is to apply differential privacy to the K-means clustering algorithm, which effectively prevents privacy disclosure. The increasing amounts of information generated in big data processing scenarios make clustering a challenging task, and various approaches to parallelizing clustering algorithms have been attempted in response. This paper presents an implementation of a differentially private K-means clustering algorithm that uses ε-differential privacy and is based on the COMPSs framework for parallel computing. The experimental results show that the proposed implementation scales well and can be used to efficiently process large datasets on high-performance computing equipment.
ISBN -
Conference 10th International Conference on Information Society and Technology (ICIST 2020)
Date 8-11 March 2020
Location Kopaonik, Serbia
Url https://zenodo.org/record/4314276#.X9h2MC0RqZw
DOI -