TFRP

TFRP: An efficient microaggregation algorithm for statistical disclosure control. Recently, the issue of statistic disclosure control (SDC) has attracted much attention. SDC is a very important part of data security dealing with the protection of databases. Microaggregation for SDC techniques is widely used to protect confidentiality in statistical databases released for public use. The basic problem of microaggregation is that similar records are clustered into groups, and each group contains at least k records to prevent disclosure of individual information, where k is a pre-defined security threshold. For a certain k, an optimal multivariable microaggregation has the lowest information loss. The minimum information loss is an NP-hard problem. Existing fixed-size techniques can obtain a low information loss with O(n2) or O(n3/k) time complexity. To improve the execution time and lower information loss, this study proposes the Two Fixed Reference Points (TFRP) method, a two-phase algorithm for microaggregation. In the first phase, TFRP employs the pre-computing and median-of-medians techniques to efficiently shorten its running time to O(n2/k). To decrease information loss in the second phase, TFRP generates variable-size groups by removing the lower homogenous groups. Experimental results reveal that the proposed method is significantly faster than the Diameter and the Centroid methods. Running on several test datasets, TFRP also significantly reduces information loss, particularly in sparse datasets with a large k.

Keywords for this software

Anything in here will be replaced on browsers that support the canvas element


References in zbMATH (referenced in 3 articles )

Showing results 1 to 3 of 3.
Sorted by year (citations)

  1. Monedero, David Rebollo; Mezher, Ahmad Mohamad; Colomé, Xavier Casanova; Forné, Jordi; Soriano, Miguel: Efficient (k)-anonymous microaggregation of multivariate numerical data via principal component analysis (2019)
  2. Panagiotakis, Costas; Tziritas, Georgios: A minimum spanning tree equipartition algorithm for microaggregation (2015)
  3. Chang, Chin-Chen; Li, Yu-Chiang; Huang, Wen-Hung: TFRP: An efficient microaggregation algorithm for statistical disclosure control. (2007) ioport