Computer Science – Databases
Scientific paper
2007-07-03
Computer Science
Databases
21 pages, A short version of this paper has been accepted in FCT 2009 - 17th International Symposium on Fundamentals of Comput
Scientific paper
The problem of publishing personal data without giving up privacy is becoming increasingly important. An interesting formalization recently proposed is the k-anonymity. This approach requires that the rows in a table are clustered in sets of size at least k and that all the rows in a cluster become the same tuple, after the suppression of some records. The natural optimization problem, where the goal is to minimize the number of suppressed entries, is known to be NP-hard when the values are over a ternary alphabet, k = 3 and the rows length is unbounded. In this paper we give a lower bound on the approximation factor that any polynomial-time algorithm can achive on two restrictions of the problem,namely (i) when the records values are over a binary alphabet and k = 3, and (ii) when the records have length at most 8 and k = 4, showing that these restrictions of the problem are APX-hard.
Bonizzoni Paola
Dondi Riccardo
Vedova Gianluca Della
No associations
LandOfFree
The $k$-anonymity Problem is Hard does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.
If you have personal experience with The $k$-anonymity Problem is Hard, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and The $k$-anonymity Problem is Hard will most certainly appreciate the feedback.
Profile ID: LFWR-SCP-O-179677