Efficient estimation of statistical functions while preserving client-side privacy

[thumbnail of WRAP_Theses_Kulkarni_2019.pdf]
Preview
PDF
WRAP_Theses_Kulkarni_2019.pdf - Submitted Version - Requires a PDF viewer.

Download (4MB) | Preview

Request Changes to record.

Abstract

Aggregating service users’ personal data for analytical purposes is a common practice in today’s Internet economy. However, distrust in the data aggregator, data breaches and risks of subpoenas pose significant challenges in the availability of data. The framework of differential privacy is enjoying wide attention due to its scalability and rigour of privacy protection it provides, and has become a de facto standard for facilitating privacy preserving information extraction. In this dissertation, we design and implement resource efficient algorithms for three fundamental data analysis primitives, marginal, range, and count queries while providing strong differential privacy guarantees.

The first two queries are studied in the strict scenario of untrusted aggregation (aka local model) in which the data collector is allowed to only access the noisy/perturbed version of users’ data but not their true data. To the best of our knowledge, marginal and range queries have not been studied in detail in the local setting before our works. We show that our simple data transfomation techniques help us achieve great accuracy in practice and can be used for performing more interesting analysis.

Finally, we revisit the problem of count queries under trusted aggregation. This setting can also be viewed as a relaxation of the local model called limited precision local differential privacy. We first discover certain weakness in a well-known optimization framework leading to solutions exhibiting pathological behaviours. We then propose more constraints in the framework to remove these weaknesses without compromising too much on utility.

Item Type: Thesis [via Doctoral College] (PhD)
Subjects: Q Science > QA Mathematics > QA76 Electronic computers. Computer science. Computer software
Library of Congress Subject Headings (LCSH): Quantitative research, Data sets -- Access control, Data protection, Computer algorithms
Official Date: September 2019
Dates:
Date
Event
September 2019
UNSPECIFIED
Institution: University of Warwick
Theses Department: Department of Computer Science
Thesis Type: PhD
Publication Status: Unpublished
Supervisor(s)/Advisor: Cormode, Graham, 1977-
Sponsors: University of Warwick. Institute for the Science of Cities ; Alan Turing Institute
Format of File: pdf
Extent: xiii, 144 leaves : illustrations (some colour)
Language: eng
URI: https://wrap.warwick.ac.uk/150359/

Export / Share Citation


Request changes or add full text files to a record

Repository staff actions (login required)

View Item View Item