[2007.14321] Label-Only Membership Inference Attacks

source link: https://arxiv.org/abs/2007.14321

Computer Science > Cryptography and Security

[Submitted on 28 Jul 2020 (v1), last revised 22 Jan 2021 (this version, v2)]

Label-Only Membership Inference Attacks


Membership inference attacks are one of the simplest forms of privacy leakage for machine learning models: given a data point and a model, determine whether the point was used to train the model. Existing membership inference attacks exploit models' abnormal confidence when queried on their training data. These attacks do not apply if the adversary only gets access to models' predicted labels, without a confidence measure. In this paper, we introduce label-only membership inference attacks. Instead of relying on confidence scores, our attacks evaluate the robustness of a model's predicted labels under perturbations to obtain a fine-grained membership signal. These perturbations include common data augmentations or adversarial examples. We empirically show that our label-only membership inference attacks perform on par with prior attacks that required access to model confidences. We further demonstrate that label-only attacks break multiple defenses against membership inference attacks that (implicitly or explicitly) rely on a phenomenon we call confidence masking. These defenses modify a model's confidence scores in order to thwart attacks, but leave the model's predicted labels unchanged. Our label-only attacks demonstrate that confidence masking is not a viable defense strategy against membership inference. Finally, we investigate worst-case label-only attacks that infer membership for a small number of outlier data points. We show that label-only attacks also match confidence-based attacks in this setting. We find that training models with differential privacy and (strong) L2 regularization are the only known defense strategies that successfully prevent all attacks. This remains true even when the differential privacy budget is too high to offer meaningful provable guarantees.
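The augmentation-based attack described in the abstract can be illustrated with a short sketch. The Python snippet below is a minimal, hypothetical illustration, not the authors' reference implementation: it queries the target model only for predicted labels on perturbed copies of a candidate point and treats label robustness as a membership score. The helper names, the Gaussian-noise perturbation, and the fixed threshold are illustrative assumptions; the paper's attacks use data augmentations such as translations and flips or adversarial perturbations, and calibrate decision thresholds (e.g., with shadow models).

import numpy as np

def augment(x, rng, n_views=8, noise_scale=0.05):
    # Generate perturbed copies of an input. Here: small Gaussian noise
    # (a stand-in for the augmentations used in the paper).
    return [x + rng.normal(0.0, noise_scale, size=x.shape) for _ in range(n_views)]

def robustness_score(predict_label, x, y_true, rng, n_views=8):
    # Fraction of perturbed queries whose predicted label still equals y_true.
    # Training points tend to be classified more robustly, so a higher score
    # is taken as evidence of membership.
    views = [x] + augment(x, rng, n_views=n_views)
    labels = [predict_label(v) for v in views]
    return np.mean([lab == y_true for lab in labels])

def infer_membership(predict_label, x, y_true, threshold=0.9, seed=0):
    # Label-only decision rule: predict "member" if the label is robust
    # under perturbation. In practice the threshold would be calibrated,
    # e.g., on shadow models, as in confidence-based attacks.
    rng = np.random.default_rng(seed)
    return robustness_score(predict_label, x, y_true, rng) >= threshold

if __name__ == "__main__":
    # Toy target "model": a fixed linear classifier over 2D points,
    # exposed only through its predicted label (no confidence scores).
    w = np.array([1.0, -1.0])
    predict_label = lambda x: int(x @ w > 0)

    x_candidate = np.array([2.0, -2.0])  # far from the decision boundary -> robust label
    print(infer_membership(predict_label, x_candidate, y_true=1))

Because the attacker never sees confidence scores, a defense that only perturbs or masks those scores (confidence masking) leaves this signal untouched, which is why such defenses fail against label-only attacks.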

Comments: 16 pages, 11 figures, 2 tables. Revision 2: 19 pages, 12 figures, 3 tables; improved text and additional experiments.
Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:2007.14321 [cs.CR] (or arXiv:2007.14321v2 [cs.CR] for this version)

Submission history

From: Christopher A. Choquette-Choo
[v1] Tue, 28 Jul 2020 15:44:31 UTC (652 KB)
[v2] Fri, 22 Jan 2021 01:42:20 UTC (677 KB)
