Data Poisoning Attacks Against Multimodal Encoders

Ziqing Yang^*, Xinlei He, Zheng Li, Michael Backes, Mathias Humbert, Pascal Berrang, Yang Zhang

^*Corresponding author for this work

Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Recently, multimodal models, which leverage both visual and linguistic modalities to train powerful encoders, have gained increasing attention. However, learning from a large-scale unlabeled dataset also exposes the model to the risk of poisoning attacks, whereby the adversary aims to perturb the model’s training data to trigger malicious behaviors in it. In contrast to previous work, which poisons only the visual modality, we take the first step toward studying poisoning attacks against multimodal models in both the visual and linguistic modalities. Specifically, we focus on answering two questions: (1) Is the linguistic modality also vulnerable to poisoning attacks? and (2) Which modality is most vulnerable? To answer these two questions, we propose three types of poisoning attacks against multimodal models. Extensive evaluations on different datasets and model architectures show that all three attacks can achieve significant attack performance while maintaining model utility in both visual and linguistic modalities. Furthermore, we observe that the poisoning effect differs between modalities. To mitigate the attacks, we propose both pretraining and post-training defenses. We empirically show that both defenses can significantly reduce the attack performance while preserving the model’s utility. Our code is available at https://github.com/zqypku/mm_poison/.

Original language	English
Title of host publication	Proceedings of the 40th International Conference on Machine Learning
Editors	Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett
Publisher	Proceedings of Machine Learning Research
Pages	39299-39313
Number of pages	15
Publication status	Published - 31 Aug 2023
Event	The Fortieth International Conference on Machine Learning - Hawaii Convention Center, Honolulu, United States. Duration: 23 Jul 2023 → 29 Jul 2023

Publication series

Name	Proceedings of Machine Learning Research
Volume	202
ISSN (Electronic)	2640-3498

Conference

Conference	The Fortieth International Conference on Machine Learning
Abbreviated title	ICML 2023
Country/Territory	United States
City	Honolulu
Period	23/07/23 → 29/07/23

Access to Document

YangZ2023Data: Final published version, 1.04 MB
Licence: Creative Commons: Attribution (CC BY)

https://proceedings.mlr.press/v202/yang23f/yang23f.pdf
Licence: Creative Commons: Attribution (CC BY)

Cite this

Yang, Z., He, X., Li, Z., Backes, M., Humbert, M., Berrang, P., & Zhang, Y. (2023). Data Poisoning Attacks Against Multimodal Encoders. In A. Krause, E. Brunskill, K. Cho, B. Engelhardt, S. Sabato, & J. Scarlett (Eds.), Proceedings of the 40th International Conference on Machine Learning (pp. 39299-39313). (Proceedings of Machine Learning Research; Vol. 202). Proceedings of Machine Learning Research. https://proceedings.mlr.press/v202/yang23f/yang23f.pdf

Yang, Ziqing ; He, Xinlei ; Li, Zheng et al. / Data Poisoning Attacks Against Multimodal Encoders. Proceedings of the 40th International Conference on Machine Learning. editor / Andreas Krause ; Emma Brunskill ; Kyunghyun Cho ; Barbara Engelhardt ; Sivan Sabato ; Jonathan Scarlett. Proceedings of Machine Learning Research, 2023. pp. 39299-39313 (Proceedings of Machine Learning Research).

@inproceedings{683eeb8dede64cc2b73dedfad1404ccd,
title = "Data Poisoning Attacks Against Multimodal Encoders",
abstract = "Recently, multimodal models, which leverage both visual and linguistic modalities to train powerful encoders, have gained increasing attention. However, learning from a large-scale unlabeled dataset also exposes the model to the risk of poisoning attacks, whereby the adversary aims to perturb the model{\textquoteright}s training data to trigger malicious behaviors in it. In contrast to previous work, which poisons only the visual modality, we take the first step toward studying poisoning attacks against multimodal models in both the visual and linguistic modalities. Specifically, we focus on answering two questions: (1) Is the linguistic modality also vulnerable to poisoning attacks? and (2) Which modality is most vulnerable? To answer these two questions, we propose three types of poisoning attacks against multimodal models. Extensive evaluations on different datasets and model architectures show that all three attacks can achieve significant attack performance while maintaining model utility in both visual and linguistic modalities. Furthermore, we observe that the poisoning effect differs between modalities. To mitigate the attacks, we propose both pretraining and post-training defenses. We empirically show that both defenses can significantly reduce the attack performance while preserving the model{\textquoteright}s utility. Our code is available at https://github.com/zqypku/mm_poison/.",
author = "Ziqing Yang and Xinlei He and Zheng Li and Michael Backes and Mathias Humbert and Pascal Berrang and Yang Zhang",
year = "2023",
month = aug,
day = "31",
language = "English",
series = "Proceedings of Machine Learning Research",
publisher = "Proceedings of Machine Learning Research",
pages = "39299--39313",
editor = "Andreas Krause and Emma Brunskill and Kyunghyun Cho and Barbara Engelhardt and Sivan Sabato and Jonathan Scarlett",
booktitle = "Proceedings of the 40th International Conference on Machine Learning",
note = "The Fortieth International Conference on Machine Learning, ICML 2023 ; Conference date: 23-07-2023 Through 29-07-2023",
}

Yang, Z, He, X, Li, Z, Backes, M, Humbert, M, Berrang, P & Zhang, Y 2023, Data Poisoning Attacks Against Multimodal Encoders. in A Krause, E Brunskill, K Cho, B Engelhardt, S Sabato & J Scarlett (eds), Proceedings of the 40th International Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 202, Proceedings of Machine Learning Research, pp. 39299-39313, The Fortieth International Conference on Machine Learning, Honolulu, Hawaii, United States, 23/07/23. <https://proceedings.mlr.press/v202/yang23f/yang23f.pdf>

Data Poisoning Attacks Against Multimodal Encoders. / Yang, Ziqing; He, Xinlei; Li, Zheng et al.
Proceedings of the 40th International Conference on Machine Learning. ed. / Andreas Krause; Emma Brunskill; Kyunghyun Cho; Barbara Engelhardt; Sivan Sabato; Jonathan Scarlett. Proceedings of Machine Learning Research, 2023. p. 39299-39313 (Proceedings of Machine Learning Research; Vol. 202).

TY  - GEN
T1  - Data Poisoning Attacks Against Multimodal Encoders
AU  - Yang, Ziqing
AU  - He, Xinlei
AU  - Li, Zheng
AU  - Backes, Michael
AU  - Humbert, Mathias
AU  - Berrang, Pascal
AU  - Zhang, Yang
PY  - 2023/8/31
Y1  - 2023/8/31
AB  - Recently, multimodal models, which leverage both visual and linguistic modalities to train powerful encoders, have gained increasing attention. However, learning from a large-scale unlabeled dataset also exposes the model to the risk of poisoning attacks, whereby the adversary aims to perturb the model’s training data to trigger malicious behaviors in it. In contrast to previous work, which poisons only the visual modality, we take the first step toward studying poisoning attacks against multimodal models in both the visual and linguistic modalities. Specifically, we focus on answering two questions: (1) Is the linguistic modality also vulnerable to poisoning attacks? and (2) Which modality is most vulnerable? To answer these two questions, we propose three types of poisoning attacks against multimodal models. Extensive evaluations on different datasets and model architectures show that all three attacks can achieve significant attack performance while maintaining model utility in both visual and linguistic modalities. Furthermore, we observe that the poisoning effect differs between modalities. To mitigate the attacks, we propose both pretraining and post-training defenses. We empirically show that both defenses can significantly reduce the attack performance while preserving the model’s utility. Our code is available at https://github.com/zqypku/mm_poison/.
UR  - https://proceedings.mlr.press/pmlr-license-agreement.pdf
UR  - https://icml.cc/
UR  - https://proceedings.mlr.press/v202/
M3  - Conference contribution
T3  - Proceedings of Machine Learning Research
SP  - 39299
EP  - 39313
BT  - Proceedings of the 40th International Conference on Machine Learning
A2  - Krause, Andreas
A2  - Brunskill, Emma
A2  - Cho, Kyunghyun
A2  - Engelhardt, Barbara
A2  - Sabato, Sivan
A2  - Scarlett, Jonathan
PB  - Proceedings of Machine Learning Research
T2  - The Fortieth International Conference on Machine Learning
Y2  - 23 July 2023 through 29 July 2023
ER  -
