Mining adversarial patterns via regularized loss minimization

Wei Liu*, Sanjay Chawla

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

46 Citations (Scopus)

Abstract

Traditional classification methods assume that the training and the test data arise from the same underlying distribution. However, in several adversarial settings, the test set is deliberately constructed in order to increase the error rates of the classifier. A prominent example is spam email where words are transformed to get around word based features embedded in a spam filter. In this paper we model the interaction between a data miner and an adversary as a Stackelberg game with convex loss functions. We solve for the Nash equilibrium which is a pair of strategies (classifier weights, data transformations) from which there is no incentive for either the data miner or the adversary to deviate. Experiments on synthetic and real data demonstrate that the Nash equilibrium solution leads to solutions which are more robust to subsequent manipulation of data and also provide interesting insights about both the data miner and the adversary.

Original languageEnglish
Pages (from-to)69-83
Number of pages15
JournalMachine Learning
Volume81
Issue number1
DOIs
Publication statusPublished - Oct 2010
Externally publishedYes

Keywords

  • Loss minimization
  • Nash equilibrium
  • Stackelberg game

Fingerprint

Dive into the research topics of 'Mining adversarial patterns via regularized loss minimization'. Together they form a unique fingerprint.

Cite this