Английская Википедия:Adversarial machine learning

Шаблон:Short description Шаблон:Distinguish Шаблон:Machine learning Adversarial machine learning is the study of the attacks on machine learning algorithms, and of the defenses against such attacks.^[1] A survey from May 2020 exposes the fact that practitioners report a dire need for better protecting machine learning systems in industrial applications.^[2]

Most machine learning techniques are mostly designed to work on specific problem sets, under the assumption that the training and test data are generated from the same statistical distribution (IID). However, this assumption is often dangerously violated in practical high-stake applications, where users may intentionally supply fabricated data that violates the statistical assumption.

Some of the most common attacks in adversarial machine learning include evasion attacks,^[3] data poisoning attacks,^[4] Byzantine attacks^[5] and model extraction.^[6]

History

At the MIT Spam Conference in January 2004, John Graham-Cumming showed that a machine learning spam filter could be used to defeat another machine learning spam filter by automatically learning which words to add to a spam email to get the email classified as not spam.^[7]

In 2004, Nilesh Dalvi and others noted that linear classifiers used in spam filters could be defeated by simple "evasion attacks" as spammers inserted "good words" into their spam emails. (Around 2007, some spammers added random noise to fuzz words within "image spam" in order to defeat OCR-based filters.) In 2006, Marco Barreno and others published "Can Machine Learning Be Secure?", outlining a broad taxonomy of attacks. As late as 2013 many researchers continued to hope that non-linear classifiers (such as support vector machines and neural networks) might be robust to adversaries, until Battista Biggio and others demonstrated the first gradient-based attacks on such machine-learning models (2012^[8]–2013^[9]). In 2012, deep neural networks began to dominate computer vision problems; starting in 2014, Christian Szegedy and others demonstrated that deep neural networks could be fooled by adversaries, again using a gradient-based attack to craft adversarial perturbations.^[10]^[11]

Recently, it was observed that adversarial attacks are harder to produce in the practical world due to the different environmental constraints that cancel out the effect of noises.^[12]^[13] For example, any small rotation or slight illumination on an adversarial image can destroy the adversariality. In addition, researchers such as Google Brain's Nicholas Frosst point out that it is much easier to make self-driving cars^[14] miss stop signs by physically removing the sign itself, rather than creating adversarial examples.^[15] Frosst also believes that the adversarial machine learning community incorrectly assumes models trained on a certain data distribution will also perform well on a completely different data distribution. He suggests that a new approach to machine learning should be explored, and is currently working on a unique neural network that has characteristics more similar to human perception than state of the art approaches.^[15]

While adversarial machine learning continues to be heavily rooted in academia, large tech companies such as Google, Microsoft, and IBM have begun curating documentation and open source code bases to allow others to concretely assess the robustness of machine learning models and minimize the risk of adversarial attacks.^[16]^[17]^[18]

Examples

Examples include attacks in spam filtering, where spam messages are obfuscated through the misspelling of "bad" words or the insertion of "good" words;^[19]^[20] attacks in computer security, such as obfuscating malware code within network packets or modifying the characteristics of a network flow to mislead intrusion detection;^[21]^[22] attacks in biometric recognition where fake biometric traits may be exploited to impersonate a legitimate user;^[23] or to compromise users' template galleries that adapt to updated traits over time.

Researchers showed that by changing only one-pixel it was possible to fool deep learning algorithms.^[24] Others 3-D printed a toy turtle with a texture engineered to make Google's object detection AI classify it as a rifle regardless of the angle from which the turtle was viewed.^[25] Creating the turtle required only low-cost commercially available 3-D printing technology.^[26]

A machine-tweaked image of a dog was shown to look like a cat to both computers and humans.^[27] A 2019 study reported that humans can guess how machines will classify adversarial images.^[28] Researchers discovered methods for perturbing the appearance of a stop sign such that an autonomous vehicle classified it as a merge or speed limit sign.^[14]^[29]^[30]

McAfee attacked Tesla's former Mobileye system, fooling it into driving 50 mph over the speed limit, simply by adding a two-inch strip of black tape to a speed limit sign.^[31]^[32]

Adversarial patterns on glasses or clothing designed to deceive facial-recognition systems or license-plate readers, have led to a niche industry of "stealth streetwear".^[33]

An adversarial attack on a neural network can allow an attacker to inject algorithms into the target system.^[34] Researchers can also create adversarial audio inputs to disguise commands to intelligent assistants in benign-seeming audio;^[35] a parallel literature explores human perception of such stimuli.^[36]^[37]

Clustering algorithms are used in security applications. Malware and computer virus analysis aims to identify malware families, and to generate specific detection signatures.^[38]^[39]

Attack modalities

Taxonomy

Attacks against (supervised) machine learning algorithms have been categorized along three primary axes:^[40] influence on the classifier, the security violation and their specificity.

Classifier influence: An attack can influence the classifier by disrupting the classification phase. This may be preceded by an exploration phase to identify vulnerabilities. The attacker's capabilities might be restricted by the presence of data manipulation constraints.^[41]
Security violation: An attack can supply malicious data that gets classified as legitimate. Malicious data supplied during training can cause legitimate data to be rejected after training.
Specificity: A targeted attack attempts to allow a specific intrusion/disruption. Alternatively, an indiscriminate attack creates general mayhem.

This taxonomy has been extended into a more comprehensive threat model that allows explicit assumptions about the adversary's goal, knowledge of the attacked system, capability of manipulating the input data/system components, and on attack strategy.^[42]^[43] This taxonomy has further been extended to include dimensions for defense strategies against adversarial attacks.^[44]

Strategies

Below are some of the most commonly encountered attack scenarios.

Data poisoning

Poisoning consists of contaminating the training dataset with data designed to increase errors in the output. Given that learning algorithms are shaped by their training datasets, poisoning can effectively reprogram algorithms with potentially malicious intent. Concerns have been raised especially for user-generated training data, e.g. for content recommendation or natural language models. The ubiquity of fake accounts offers many opportunities for poisoning. Facebook reportedly removes around 7 billion fake accounts per year.^[45]^[46] Poisoning has been reported as the leading concern for industrial applications.^[2]

On social medias, disinformation campaigns attempt to bias recommendation and moderation algorithms, to push certain content over others.

A particular case of data poisoning is the backdoor attack,^[47] which aims to teach a specific behavior for inputs with a given trigger, e.g. a small defect on images, sounds, videos or texts.

For instance, intrusion detection systems are often trained using collected data. An attacker may poison this data by injecting malicious samples during operation that subsequently disrupt retraining.^[42]^[43]^[40]^[48]^[49]^[50]

Data poisoning techniques can also be applied to text-to-image models to alter their output.^[51]

Byzantine attacks

As machine learning is scaled, it often relies on multiple computing machines. In federated learning, for instance, edge devices collaborate with a central server, typically by sending gradients or model parameters. However, some of these devices may deviate from their expected behavior, e.g. to harm the central server's model^[52] or to bias algorithms towards certain behaviors (e.g., amplifying the recommendation of disinformation content). On the other hand, if the training is performed on a single machine, then the model is very vulnerable to a failure of the machine, or an attack on the machine; the machine is a single point of failure.^[53] In fact, the machine owner may themselves insert provably undetectable backdoors.^[54]

The current leading solutions to make (distributed) learning algorithms provably resilient to a minority of malicious (a.k.a. Byzantine) participants are based on robust gradient aggregation rules.^[55]^[56]^[57]^[58]^[59]^[60] The robust aggregation rules do not always work especially when the data across participants has a non-iid distribution. Nevertheless, in the context of heterogeneous honest participants, such as users with different consumption habits for recommendation algorithms or writing styles for language models, there are provable impossibility theorems on what any robust learning algorithm can guarantee.^[5]^[61]

Evasion

Evasion attacks^[9]^[42]^[43]^[62] consist of exploiting the imperfection of a trained model. For instance, spammers and hackers often attempt to evade detection by obfuscating the content of spam emails and malware. Samples are modified to evade detection; that is, to be classified as legitimate. This does not involve influence over the training data. A clear example of evasion is image-based spam in which the spam content is embedded within an attached image to evade textual analysis by anti-spam filters. Another example of evasion is given by spoofing attacks against biometric verification systems.^[23]

Evasion attacks can be generally split into two different categories: black box attacks and white box attacks.^[17]

Model extraction

Model extraction involves an adversary probing a black box machine learning system in order to extract the data it was trained on.^[63]^[64] This can cause issues when either the training data or the model itself is sensitive and confidential. For example, model extraction could be used to extract a proprietary stock trading model which the adversary could then use for their own financial benefit.

In the extreme case, model extraction can lead to model stealing, which corresponds to extracting a sufficient amount of data from the model to enable the complete reconstruction of the model.

On the other hand, membership inference is a targeted model extraction attack, which infers the owner of a data point, often by leveraging the overfitting resulting from poor machine learning practices.^[65] Concerningly, this is sometimes achievable even without knowledge or access to a target model's parameters, raising security concerns for models trained on sensitive data, including but not limited to medical records and/or personally identifiable information. With the emergence of transfer learning and public accessibility of many state of the art machine learning models, tech companies are increasingly drawn to create models based on public ones, giving attackers freely accessible information to the structure and type of model being used.^[65]

Specific attack types

There are a large variety of different adversarial attacks that can be used against machine learning systems. Many of these work on both deep learning systems as well as traditional machine learning models such as SVMs^[8] and linear regression.^[70] A high level sample of these attack types include:

Adversarial Examples^[71]
Trojan Attacks / Backdoor Attacks^[72]
Model Inversion^[73]
Membership Inference^[74]

Adversarial examples

An adversarial example refers to specially crafted input which is designed to look "normal" to humans but causes misclassification to a machine learning model. Often, a form of specially designed "noise" is used to elicit the misclassifications. Below are some current techniques for generating adversarial examples in the literature (by no means an exhaustive list).

Gradient-based evasion attack^[9]
Fast Gradient Sign Method (FGSM)^[75]
Projected Gradient Descent (PGD)^[76]
Carlini and Wagner (C&W) attack^[77]
Adversarial patch attack^[78]

Black box attacks

Black box attacks in adversarial machine learning assumes that the adversary can only get outputs for provided inputs and has no knowledge of the model structure or parameters.^[17]^[79] In this case, the adversarial example is generated either using a model created from scratch, or without any model at all (excluding the ability to query the original model). In either case, the objective of these attacks are to create adversarial examples that are able to transfer to the black box model in question.^[80]

Square Attack

The Square Attack was introduced in 2020 as a black box evasion adversarial attack based on querying classification scores without the need of gradient information.^[81] As a score based black box attack, this adversarial approach is able to query probability distributions across model output classes, but has no other access to the model itself. According to the paper's authors, the proposed Square Attack required less queries than when compared to state of the art score based black box attacks at the time.^[81]

To describe the function objective, the attack defines the classifier as <math display="inline">f:[0, 1]^d \rightarrow \reals^K</math>, with <math display="inline">d</math> representing the dimensions of the input and <math display="inline">K</math> as the total number of output classes. <math display="inline">f_k(x)</math> returns the score (or a probability between 0 and 1) that the input <math display="inline">x</math> belongs to class <math display="inline">k</math>, which allows the classifier's class output for any input <math display="inline">x</math> to be defined as <math display="inline">argmax_{k=1,...,K}f_k(x)</math>. The goal of this attack is as follows:^[81]

<math display="block">argmax_{k = 1,...,K}f_k(\hat{x}) \neq y, ||\hat{x} - x||_p \leq \epsilon \text{ and } \hat{x} \in [0, 1]^d</math>

In other words, finding some perturbed adversarial example <math display="inline">\hat{x}</math> such that the classifier incorrectly classifies it to some other class under the constraint that <math display="inline">\hat{x}</math> and <math display="inline">x</math> are similar. The paper then defines loss <math display="inline">L</math> as <math display="inline">L(f(\hat{x}), y) = f_y(\hat{x}) - \max_{k \neq y}f_k(\hat{x})</math> and proposes the solution to finding adversarial example <math display="inline">\hat{x}</math> as solving the below constrained optimization problem:^[81]

<math display="block">\min_{\hat{x} \in [0, 1]^d}L(f(\hat{x}), y), \text{ s.t. } ||\hat{x} - x||_p \leq \epsilon</math>

The result in theory is an adversarial example that is highly confident in the incorrect class but is also very similar to the original image. To find such example, Square Attack utilizes the iterative random search technique to randomly perturb the image in hopes of improving the objective function. In each step, the algorithm perturbs only a small square section of pixels, hence the name Square Attack, which terminates as soon as an adversarial example is found in order to improve query efficiency. Finally, since the attack algorithm uses scores and not gradient information, the authors of the paper indicate that this approach is not affected by gradient masking, a common technique formerly used to prevent evasion attacks.^[81]

HopSkipJump Attack

This black box attack was also proposed as a query efficient attack, but one that relies solely on access to any input's predicted output class. In other words, the HopSkipJump attack does not require the ability to calculate gradients or access to score values like the Square Attack, and will require just the model's class prediction output (for any given input). The proposed attack is split into two different settings, targeted and untargeted, but both are built from the general idea of adding minimal perturbations that leads to a different model output. In the targeted setting, the goal is to cause the model to misclassify the perturbed image to a specific target label (that is not the original label). In the untargeted setting, the goal is to cause the model to misclassify the perturbed image to any label that is not the original label. The attack objectives for both are as follows where <math display="inline">x</math> is the original image, <math display="inline">x^\prime</math> is the adversarial image, <math display="inline">d</math> is a distance function between images, <math display="inline">c^*</math> is the target label, and <math display="inline">C</math> is the model's classification class label function:^[82]

<math display="block">\textbf{Targeted: } \min_{x^\prime}d(x^\prime, x) \text{ subject to } C(x^\prime) = c^* </math>

<math display="block">\textbf{Untargeted: } \min_{x^\prime}d(x^\prime, x) \text{ subject to } C(x^\prime) \neq C(x) </math>

To solve this problem, the attack proposes the following boundary function <math display="inline">S</math> for both the untargeted and targeted setting:^[82]

<math display="block">S(x^\prime):= \begin{cases} max_{c \neq C(x)}{F(x^\prime)_c} - F(x^\prime) _{C(x)} , & \text{(Untargeted)} \\ F(x^\prime) _{c^*} - max_{c \neq c^*}{F(x^\prime)_c}, & \text{(Targeted)} \end{cases} </math>

This can be further simplified to better visualize the boundary between different potential adversarial examples:^[82]

<math display="block">S(x^\prime) > 0 \iff \begin{cases} argmax_cF(x^\prime) \neq C(x) , & \text{(Untargeted)} \\ argmax_cF(x^\prime) = c^*, & \text{(Targeted)} \end{cases} </math>

With this boundary function, the attack then follows an iterative algorithm to find adversarial examples <math display="inline">x^\prime</math> for a given image <math display="inline">x</math> that satisfies the attack objectives.

Initialize <math display="inline">x</math> to some point where <math display="inline">S(x) > 0</math>
Iterate below
1. Boundary search
2. Gradient update
  - Compute the gradient
  - Find the step size

Boundary search uses a modified binary search to find the point in which the boundary (as defined by <math display="inline">S</math>) intersects with the line between <math display="inline">x</math> and <math display="inline">x^\prime</math>. The next step involves calculating the gradient for <math display="inline">x</math>, and update the original <math display="inline">x</math> using this gradient and a pre-chosen step size. HopSkipJump authors prove that this iterative algorithm will converge, leading <math display="inline">x</math> to a point right along the boundary that is very close in distance to the original image.^[82]

However, since HopSkipJump is a proposed black box attack and the iterative algorithm above requires the calculation of a gradient in the second iterative step (which black box attacks do not have access to), the authors propose a solution to gradient calculation that requires only the model's output predictions alone.^[82] By generating many random vectors in all directions, denoted as <math display="inline">u_b</math>, an approximation of the gradient can be calculated using the average of these random vectors weighted by the sign of the boundary function on the image <math display="inline">x^\prime + \delta_{u_b}</math>, where <math display="inline">\delta_{u_b}</math> is the size of the random vector perturbation:^[82]

<math display="block">\nabla S(x^\prime, \delta) \approx \frac{1}{B}\sum_{b=1}^{B}\phi(x^\prime + \delta_{u_b}) u_b </math>

The result of the equation above gives a close approximation of the gradient required in step 2 of the iterative algorithm, completing HopSkipJump as a black box attack.^[83]^[84]^[82]

White box attacks

White box attacks assumes that the adversary has access to model parameters on top of being able to get labels for provided inputs.^[80]

Fast gradient sign method

One of the very first proposed attacks for generating adversarial examples was proposed by Google researchers Ian J. Goodfellow, Jonathon Shlens, and Christian Szegedy.^[85] The attack was called fast gradient sign method (FGSM), and it consists of adding a linear amount of in-perceivable noise to the image and causing a model to incorrectly classify it. This noise is calculated by multiplying the sign of the gradient with respect to the image we want to perturb by a small constant epsilon. As epsilon increases, the model is more likely to be fooled, but the perturbations become easier to identify as well. Shown below is the equation to generate an adversarial example where <math display="inline">x</math> is the original image, <math display="inline">\epsilon</math> is a very small number, <math display="inline">\Delta_x</math> is the gradient function, <math display="inline">J</math> is the loss function, <math display="inline">\theta</math> is the model weights, and <math display="inline">y</math> is the true label.^[86]^[87]

<math display="block">adv_x = x + \epsilon \cdot sign(\Delta_xJ(\theta, x, y))</math>

One important property of this equation is that the gradient is calculated with respect to the input image since the goal is to generate an image that maximizes the loss for the original image of true label <math display="inline">y</math>. In traditional gradient descent (for model training), the gradient is used to update the weights of the model since the goal is to minimize the loss for the model on a ground truth dataset. The Fast Gradient Sign Method was proposed as a fast way to generate adversarial examples to evade the model, based on the hypothesis that neural networks cannot resist even linear amounts of perturbation to the input.^[86]^[87]^[85] FGSM has shown to be effective in adversarial attacks for image classification and skeletal action recognition.^[88]

Carlini & Wagner (C&W)

In an effort to analyze existing adversarial attacks and defenses, researchers at the University of California, Berkeley, Nicholas Carlini and David Wagner in 2016 propose a faster and more robust method to generate adversarial examples.^[89]

The attack proposed by Carlini and Wagner begins with trying to solve a difficult non-linear optimization equation:^[64]

<math display="block">\min(||\delta||_{p}) \text{ subject to } C(x + \delta) = t, x + \delta \in [0, 1]^n</math>

Here the objective is to minimize the noise (<math display="inline">\delta</math>), added to the original input <math display="inline">x</math>, such that the machine learning algorithm (<math display="inline">C</math>) predicts the original input with delta (or <math display="inline">x + \delta</math>) as some other class <math display="inline">t</math>. However instead of directly the above equation, Carlini and Wagner propose using a new function <math display="inline">f</math> such that:^[64]

<math display="block">C(x + \delta) = t \iff f(x + \delta) \leq 0</math>

This condenses the first equation to the problem below:^[64]

<math display="block">\min(||\delta||_{p}) \text{ subject to } f(x + \delta) \leq 0, x + \delta \in [0, 1]^n</math>

and even more to the equation below:^[64]

<math display="block">\min(||\delta||_{p} + c \cdot f(x + \delta)), x + \delta \in [0, 1]^n</math>

Carlini and Wagner then propose the use of the below function in place of <math display="inline">f</math> using <math display="inline">Z</math>, a function that determines class probabilities for given input <math display="inline">x</math>. When substituted in, this equation can be thought of as finding a target class that is more confident than the next likeliest class by some constant amount:^[64]

When solved using gradient descent, this equation is able to produce stronger adversarial examples when compared to fast gradient sign method that is also able to bypass defensive distillation, a defense that was once proposed to be effective against adversarial examples.^[90]^[91]^[89]^[64]

Defenses

Файл:Proactive arms race.jpg

Conceptual representation of the proactive arms race^[43]^[39]

Researchers have proposed a multi-step approach to protecting machine learning.^[11]

Threat modeling – Formalize the attackers goals and capabilities with respect to the target system.
Attack simulation – Formalize the optimization problem the attacker tries to solve according to possible attack strategies.
Attack impact evaluation
Countermeasure design
Noise detection (For evasion based attack)^[92]
Information laundering – Alter the information received by adversaries (for model stealing attacks)^[64]

Mechanisms

A number of defense mechanisms against evasion, poisoning, and privacy attacks have been proposed, including:

Secure learning algorithms^[20]^[93]^[94]
Byzantine-resilient algorithms^[55]^[5]
Multiple classifier systems^[19]^[95]
AI-written algorithms.^[34]
AIs that explore the training environment; for example, in image recognition, actively navigating a 3D environment rather than passively scanning a fixed set of 2D images.^[34]
Privacy-preserving learning^[43]^[96]
Ladder algorithm for Kaggle-style competitions
Game theoretic models^[97]^[98]^[99]
Sanitizing training data
Adversarial training^[75]^[22]
Backdoor detection algorithms^[100]
Gradient masking/obfuscation techniques: to prevent the adversary exploiting the gradient in white-box attacks. This family of defenses is deemed unreliable as these models are still vulnerable to black-box attacks or can be circumvented in other ways.^[101]
Ensembles of models have been proposed in the literature but caution should be applied when relying on them: usually ensembling weak classifiers results in a more accurate model but it does not seem to apply in the adversarial context.^[102]

References

Шаблон:Reflist

External links

MITRE ATLAS: Adversarial Threat Landscape for Artificial-Intelligence Systems
NIST 8269 Draft: A Taxonomy and Terminology of Adversarial Machine Learning
NIPS 2007 Workshop on Machine Learning in Adversarial Environments for Computer Security
AlfaSVMLib Шаблон:Webarchive – Adversarial Label Flip Attacks against Support Vector Machines
Шаблон:Cite journal
Dagstuhl Perspectives Workshop on "Machine Learning Methods for Computer Security"
Workshop on Artificial Intelligence and Security, (AISec) Series

Шаблон:Differentiable computing

↑ Шаблон:Cite book
↑ ^2,0 ^2,1 Шаблон:Cite book
↑ Шаблон:Cite journal Шаблон:Dead link
↑ Шаблон:Cite conference
↑ ^5,0 ^5,1 ^5,2 Шаблон:Cite journal
↑ Шаблон:Cite conference
↑ Шаблон:Cite web
↑ ^8,0 ^8,1 Шаблон:Cite arXiv
↑ ^9,0 ^9,1 ^9,2 Шаблон:Cite book
↑ Шаблон:Cite arXiv
↑ ^11,0 ^11,1 Шаблон:Cite journal
↑ Шаблон:Cite arXiv
↑ Gupta, Kishor Datta, Dipankar Dasgupta, and Zahid Akhtar. "Applicability issues of Evasion-Based Adversarial Attacks and Mitigation Techniques." 2020 IEEE Symposium Series on Computational Intelligence (SSCI). 2020.
↑ ^14,0 ^14,1 Шаблон:Cite journal
↑ ^15,0 ^15,1 Шаблон:Cite web
↑ Шаблон:Cite web
↑ ^17,0 ^17,1 ^17,2 Шаблон:Citation
↑ Шаблон:Cite web
↑ ^19,0 ^19,1 Шаблон:Cite journal
↑ ^20,0 ^20,1 Шаблон:Cite journal
↑ Шаблон:Cite journal
↑ ^22,0 ^22,1 Шаблон:Cite journal
↑ ^23,0 ^23,1 Шаблон:Cite journal
↑ Шаблон:Cite journal
↑ Шаблон:Cite news
↑ Шаблон:Cite arXiv
↑ Шаблон:Cite magazine
↑ Шаблон:Cite journal
↑ Шаблон:Cite web
↑ Шаблон:Cite web
↑ Шаблон:Cite magazine
↑ Шаблон:Cite web
↑ Шаблон:Cite magazine
↑ ^34,0 ^34,1 ^34,2 Шаблон:Cite journal
↑ Шаблон:Cite journal
↑ Шаблон:Cite arXiv
↑ Шаблон:Cite arXiv
↑ D. B. Skillicorn. "Adversarial knowledge discovery". IEEE Intelligent Systems, 24:54–61, 2009.
↑ ^39,0 ^39,1 B. Biggio, G. Fumera, and F. Roli. "Pattern recognition systems under attack: Design issues and research challenges Шаблон:Webarchive". Int'l J. Patt. Recogn. Artif. Intell., 28(7):1460002, 2014.
↑ ^40,0 ^40,1 Шаблон:Cite journal
↑ Шаблон:Cite book
↑ ^42,0 ^42,1 ^42,2 B. Biggio, G. Fumera, and F. Roli. "Security evaluation of pattern classifiers under attack Шаблон:Webarchive". IEEE Transactions on Knowledge and Data Engineering, 26(4):984–996, 2014.
↑ ^43,0 ^43,1 ^43,2 ^43,3 ^43,4 Шаблон:Cite book
↑ Шаблон:Cite journal
↑ Шаблон:Cite web
↑ Шаблон:Cite web
↑ Шаблон:Cite journal
↑ B. Biggio, B. Nelson, and P. Laskov. "Support vector machines under adversarial label noise Шаблон:Webarchive". In Journal of Machine Learning Research – Proc. 3rd Asian Conf. Machine Learning, volume 20, pp. 97–112, 2011.
↑ M. Kloft and P. Laskov. "Security analysis of online centroid anomaly detection". Journal of Machine Learning Research, 13:3647–3690, 2012.
↑ Шаблон:Cite web
↑ Шаблон:Cite web
↑ Шаблон:Cite journal
↑ Шаблон:Cite journal
↑ Шаблон:Cite arXiv
↑ ^55,0 ^55,1 Шаблон:Cite journal
↑ Шаблон:Cite journal
↑ Шаблон:Cite journal
↑ Шаблон:Cite arXiv Review
↑ Шаблон:Cite conference Review
↑ Шаблон:Cite journal
↑ Шаблон:Cite arXiv Review
↑ B. Nelson, B. I. Rubinstein, L. Huang, A. D. Joseph, S. J. Lee, S. Rao, and J. D. Tygar. "Query strategies for evading convex-inducing classifiers". J. Mach. Learn. Res., 13:1293–1332, 2012
↑ Шаблон:Cite web
↑ ^64,0 ^64,1 ^64,2 ^64,3 ^64,4 ^64,5 ^64,6 ^64,7 Шаблон:Cite arXiv
↑ ^65,0 ^65,1 Шаблон:Cite web
↑ Шаблон:Cite journal
↑ Шаблон:Cite book
↑ Шаблон:Cite journal
↑ Шаблон:Cite book
↑ Шаблон:Cite book
↑ Шаблон:Cite web
↑ Шаблон:Cite arXiv
↑ Шаблон:Cite journal
↑ Шаблон:Cite arXiv
↑ ^75,0 ^75,1 Шаблон:Cite arXiv
↑ Шаблон:Cite arXiv
↑ Шаблон:Cite arXiv
↑ Шаблон:Cite arXiv
↑ Шаблон:Cite journal
↑ ^80,0 ^80,1 Шаблон:Cite web
↑ ^81,0 ^81,1 ^81,2 ^81,3 ^81,4 Шаблон:Cite book
↑ ^82,0 ^82,1 ^82,2 ^82,3 ^82,4 ^82,5 ^82,6 Шаблон:Citation
↑ Шаблон:Cite arXiv
↑ Шаблон:Cite web
↑ ^85,0 ^85,1 Шаблон:Cite arXiv
↑ ^86,0 ^86,1 Шаблон:Cite web
↑ ^87,0 ^87,1 Шаблон:Cite web
↑ Шаблон:Cite conference
↑ ^89,0 ^89,1 Шаблон:Cite arXiv
↑ Шаблон:Cite web
↑ Шаблон:Cite web
↑ Шаблон:Cite journal
↑ O. Dekel, O. Shamir, and L. Xiao. "Learning to classify with missing and corrupted features". Machine Learning, 81:149–178, 2010.
↑ Шаблон:Cite journal
↑ B. Biggio, G. Fumera, and F. Roli. "Evade hard multiple classifier systems Шаблон:Webarchive". In O. Okun and G. Valentini, editors, Supervised and Unsupervised Ensemble Methods and Their Applications, volume 245 of Studies in Computational Intelligence, pages 15–38. Springer Berlin / Heidelberg, 2009.
↑ B. I. P. Rubinstein, P. L. Bartlett, L. Huang, and N. Taft. "Learning in a large function space: Privacy- preserving mechanisms for svm learning". Journal of Privacy and Confidentiality, 4(1):65–100, 2012.
↑ M. Kantarcioglu, B. Xi, C. Clifton. "Classifier Evaluation and Attribute Selection against Active Adversaries". Data Min. Knowl. Discov., 22:291–335, January 2011.
↑ Шаблон:Cite journal
↑ Шаблон:Cite journal
↑ Шаблон:Cite web
↑ Шаблон:Cite arXiv
↑ Шаблон:Cite arXiv

[1] Шаблон:Cite book

[:1-2] 2,0 ^2,1 Шаблон:Cite book

[GoodfellowMcDaniel20182-3] Шаблон:Cite journal Шаблон:Dead link

[4] Шаблон:Cite conference

[:13-5] 5,0 ^5,1 ^5,2 Шаблон:Cite journal

[6] Шаблон:Cite conference

[7] Шаблон:Cite web

[Poisoning_Attacks_against_Support_V-8] 8,0 ^8,1 Шаблон:Cite arXiv

[Springer-9] 9,0 ^9,1 ^9,2 Шаблон:Cite book

[10] Шаблон:Cite arXiv

[:02-11] 11,0 ^11,1 Шаблон:Cite journal

[12] Шаблон:Cite arXiv

[13] Gupta, Kishor Datta, Dipankar Dasgupta, and Zahid Akhtar. "Applicability issues of Evasion-Based Adversarial Attacks and Mitigation Techniques." 2020 IEEE Symposium Series on Computational Intelligence (SSCI). 2020.

[LimTaeihagh20192-14] 14,0 ^14,1 Шаблон:Cite journal

[:2-15] 15,0 ^15,1 Шаблон:Cite web

[16] Шаблон:Cite web

[:3-17] 17,0 ^17,1 ^17,2 Шаблон:Citation

[18] Шаблон:Cite web

[BiggioFumera20102-19] 19,0 ^19,1 Шаблон:Cite journal

[Adversarial_Machine_Learning_18A2-20] 20,0 ^20,1 Шаблон:Cite journal

[21] Шаблон:Cite journal

[:0-22] 22,0 ^22,1 Шаблон:Cite journal

[RodriguesLing20092-23] 23,0 ^23,1 Шаблон:Cite journal

[24] Шаблон:Cite journal

[25] Шаблон:Cite news

[26] Шаблон:Cite arXiv

[27] Шаблон:Cite magazine

[28] Шаблон:Cite journal

[29] Шаблон:Cite web

[30] Шаблон:Cite web

[31] Шаблон:Cite magazine

[32] Шаблон:Cite web

[33] Шаблон:Cite magazine

[nature_why2-34] 34,0 ^34,1 ^34,2 Шаблон:Cite journal

[35] Шаблон:Cite journal

[36] Шаблон:Cite arXiv

[37] Шаблон:Cite arXiv

[Adversarial_Machine_Learning_42A2-38] D. B. Skillicorn. "Adversarial knowledge discovery". IEEE Intelligent Systems, 24:54–61, 2009.

[Adversarial_Machine_Learning_46A2-39] 39,0 ^39,1 B. Biggio, G. Fumera, and F. Roli. "Pattern recognition systems under attack: Design issues and research challenges Шаблон:Webarchive". Int'l J. Patt. Recogn. Artif. Intell., 28(7):1460002, 2014.

[Adversarial_Machine_Learning_22-40] 40,0 ^40,1 Шаблон:Cite journal

[41] Шаблон:Cite book

[Adversarial_Machine_Learning_4A2-42] 42,0 ^42,1 ^42,2 B. Biggio, G. Fumera, and F. Roli. "Security evaluation of pattern classifiers under attack Шаблон:Webarchive". IEEE Transactions on Knowledge and Data Engineering, 26(4):984–996, 2014.

[Adversarial_Machine_Learning_5A2-43] 43,0 ^43,1 ^43,2 ^43,3 ^43,4 Шаблон:Cite book

[44] Шаблон:Cite journal

[45] Шаблон:Cite web

[46] Шаблон:Cite web

[47] Шаблон:Cite journal

[Adversarial_Machine_Learning_15A2-48] B. Biggio, B. Nelson, and P. Laskov. "Support vector machines under adversarial label noise Шаблон:Webarchive". In Journal of Machine Learning Research – Proc. 3rd Asian Conf. Machine Learning, volume 20, pp. 97–112, 2011.

[Adversarial_Machine_Learning_29A2-49] M. Kloft and P. Laskov. "Security analysis of online centroid anomaly detection". Journal of Machine Learning Research, 13:3647–3690, 2012.

[50] Шаблон:Cite web

[51] Шаблон:Cite web

[Baruch_2019-52] Шаблон:Cite journal

[53] Шаблон:Cite journal

[54] Шаблон:Cite arXiv

[:14-55] 55,0 ^55,1 Шаблон:Cite journal

[56] Шаблон:Cite journal

[57] Шаблон:Cite journal

[58] Шаблон:Cite arXiv Review

[59] Шаблон:Cite conference Review

[60] Шаблон:Cite journal

[61] Шаблон:Cite arXiv Review

[Adversarial_Machine_Learning_36A2-62] B. Nelson, B. I. Rubinstein, L. Huang, A. D. Joseph, S. J. Lee, S. Rao, and J. D. Tygar. "Query strategies for evading convex-inducing classifiers". J. Mach. Learn. Res., 13:1293–1332, 2012

[63] Шаблон:Cite web

[:12-64] 64,0 ^64,1 ^64,2 ^64,3 ^64,4 ^64,5 ^64,6 ^64,7 Шаблон:Cite arXiv

[:6-65] 65,0 ^65,1 Шаблон:Cite web

[66] Шаблон:Cite journal

[67] Шаблон:Cite book

[68] Шаблон:Cite journal

[69] Шаблон:Cite book

[70] Шаблон:Cite book

[71] Шаблон:Cite web

[72] Шаблон:Cite arXiv

[73] Шаблон:Cite journal

[74] Шаблон:Cite arXiv

[Explaining_and_Harnessing_Adversari2-75] 75,0 ^75,1 Шаблон:Cite arXiv

[76] Шаблон:Cite arXiv

[77] Шаблон:Cite arXiv

[78] Шаблон:Cite arXiv

[79] Шаблон:Cite journal

[:4-80] 80,0 ^80,1 Шаблон:Cite web

[:7-81] 81,0 ^81,1 ^81,2 ^81,3 ^81,4 Шаблон:Cite book

[:8-82] 82,0 ^82,1 ^82,2 ^82,3 ^82,4 ^82,5 ^82,6 Шаблон:Citation

[83] Шаблон:Cite arXiv

[84] Шаблон:Cite web

[:9-85] 85,0 ^85,1 Шаблон:Cite arXiv

[:10-86] 86,0 ^86,1 Шаблон:Cite web

[:11-87] 87,0 ^87,1 Шаблон:Cite web

[coronafigueroaa23unaligned-88] Шаблон:Cite conference

[:5-89] 89,0 ^89,1 Шаблон:Cite arXiv

[90] Шаблон:Cite web

[91] Шаблон:Cite web

[92] Шаблон:Cite journal

[Adversarial_Machine_Learning_22A2-93] O. Dekel, O. Shamir, and L. Xiao. "Learning to classify with missing and corrupted features". Machine Learning, 81:149–178, 2010.

[Adversarial_Machine_Learning_45A2-94] Шаблон:Cite journal

[Adversarial_Machine_Learning_10A2-95] B. Biggio, G. Fumera, and F. Roli. "Evade hard multiple classifier systems Шаблон:Webarchive". In O. Okun and G. Valentini, editors, Supervised and Unsupervised Ensemble Methods and Their Applications, volume 245 of Studies in Computational Intelligence, pages 15–38. Springer Berlin / Heidelberg, 2009.

[Adversarial_Machine_Learning_41A2-96] B. I. P. Rubinstein, P. L. Bartlett, L. Huang, and N. Taft. "Learning in a large function space: Privacy- preserving mechanisms for svm learning". Journal of Privacy and Confidentiality, 4(1):65–100, 2012.

[feature_select2-97] M. Kantarcioglu, B. Xi, C. Clifton. "Classifier Evaluation and Attribute Selection against Active Adversaries". Data Min. Knowl. Discov., 22:291–335, January 2011.

[98] Шаблон:Cite journal

[99] Шаблон:Cite journal

[100] Шаблон:Cite web

[101] Шаблон:Cite arXiv

[102] Шаблон:Cite arXiv

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

[46]

[47]

[48]

[49]

[50]

[51]

[52]

[53]

[54]

[55]

[56]

[57]

[58]

[59]

[60]

[61]

[62]

[63]

[64]

[65]

[66]

[67]

[68]

[69]

[70]

[71]

[72]

[73]

[74]

[75]

[76]

[77]

[78]

[79]

[80]

[81]

[82]

[83]

[84]

[85]

[86]

[87]

[88]

[89]

[90]

[91]

[92]

[93]

[94]

[95]

[96]

[97]

[98]

[99]

[100]

Партнерские ресурсы
Криптовалюты	Обмен криптовалют - www.bestchange.ru Криптовалютная биржа CoinEx Криптовалютная биржа Binance HIVE OS - операционная система для майнинга e4pool - Мультивалютный пул для майнинга.
Магазины	AliExpress — глобальная виртуальная (в Интернете) торговая площадка, предоставляющая возможность покупать товары производителей из КНР; computeruniverse.net - Интернет-магазин компьютеров(Промо код 5 Евро на первую покупку:FWWC3ZKQ);
Хостинг	DigitalOcean - американский провайдер облачных инфраструктур, с главным офисом в Нью-Йорке и с центрами обработки данных по всему миру;
Разное	Викиум - Онлайн-тренажер для мозга Like Центр - Центр поддержки и развития предпринимательства. Gamersbay - лучший магазин по бустингу для World of Warcraft. Ноотропы OmniMind N°1 - Усиливает мозговую активность. Повышает мотивацию. Улучшает память. Санкт-Петербургская школа телевидения - это федеральная сеть образовательных центров, которая имеет филиалы в 37 городах России. Lingualeo.com — интерактивный онлайн-сервис для изучения и практики английского языка в увлекательной игровой форме. Junyschool (Джунискул) – международная школа программирования и дизайна для детей и подростков от 5 до 17 лет, где ученики осваивают компьютерную грамотность, развивают алгоритмическое и креативное мышление, изучают основы программирования и компьютерной графики, создают собственные проекты: игры, сайты, программы, приложения, анимации, 3D-модели, монтируют видео. Умназия - Интерактивные онлайн-курсы и тренажеры для развития мышления детей 6-13 лет SkillBox - это один из лидеров российского рынка онлайн-образования. Среди партнеров Skillbox ведущий разработчик сервисного дизайна AIC, медиа-компания Yoola, первое и самое крупное русскоязычное аналитическое агентство Tagline, онлайн-школа дизайна и иллюстрации Bang! Bang! Education, оператор PR-рынка PACO, студия рисования Draw&Go, агентство performance-маркетинга Ingate, scrum-студия Sibirix, имидж-лаборатория Персона. «Нетология» — это университет по подготовке и дополнительному обучению специалистов в области интернет-маркетинга, управления проектами и продуктами, дизайна, Data Science и разработки. В рамках Нетологии студенты получают ценные теоретические знания от лучших экспертов Рунета, выполняют практические задания на отработку полученных навыков, общаются с экспертами и единомышленниками. Познакомиться со всеми продуктами подробнее можно на сайте https://netology.ru, линейка курсов и профессий постоянно обновляется. StudyBay Brazil – это онлайн биржа для португалоговорящих студентов и авторов! Студент получает уникальную работу любого уровня сложности и больше свободного времени, в то время как у автора появляется дополнительный заработок и бесценный опыт. Автор24 — самая большая в России площадка по написанию учебных работ: контрольные и курсовые работы, дипломы, рефераты, решение задач, отчеты по практике, а так же любой другой вид работы. Сервис сотрудничает с более 70 000 авторов. Более 1 000 000 работ уже выполнено. StudyBay – это онлайн биржа для англоязычных студентов и авторов! Студент получает уникальную работу любого уровня сложности и больше свободного времени, в то время как у автора появляется дополнительный заработок и бесценный опыт.