This is a note discussing the results of the new paper "Adversarial Perturbations Cannot Reliably Protect Artists From Generative AI," by Hönig, Rando, Carlini, and Tramèr.
Update: Our initial tests of the noisy upscaler were based on our own implementation. The authors of the attack have shared their code with us, and we have added results from their version to our initial tests. No single test can capture the totality of the impacts of the attack on Glaze and artwork. One can see that the authors' code does produce better quality images compared to our own upscaler implementation. Glaze 2.1 provides stronger protection than prior versions. The impact of the attack on Glazed images varies quite a bit across styles and individual artists, and effects tend to be stronger on images with clear colors and smooth/no textures, and especially for styles that are already well represented in the base models, e.g. anime, cartoons.
We would first like to thank the authors for rigorously testing Glaze against the strong attacks proposed in their paper, and for responsibly disclosing it to us before making it public. The authors introduced a method called "noisy upscaling," which utilizes diffusion models to iteratively remove artifacts from images, drawing on similar principles as DiffPure. They found this technique to be reasonably effective at bypassing the protection offered by Glaze V1.1 (and similar tools Mist and Anti-dreambooth), and also includes results of its performance on Glaze V2.0.
Intuitively, the noisy upscaler attack adds a large volume of randomized noise to the image, then runs it through an upscaler with diffusion fill-in. The significant addition of noise forces the diffusion model to replace large portions of the image's pixels with its best guesses as to what should be there. The effect is somewhat analogous to firing many many small pellets at an image (shotgun style), and then using diffusion to fill in the holes, effectively inpainting lots and lots of small regions of the image. What this attack is able to do is to replace many pixels in the image with diffusion inpainting. But in removing Glaze effects, the attack is also throwing away the original art's pixels.
As one might expect, this means that the more Glaze you remove, the more details you are removing of the original style that the model is trying to learn in its mimicry attempt. Not surprisingly, this affects different art styles in different ways. Smooth/flat color styles with less texture will be more vulnerable, while more textured styles will see their styles degrade much faster along with Glaze under the attack.
After the authors shared their work with us, we were able to implement their noisy upscaler for local testing. In our tests, we verified some of the authors' claims: the method did have significant impact in removing Glaze V1 protection, and had some impact on Glaze V2.0 protection. However, like many prior removal techniques, this attack often pays a price of significantly degraded image quality. The degradation is particularly problematic for faces, textured art, and high-contrast artwork.
Key Difference of Opinion. There is one important difference in how we view the issue of protective tools and the opinions expressed in the paper. The paper's key message is that we should not develop or release protective tools that can be imperfect or broken in the future, and that imperfect protection is worse than no protection at all. We disagree. Security is an ongoing battle, and protective tools like anti-virus scanners, network firewalls, and email spam filters are examples of valuable tools that have great utility despite the lack of future proof guarantees.
More importantly, there are real harms done to artists on a daily basis, and a protective tool can have a huge impact even if it eventually is broken in the future (and is then repaired/updated). Most/all artists today understand this is an ongoing battle, that the protection of their artistic creations requires more than a one-pass solution. So as attacks come, we will be here to analyze them, understand them, and work to improve Glaze (and other tools) to overcome them. And artists will update their tools and re-glaze their images as needed. Will it be perfect? Probably not. But we will do our best to be transparent with artists and help them understand risks and limitations of these tools. For our part, the effort is worth it. Respectfully, we believe the authors perhaps underestimate artists' resilience and the steps they are willing to take in order to protect their art.
Next steps. Having analyzed this new attack on both v1.1 and v2.0, we have added some short term tweaks to the v2.0 algorithm to provide additional robustness against noisy upscaling. The new update Glaze v2.1 is ready for download later today, and will be integrated into WebGlaze later this week. Longer term, we are planning to develop an even stronger update against similar types of attacks. Below, we included some relevant examples of images that illustrate a) the noisy upscaler's limitations on certain image types, and b) some samples of the performance of Glaze 2.1 under the noisy upscaler attack. While we cannot guarantee our implementation of the attack is identical to that described in the paper, it is accurate to the best of our knowledge, and in our tests, it successfully replicates the paper's results on v1.1 samples.
A sample before and after of noisy upscaler.
This illustrates the noisy upscaler's impact on images
with detailed textures and those with lower
contrast. We will add more image samples.
Glaze 2.1 Results under different implementation of noisy upscaler. These images from artists capture three different types of images, detailed photographs, high textured art, and art with more smooth gradients/flat color elements.