You are currently logged in as an
Institutional Subscriber.
If you would like to logout,
please click on the button below.
Home / Publications / E-library page
Only AES members and Institutional Journal Subscribers can download
This paper presents the first steps toward the creation of a tool which enables artists to create music visualizations using pretrained, generative, machine learning models. First, the authors investigate the application of network bending, the process of applying transforms within the layers of a generative network, to image generation diffusion models by utilizing a range of point-wise, tensor-wise, and morphological operators. A number of visual effects that result from various operators, including some that are not easily recreated with standard image editing tools, are identified. The authors find that this process allows for continuous, fine-grain control of image generation, which can be helpful for creative applications. Next, music-reactive videos are generated using Stable Diffusion by passing audio features as parameters to network bending operators. Finally, the authors comment on certain transforms that radically shift the image and the possibilities of learning more about the latent space of Stable Diffusion based on these transforms. This paper is an extended version of the paper “Network Bending of Diffusion Models,” which appeared in the 27th International Conference on Digital Audio Effects.
Author (s): Dzwonczyk, Luke; Cella, Carmine-Emanuele; Ban, David
Affiliation:
Center for New Music and Audio Technologies University of California, Berkeley Berkeley, CA; Center for New Music and Audio Technologies University of California, Berkeley Berkeley, CA; Center for New Music and Audio Technologies University of California, Berkeley Berkeley, CA
(See document for exact affiliation information.)
Publication Date:
2025-06-04
Import into BibTeX
Permalink: https://aes2.org/publications/elibrary-page/?id=22920
(732KB)
Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.
Dzwonczyk, Luke; Cella, Carmine-Emanuele; Ban, David; 2025; Generating Music Reactive Videos by Applying Network Bending to Stable Diffusion [PDF]; Center for New Music and Audio Technologies University of California, Berkeley Berkeley, CA; Center for New Music and Audio Technologies University of California, Berkeley Berkeley, CA; Center for New Music and Audio Technologies University of California, Berkeley Berkeley, CA; Paper ; Available from: https://aes2.org/publications/elibrary-page/?id=22920
Dzwonczyk, Luke; Cella, Carmine-Emanuele; Ban, David; Generating Music Reactive Videos by Applying Network Bending to Stable Diffusion [PDF]; Center for New Music and Audio Technologies University of California, Berkeley Berkeley, CA; Center for New Music and Audio Technologies University of California, Berkeley Berkeley, CA; Center for New Music and Audio Technologies University of California, Berkeley Berkeley, CA; Paper ; 2025 Available: https://aes2.org/publications/elibrary-page/?id=22920
@article{dzwonczyk2025generating,
author={dzwonczyk luke and cella carmine-emanuele and ban david},
journal={journal of the audio engineering society},
title={generating music reactive videos by applying network bending to stable diffusion},
year={2025},
volume={73},
issue={6},
pages={388-398},
month={june},}