A Sentiment-Controllable Music Generation System Based on Conditional Variational Autoencoder

Jun Min, Zhiwei Gao, Lei Wang, Zhenxiang Cai, Haimeng Wu, Aihua Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Intelligent music generation is a significant application of artificial intelligence. To better achieve the function of video background music generation, finer-grained control is necessary to enable emotional transitions in music. Therefore, distinct from the variational autoencoder model, this study combines convolutional neural networks with conditional variational auto encoders to improve the intelligent music generation model. We propose enhancements such as the measure-level regularization mechanism and Kullback-Leibler cyclic annealing mechanism, which not only optimize the posterior collapse issue but also further improve the model's representation performance in the latent space and its controllability in generation, aiming to achieve better music representation and finer-grained generative control.
Original languageEnglish
Title of host publicationProceedings of the 2024 International Symposium on Electrical, Electronics and Information Engineering (ISEEIE)
Place of PublicationPiscataway
PublisherIEEE
Pages352-357
Number of pages6
ISBN (Electronic)9798350355772
ISBN (Print)9798350355789
DOIs
Publication statusPublished - 28 Aug 2024
Event4th International Symposium on Electrical, Electronics and Information Engineering - University of Leicester, Leicester, United Kingdom
Duration: 28 Aug 202430 Aug 2024
https://conferences.ieee.org/conferences_events/conferences/conferencedetails/62461

Conference

Conference4th International Symposium on Electrical, Electronics and Information Engineering
Abbreviated titleISEEIE 2024
Country/TerritoryUnited Kingdom
CityLeicester
Period28/08/2430/08/24
Internet address

Cite this