Publications
.
2005. Tabu Search voor de optimalisatie van muzikale fragmenten. Faculty of Applied Economics. MSc Business Engineer Management Information Systems
Thesis.pdf (1.25 MB)
.
2018. Real-Time Binaural Auralization. ISTD. PhD
NatalieAngus_PhD_Thesis_01Jul18.pdf (6.19 MB)
.
2024. Modern Portfolio Construction with Advanced Deep Learning Models. SUTD. PhD
Joel_Ong_Thesis.pdf (3.44 MB)
.
2020. Data-driven 3D Scene Understanding. PhD
.
2011. A Variable Neighborhood Search Algorithm for Composing First Species Counterpoint Musical Fragments. 2011017
wp_cp1.pdf (775.91 KB)
.
2014. Looking into the minds of Bach, Haydn and Beethoven: Classification and generation of composer-specific music.
RPS-2014-001.pdf (575.42 KB)
.
2014. Generating structured music using quality metrics based on Markov models.
wp_bagana.pdf (1.7 MB)
.
2024. Gamification and skills tree. Trends and Foresight Report on Cyber-Physical Learning.
.
2012. Composing Fifth Species Counterpoint Music With Variable Neighborhood Search.
wp_cp5.pdf (508 KB)
.
2018. Blacklisted speaker identification using triplet neural networks. MCE2018 competition.
SUTD_description.pdf (133.08 KB)
.
2025. Royalties in the age of AI: paying artists for AI-generated songs. WIPO Magazine.
.
2018. O.R. and music generation. OR/MS Today. 45(1)
O.R. and music generation - INFORMS.pdf (825.66 KB)
.
2022. A white paper on cyberphysical learning. White paper, Singapore University of Technology and Design.
LSL_WhitePaper_Cyber-physical-Campus-Higher-Education.pdf (6.98 MB)
.
2024. Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model. Expert Systems with Applications.
2311.00968.pdf (5.51 MB)
.
2017. A variable neighborhood search algorithm to generate piano fingerings for polyphonic sheet music. International Transactions in Operational Research, Special Issue on Variable Neighbourhood Search. 24(3):509–535.
ITOR_VNS_APF_preprint.pdf (840.28 KB)
.
2021. Underwater Acoustic Communication Receiver Using Deep Belief Network. IEEE Transactions on Communications. :1-1.
2102.13397.pdf (12.87 MB)
.
2025. Towards the future of education: cyber-physical learning. Discover Education. 4:1–16.
.
2019. Towards robust audio spoofing detection: a detailed comparison of traditional and learned features. IEEE Access. 7:84229-84241.
ieee_access_herremans.pdf (14.31 MB)
.
2025. Text2midi: Generating Symbolic Music from Captions. Proceedings of AAAI, Philadelphia.
2412.16526v2.pdf (569.51 KB)
.
2025. SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering. arXiv:2508.03448.
2508.03448v2.pdf (3.31 MB)
.
2022. Single Image Video Prediction with Auto-Regressive GANs. Sensors. 22:3533.
.
2018. Singing Voice Separation Using a Deep Convolutional Neural Network Trained by Ideal Binary Mask and Cross Entropy. Neural Computing and Applications.
main.pdf (2.59 MB)
.
2025. PRESENT: Zero-Shot Text-to-Prosody Control. IEEE Signal Processing Letters.
2408.06827v1.pdf (367.55 KB)
.
2022. Predicting emotion from music videos: exploring the relative contribution of visual and auditory information to affective responses. arXiv preprint.
.
2018. Perceptual evaluation of measures of spectral variance. Journal of the Acoustical Society of America. 143(6):3300–3311.
jasa_an_dh_preprint.pdf (2.46 MB)
.
2018. A Novel Interface for the Graphical Analysis of Music Practice Behaviours. Frontiers in Psychology - Human-Media Interaction. 9
practice_browser.pdf (4.9 MB)
.
2020. nnAudio: An on-the-fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolutional Neural Networks. IEEE Access.
nnAudio.pdf (10.2 MB)
.
2025. Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey. ACM Computing Surveys.
2402.17467.pdf (1.01 MB)
.
2021. Music, Computing, and Health: A roadmap for the current and future roles of music technology for healthcare and well-being. Music & Science.
Preprint for OSF_Agres, Schaefer, Volk, et al. (2021)_Music & Science_watermark.pdf (4.07 MB)
.
2023. A Multimodal Model with Twitter FinBERT Embeddings for Extreme Price Movement Prediction of Bitcoin. Expert Systems with Applications.
2206.00648.pdf (3.26 MB)
.
2017. MorpheuS: generating structured music with constrained patterns and tension. IEEE Transactions on Affective Computing. PP (In Press)(99)
herremans2017morpheusFullIEEE.pdf (5.71 MB)
.
2018. Minimally Simple Binaural Room Modelling Using a Single Feedback Delay Network. Journal of the Audio Engineering Society. 66(10):791-807.
angus_jaes_preprint.pdf (6.39 MB)
.
2023. MERP: A Music Dataset with Emotion Ratings and Raters’ Profile Information. Sensors - Intelligent Sensors. 23(1)
sensors-23-00382 (2).pdf (1.21 MB)
.
2025. MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection. arXiv:2505.20979.
.
2019. Machine Learning Research that Matters for Music Creation: A Case Study. Journal of New Music Research. 48(1):36-55.
concert_paper_preprint.pdf (1.6 MB)
.
2025. LLMs Can't Handle Peer Pressure: Crumbling under Multi-Agent Social Interactions. arXiv:2508.18321.
.
2017. Harmonic Structure Predicts the Enjoyment of Uplifting Trance Music. Frontiers in Psychology, Cognitive Science. 7(1999)
agres16ut.pdf (1.15 MB)
.
2015. Generating structured music for bagana using quality metrics based on Markov models. Expert Systems with Applications. 42(21):7424–7435.
paper-bagana.pdf (1.73 MB)
.
2017. Generating guitar solos by integer programming. Journal of the Operational Research Society. :971-985.
preprint_guitar_solo_generation_dh.pdf (772.59 KB)
.
2017. A Functional Taxonomy of Music Generation Systems. ACM Computing Surveys. 50(5):30.
music_generation_survey_dh_preprint.pdf (349.15 KB)
.
2018. From Context to Concept: Exploring Semantic Relationships in Music with Word2Vec. Neural Computing and Applications.
paper.pdf (1.64 MB)
.
2025. Forecasting Bitcoin Volatility Spikes from Whale Transactions and Cryptoquant Data Using Synthesizer Transformer Models. IEEE Access. 13:117788-117807.
SSRN-id4247684.pdf (5.05 MB)
.
2025. An exploration of controllability in symbolic music infilling. IEEE Access.
.
2021. Evaluating the Effectiveness of an Augmented Reality Game Promoting Environmental Action. Sustainability. 13(24):13912.
sustainability-13-13912.pdf (16.23 MB)
.
2022. EmoMV: Affective Music-Video Correspondence Learning Datasets for Classification and Retrieval. Information Fusion.
SSRN-id4189323.pdf (2.01 MB)
.
2019. The emergence of deep learning: new opportunities for music and audio technologies. Neural Computing and Applications.
main_preprint.pdf (102.16 KB)
.
2022. Downscaling using Deep Convolutional Autoencoders, a case study for South East Asia. EGUsphere preprint.
egusphere-2022-234.pdf (8.99 MB)
.
2019. Development of Machine Learning for asthmatic and healthy voluntary cough - a proof of concept study. Applied Sciences. 9(14)
applsci-09-02833.pdf (2.06 MB)
.
2025. Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics. arXiv:2510.05137.
.
2024. DeepUnifiedMom: Unified Time-series Momentum Portfolio Construction via Multi-Task Learning with Multi-Gate Mixture of Experts. arXiv:2406.08742.
2406.08742v1.pdf (1.06 MB)
.
2021. Deep Neural Network Based Respiratory Pathology Classification Using Cough Sounds. Sensors. 21(16):5555.
2106.12174.pdf (6.52 MB)
.
2014. Dance hit song prediction. Journal of New Music Research. 43:302.
wp_hit.pdf (689.07 KB)
.
2023. Constructing Time-Series Momentum Portfolios with Deep Multi-Task Learning. Expert Systems with Applications. 230(120587)
2306.13661.pdf (707.95 KB)
.
2012. Composing first species counterpoint musical scores with a variable neighbourhood search algorithm. Journal of Mathematics and the Arts. 6:169-189.
.
2013. Composing Fifth Species Counterpoint Music With A Variable Neighborhood Search Algorithm. Expert Systems with Applications. 40
paper_preprint_cp5.pdf (405.75 KB)
.
2015. Compose ≡ compute. 4OR. 13:335–336.
.
2015. Classification and generation of composer-specific music using global feature models and variable neighborhood search. Computer Music Journal. 39(3):91.
papercmj-dh_preprint.pdf (637.63 KB)
.
2025. BandCondiNet: Parallel Transformers-based Conditional Popular Music Generation with Multi-View Features. Expert Systems with Applications. 130059
2407.10462v2.pdf (2.6 MB)
.
2021. AttendAffectNet – Emotion Prediction of Movie Viewers Using Multimodal Fusion with Self-attention. Sensors. Special issue on Intelligent Sensors: Sensor Based Multi-Modal Emotion Recognition.
sensors-21-08356.pdf (1.03 MB)
.
2020. Asthmatic versus healthy child classification based on cough and vocalised /a:/ sounds. The Journal of the Acoustical Society of America (JASA). 148, EL253
.
2025. Are we there yet? A brief survey of Music Emotion Prediction Datasets, Models and Outstanding Challenges. IEEE Transactions on Affective Computing.
2406.08809v1.pdf (156.19 KB)
.
2021. aiSTROM - A roadmap for developing a successful AI strategy. IEEE Access.
.
2017. Visualizing the evolution of alternative hit charts. The 18th International Society for Music Information Retrieval Conference (ISMIR) - Late Breaking Demo.
dh_visualiation_preprint.pdf (5.34 MB)
.
2016. Uma abordagem baseada em programação linear inteira para a geração de solos de guitarra. XLVIII Simpósio Brasileiro de Pesquisa Operacional (SBPO).
sbpo_dh.pdf (346.61 KB)
.
2016. Tension ribbons: Quantifying and visualising tonal tension. Second International Conference on Technologies for Music Notation and Representation (TENOR). 2:8-18.
paper_tenor_dh_preprint_small.pdf (1.67 MB)
.
2016. Music generation with structural constraints: an operations research approach. 30th Annual Conference of the Belgian Operational Research (OR) Society (ORBEL30). :37-39.
orbel30_dh.pdf (117.78 KB)
.
2017. Music and Motion-Detection: A Game Prototype for Rehabilitation and Strengthening in the Elderly. IEEE International Conference on Orange Technologies (ICOT).
agres_herr_music_rehab_preprint.pdf (1.77 MB)
.
2016. MorpheuS: constraining structure in automatic music generation. Dagstuhl seminar on Computational Music Structure Analysis.
abstract_dagstuhl_dh.pdf (88.49 KB)
.
2017. Modeling Musical Context with Word2vec. First International Workshop On Deep Learning and Music. 1:11-18.
herremans2017work2vec.pdf (745.8 KB)
.
2017. IMMA-Emo: A Multimodal Interface for Visualising Score- and Audio-synchronised Emotion Annotations. Audio Mostly.
IMMA-emo_preprint.pdf (1.4 MB)
.
2017. Hit Song Prediction Based on Early Adopter Data and Audio Features. The 18th International Society for Music Information Retrieval Conference (ISMIR) - Late Breaking Demo.
paper_preprint_hit.pdf (221.73 KB)
.
2020. Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance. Workshop on Machine Learning for Music Discovery (ML4MD) as part of ICML.
2006.09833.pdf (2.81 MB)
.
2015. Generating music with an optimization algorithm using a Markov based objective function. ORBEL29, Belgian Conference on Operations Research.
orbel29abs.pdf (138.67 KB)
.
2014. First species counterpoint generation with VNS and vertical viewpoints. Annual Conference of the Belgian Operational Research Society (ORBEL28).
orbel28_dh.pdf (216.63 KB)
.
2013. First species counterpoint generation with VNS and vertical viewpoints. Digital Music Research Network (DMRN+8).
dnmr8_dh_dc.pdf (147.73 KB)
.
2016. The Effect of Repetitive Structure on Enjoyment in Uplifting Trance Music. 14th International Conference for Music Perception and Cognition (ICMPC). :280-282.
preprint_trance.pdf (139.27 KB)
.
2013. Dance Hit Song Science. International Workshop on Music and Machine Learning.
abstract_preprint_MML2013_DH.pdf (194.82 KB)
.
2012. Composing counterpoint musical scores with variable neighborhood search. Annual Conference of the Belgian Operational Research Society (ORBEL26).
orbel26abs_vnsforcp.pdf (116.85 KB)
.
2020. A variational autoencoder for music generation controlled by tonal tension. Joint Conference on AI Music Creativity (CSMC + MuMe).
2010.06230.pdf (622.82 KB)
.
2020. Unsupervised disentanglement of pitch and timbre for isolated musical instrument sounds. Proceedings of the International Society of Music Information Retrieval (ISMIR).
.
2022. Understanding Audio Features via Trainable Basis Functions. arXiv preprint.
2204.11437.pdf (7.36 MB)
.
2019. Towards emotion based music generation: A tonal tension model based on the spiral array. Proceedings of Cognitive Science (CogSci).
CogSci_tension (1).pdf (610.91 KB)
.
2026. Text2midi-InferAlign: Improving Symbolic Music Generation with Inference-Time Alignment. ICASSP.
.
2018. The Structure of Chord Progressions Influences Listeners’ Enjoyment and Absorptive States in EDM. 15th International Conference on Music Perception and Cognition.
Agres460_preprint_v2.pdf (387.15 KB)
.
2025. SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning. Proceedings of the 6th Conference on AI Music Creativity (AIMC 2025), Brussels, Belgium, September 10th - 12th, 2025.
.
2024. SNIPER Training: Variable Sparsity Rate Training For Text-To-Speech. Proc. of IEEE Tencon, Singapore.
2211.07283.pdf (435.22 KB)
.
2025. Smart Timing for Mining: A Deep Learning Framework for Bitcoin Hardware ROI Prediction.
2512.05402v1.pdf (908 KB)
.
2020. Singing voice conversion with disentangled representations of singer and vocal technique using variational autoencoders. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP).
1912.02613.pdf (2.9 MB)
.
2026. Scaffolded Vulnerability: Chatbot-Mediated Reciprocal Self-Disclosure and Need-Supportive Interaction in Couples. Proceedings of CHI.
.
2014. Sampling the extrema from statistical models of music with variable neighbourhood search. ICMC/SMC.
icmc_dh.pdf (1.07 MB)
.
2021. Revisiting the Onsets and Frames Model with Additive Attention. Proceedings of the International Joint Conference on Neural Networks (IJCNN).
2104.06607.pdf (1.52 MB)
.
2020. Regression-based music emotion prediction using triplet neural networks. Proceedings of the International Joint Conference on Neural Networks (IJCNN).
2001.09988.pdf (777.31 KB)
.
2021. ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data. ACM Multimedia.
.
2020. PerceptionGAN: Real-world image construction from provided text through perceptual understanding. 4th Int. Conf. on Imaging, Vision and Pattern Recognition (IVPR), and 9th Int. Conf. on Informatics, Electronics & Vision (ICIEV).
perceptionGAN-preprint.pdf (2.83 MB)
.
2019. A novel music-based game with motion capture to support cognitive and motor function in the elderly. IEEE Conference on Games.
preprint.pdf (2.6 MB)
.
2019. nnAudio: A PyTorch Audio Processing Tool Using 1D Convolution neural networks. ISMIR - Late Breaking Demo.
nnAudio.pdf (399.08 KB)
.
2024. Mustango: Toward Controllable Text-to-Music Generation. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). pages 8293–8316.
2311.08355 (1).pdf (11.38 MB)