Publications

 
  • SpikePR: Position Regression with Deep Spiking Neural Network
    Huang Z, Zeng Y, Poslad S and Gu F
    IEEE Sensors Journal, Institute of Electrical and Electronics Engineers (IEEE) vol. PP (99), 1-1.
    27-12-2024
  • DMRN+19: Digital Music Research Network One-day Workshop 2024
    Dixon S, Guinot J and Yusuf F
    DMRN+19: Digital Music Research Network One-day Workshop 2024 Arts Two - QMUL (Queen Mary University of London); London E1 4NS, UK. 17 Dec 2024. Editors: Bort A.
    17-12-2024
  • Using GPT-4 to guide causal machine learning.
    Constantinou AC, Kitson NK and Zanga A
    Expert Systems with Applications, Elsevier
    12-12-2024
  • Shifting Ambiguity, Collapsing Indeterminacy: Designing with Data as Baradian Apparatus
    Reed CN, Benito AL, Caspe F and McPherson AP
    ACM Transactions on Computer-Human Interaction, Association for Computing Machinery (ACM) vol. 31 (6), 1-41.
    06-12-2024
  • Pitch-aware generative pretraining improves multi-pitch estimation with scarce data
    Pilataki M, Mauch M and Dixon S
    Proceedings of the 6th ACM International Conference on Multimedia in Asia., 1-8.  
    03-12-2024
  • Classification of spontaneous and scripted speech for multilingual audio
    Elisha S, McDowell A, Beguerisse-Díaz M and Benetos E
    IEEE Spoken Language Technology Workshop 2024 Macao, China 2 Dec 2024 - 5 Dec 2024
    02-12-2024
  • S²Reg: Structure-semantics collaborative point cloud registration
    Xu Z, Gao X, Jiang X, Cheng S, Zhang Q and Li W
    Pattern Recognition, Elsevier 
    01-12-2024
  • Robotic Grasping and Manipulation Competition at the 2024 IEEE/RAS International Conference on Robotics and Automation [Competitions]
    Sun Y, Calli B, Kimble K, wyffels F, De Gusseme V-L, Hang K, D'Avella S, Xompero A, Cavallaro A, Roa MA, Avendano J and Mavrommati A
    IEEE Robotics & Automation Magazine, Institute of Electrical and Electronics Engineers (IEEE) vol. 31 (4), 174-185.
    01-12-2024
  • Guest Editorial: Special Issue on Human Centered AI in Game Evaluation
    Denisova A, Perez-Liebana D, Volz V, Frommel J and Asadi S
    IEEE Transactions on Games, Institute of Electrical and Electronics Engineers (IEEE) vol. 16 (4), 742-745.
    01-12-2024
  • Evaluating impact of movement on diabetes via artificial intelligence and smart devices systematic literature review
    Rotbei S, Tseng WH, Merino-Barbancho B, Haleem MS, Montesinos L, Pecchia L, Fico G and Botta A
    Expert Systems with Applications, Elsevier vol. 257
    01-12-2024
  • Development of a User-Friendly Pipeline for Constructing Atrial Models at Scale: Importance of the End-User for Clinical Uptake
    Bevis L, Misghina S, Rauseo E, Lopez Barrera C, Plank G, Vigmond E, Loewe A, Karabelas E, Solis-Lemus JA, Niederer S, Petersen S, Slabaugh G, Mathur A and Roney C
    Computing in Cardiology 2024 (CinC24) Karlsruhe, Germany 11 Sep 2024 - 9 Dec 2024. vol. 51 
    01-12-2024
  • Clinical features, myocardial injury and systolic impairment in acute myocarditis.
    Shyam-Sundar V, Slabaugh G, Mohiddin SA, Petersen SE and Aung N
    Open Heart, BMJ vol. 11 (2), e002901-e002901.
    01-12-2024
  • Incremental Object 6D Pose Estimation
    Tian L, Sorrenti A, Pang YL, Bellitto G, Palazzo S, Spampinato C and Oh C
    International Conference on Pattern Recognition (ICPR) 1 Dec 2024
    29-11-2024
  • RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF
    Catley-Chandar S, Shaw R, Slabaugh G and Pérez-Pellitero E
    European Conference on Computer Vision (2024). vol. 15070, 54-71.  
    28-11-2024
  • Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing
    Maniadis Metaxas I, Tzimiropoulos G and Patras I
    Lecture Notes in Computer Science, Springer Nature vol. 15090, 436-454.  
    23-11-2024
  • Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation
    Shatri E and Fazekas G
    International Conference on Knowledge Discovery and Information Retrieval Porto, Portugal 17 Nov 2024 - 19 Nov 2024
    19-11-2024
  • Developing DIY Solar-Powered, Off-Grid Audio Streamers for Forest Soundscapes: Progress and Challenges
    Marino L and Xambo Sedo A
    CHIME Annual One-day Music and HCI Conference 2024 The Open University, in Milton Keynes, UK 2 Dec 2024
    18-11-2024
  • ‘Journeys in the Dark’ - Towards Game Master AI in Complex Board Games
    Best T, Lucas S and Gaina R
    Artificial Intelligence and Interactive Digital Entertainment
    15-11-2024
  • Presenting predictions and performance of probabilistic models for clinical decision support in trauma care
    Alptekin C, Wohlgemut JM, Perkins ZB, Marsh W, Tai NRM and Yet B
    International Journal of Medical Informatics, Elsevier vol. 194 
    14-11-2024
  • Diff-MSTC: A Mixing Style Transfer Prototype for Cubase
    Vanka S, Hannink L, Rolland J-B and Fazekas G
    International Society for Music Information Retrieval San Francisco 10 Nov 2024 - 15 Nov 2024
    11-11-2024
  • ST-ITO: Controlling audio effects for style transfer with inference-time optimization
    Steinmetz C, Singh S, Comunità M, Ibnyahya I, Yuan S, Benetos E and Reiss J
    25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024
    10-11-2024
  • Proceedings of the 25th International Society for Music Information Retrieval Conference
    Guinot J, Fazekas G and Quinton E
    The 25th International Society for Music Information Retrieval Conference San Francisco, USA 9 Nov 2024 - 15 Nov 2024
    10-11-2024
  • MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models
    Weck B, Manco I, Benetos E, Quinton E, Fazekas G and Bogdanov D
    25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024
    10-11-2024
  • I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognition
    Vasilakis I, Bittner R and Pauwels J
    25th International Society for Music Information Retrieval (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024
    10-11-2024
  • Diff-MST: Differentiable Mixing Style Transfer
    Vanka S, Steinmetz C, Rolland J-B, Reiss J and Fazekas G
    International Society for Music Information Retrieval San Francisco 10 Nov 2024 - 14 Nov 2024
    10-11-2024
  • ComposerX: Multi-Agent Symbolic Music Composition with LLMs
    Deng Q, Yang Q, Yuan R, Huang Y, Wang Y, Liu X, Tian Z, Pan J, Zhang G, Lin H, Li Y, Ma Y, Fu J, Lin C, Benetos E, Wang W, Xia G, Xue W and Guo Y
    25th International Society for Music Information Retrieval Conference (ISMIR), San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024
    10-11-2024
  • Can LLMs Reason in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation
    Zhou Z, Wu Y, Wu Z, Zhang X, Yuan R, Ma Y, Wang L, Benetos E, Xue W and Guo Y
    25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024
    10-11-2024
  • A Secure Image Watermarking Framework with Statistical Guarantees via Adversarial Attacks on Secret Key Networks
    Chen F, Lin W, Liu Z and Chan AB
    Lecture Notes in Computer Science. vol. 15098, 428-445.  
    10-11-2024
  • Introduction to the Special Issue on Realistic Synthetic Data: Generation, Learning, Evaluation
    Ionescu B, Patras I, Müller H and Del Bimbo A
    ACM Transactions on Multimedia Computing Communications and Applications, Association for Computing Machinery (ACM)
    09-11-2024
  • Actually I Can Count My Blessings: User-Centered Design of an Application to Promote Gratitude Among Young Adults
    Bhattacharjee A, Gong Z, Wang B, Luckcock TJ, Watson E, Abellan EA, Gutman L, Hsu A and Williams JJ
    Proceedings of the ACM on Human-Computer Interaction. vol. 8 (CSCW2), 1-29.  
    07-11-2024
  • Editorial: Variable autonomy for human-robot teaming
    Theodorou A, Chiou M, Lacerda B and Rothfuß S
    Frontiers in Robotics and AI, Frontiers vol. 11
    06-11-2024
  • A multimodal understanding of the role of sound and music in gendered toy marketing
    Marinelli L, Lucht P and Saitis C
    PLOS ONE, Public Library of Science (PLOS) vol. 19 (11)
    06-11-2024
  • PhenoGemini: Enhancing Molecular Diagnoses of Mendelian Disorders through Identifying Twin Patients with Large Language Models
    Chen Z, Cai J, Liu P, Yang Y, Zhao S, Li G, Xu K, Niu Y, Hospedales T, Qiu G, Wu Z, Zhang TJ and Wu N
    ASHG Annual Meeting Denver 5 Nov 2024 - 9 Nov 2024
    05-11-2024
  • A scoping review, novel taxonomy and catalogue of implementation frameworks for clinical decision support systems
    Wohlgemut JM, Pisirir E, Stoner RS, Perkins ZB, Marsh W, Tai NRM and Kyrimi E
    BMC Medical Informatics and Decision Making, Springer Nature vol. 24 (1)
    01-11-2024
  • The impact of multimorbidity on cardiac remodelling in the UK Biobank
    Shyam-Sundar V, Nicholls H, Chadalavada S, Vargas J, Slabaugh G, Mohiddin S, Petersen S and Aung N
    ESC 2024. vol. 45 (Supplement_1), ehae666.248-ehae666.248.  
    28-10-2024
  • MRAC Track 1: 2nd Workshop on Multimodal, Generative and Responsible Affective Computing
    Ghosh S, Cai Z, Dhall A, Kollias D, Goecke R and Gedeon T
    Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing., 1-6.  
    28-10-2024
  • MRAC '24 Chairs' Welcome
    Tao J, Ghosh S, Lian Z, Cai Z, Schuller BW, Dhall A, Zhao G, Kollias D, Cambria E, Goecke R and Gedeon T
    MRAC 2024 - Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing
    28-10-2024
  • Diagnostic and prognostic value of ECG-predicted hypertension mediated left ventricular hypertrophy using machine learning
    Naderi H, Ramirez J, Van Duijvenboden S, Pujadas ER, Aung N, Wang L, Chamling B, Dorr M, Markus MRP, Lekadir K, Petersen SE and Munroe PB
    European Heart Journal. vol. 45 (Supplement_1) 
    28-10-2024
  • CLIPCleaner: Cleaning Noisy Labels with CLIP
    Feng C, Tzimiropoulos G and Patras I
    Proceedings of the 32nd ACM International Conference on Multimedia., 876-885.  
    28-10-2024
  • 1M-Deepfakes Detection Challenge
    Cai Z, Dhall A, Ghosh S, Hayat M, Kollias D, Stefanov K and Tariq U
    Proceedings of the 32nd ACM International Conference on Multimedia., 11355-11359.  
    28-10-2024
  • Multi-Signal Informed Attention for Beat and Downbeat Detection
    Bolt J, Pauwels J and Fazekas G
    2024 IEEE 5th International Symposium on the Internet of Sounds (IS2). vol. 00, 1-7.  
    02-10-2024
  • Artificial intelligence in respiratory care: perspectives on critical opportunities and challenges
    Drummond D, Adejumo I, Hansen K, Poberezhets V, Slabaugh G and Hui CY
    Breathe, European Respiratory Society (ERS) vol. 20 (3)
    01-10-2024
  • Improving image de-raining using reference-guided transformers
    Ye Z, Cho J and Oh C
    IEEE International Conference on Image Processing 27 Oct 2024 - 30 Oct 2024
    27-09-2024
  • YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation
    Chang SK, Benetos E, Kirchhoff H and Dixon S
    IEEE International Workshop on Machine Learning for Signal Processing (MLSP) London, UK 22 Sep 2024 - 25 Sep 2024
    22-09-2024
  • Building Sketch-to-Sound Mapping with Unsupervised Feature Extraction and Interactive Machine Learning
    Zheng S, Del Sette BM, Saitis C, Xambo Sedo A and Bryan-Kinns N
    New Interfaces for Musical Expression Utrecht, The Netherlands 4 Sep 2024 - 6 Sep 2024
    04-09-2024
  • Leveraging Real Electric Guitar Tones and Effects to Improve Robustness in Guitar Tablature Transcription Modeling
    Roman Guzman I
    27th International Conference on Digital Audio Effects (DAFx24)
    03-09-2024
  • Differentiable All-pole Filters for Time-varying Audio Systems
    Yu C-Y, Mitcheltree C, Carson A, Bilbao S, Reiss J and Fazekas G
    International Conference on Digital Audio Effects 2024 Guildford, Surrey, UK 3 Sep 2024 - 7 Sep 2024
    03-09-2024
  • Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis
    Yu C-Y and Fazekas G
    INTERSPEECH 2024 Kos Island, Greece 1 Sep 2024 - 5 Sep 2024
    01-09-2024
  • A Multi‐Criteria Decision Support Tool for Shared Decision Making in Clinical Consultation
    Şakar CT, Keith‐Jopp C, Yet B, Joyner C, Hill A, Roberts J, Marsh W and Morrissey D
    Journal of Multi-Criteria Decision Analysis, Wiley vol. 31 (5-6) 
    01-09-2024
  • Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model
    Huang J and Benetos E
    32nd European Signal Processing Conference (EUSIPCO) Lyon, France 26 Aug 2024 - 30 Aug 2024., 146-150.  
    26-08-2024
  • ChatMusician: Understanding and Generating Music Intrinsically with LLM
    Yuan R, Lin H, Wang Y, Tian Z, Wu S, Shen T, Zhang G, Wu Y, Liu C, Zhou Z, Ma Z, Xue L, Wang Z, Liu Q, Zheng T, Li Y, Ma Y, Liang Y, Chi X, Liu R, et al.
    62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) Bangkok, Thailand 11 Aug 2024 - 16 Aug 2024
    11-08-2024
  • MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models
    Zhang Y, Ikemiya Y, Xia G, Murata N, Martinez M, Liao W, Mitsufuji Y and Dixon S
    International Joint Conference on Artificial Intelligence Jeju, Korea 3 Aug 2024 - 8 Aug 2024
    03-08-2024
  • What characteristics of clinical decision support system implementations lead to adoption for regular use? A scoping review
    Hill A, Morrissey D and Marsh W
    BMJ Health & Care Informatics, BMJ vol. 31 (1)
    01-08-2024
  • One-Shot Neural Face Reenactment via Finding Directions in GAN's Latent Space.
    Bounareli S, Tzelepis C, Argyriou V, Patras I and Tzimiropoulos G
    Int. J. Comput. Vis. vol. 132, 3324-3354.  
    01-08-2024
  • Evaluation of a Musculoskeletal Digital Assessment Routing Tool (DART): Crossover Noninferiority Randomized Pilot Trial
    Lowe C, Sephton R, Marsh W and Morrissey D
    JMIR Formative Research, JMIR Publications vol. 8
    30-07-2024
  • DExter: Learning and Controlling Performance Expression with Diffusion Models
    Zhang H, Chowdhury S, Cancino-Chacón CE, Liang J, Dixon S and Widmer G
    Applied Sciences, MDPI vol. 14 (15), 1-17.
    26-07-2024
  • Automatic Detection of Moral Values in Music Lyrics
    Preniqi V, Ghinassi I, Ive J, Kalimeri K and Saitis C
    25th International Society for Music Information Retrieval Conference 10 Nov 2024 - 14 Nov 2024
    26-07-2024
  • Musician-AI partnership mediated by emotionally-aware smart musical instruments
    Turchet L, Stefani D and Pauwels J
    International Journal of Human-Computer Studies, Elsevier vol. 191, 103340-103340.  
    23-07-2024
  • Bayesian networks may allow better performance and usability than logistic regression
    Wohlgemut JM, Pisirir E, Stoner RS, Kyrimi E, Yet B, Marsh W, Perkins ZB and Tai NRM
    Critical Care, Springer Nature vol. 28 (1) 
    11-07-2024
  • Temporal Analysis of Emotion Perception in Film Music: Insights from the FME-24 Dataset
    Crocker R and Fazekas G
    Sound and Music Computing 2024 ESMAE, Porto, Portugal 4 Jul 2024 - 6 Jul 2024
    06-07-2024
  • Simulating Piano Performance Mistakes for Music Learning
    Morsi A, Zhang H, Maezawa A, Dixon S and Serra X
    Sound and Music Computing Conference 4 Jul 2024 - 6 Jul 2024
    06-07-2024
  • Reconstructing the Charlie Parker Omnibook using an audio-to-score automatic transcription pipeline
    Riley J and Dixon S
    Sound and Music Computing Conference Porto 4 Jul 2024 - 6 Jul 2024
    04-07-2024
  • Can Machine Learning Assist in Diagnosis of Primary Immune Thrombocytopenia? A Feasibility Study
    Miah H, Kollias D, Pedone GL, Provan D and Chen F
    Diagnostics, MDPI vol. 14 (13)
    26-06-2024
  • The 6th Affective Behavior Analysis in-the-wild (ABAW) Competition
    Kollias D, Tzirakis P, Cowen A, Zafeiriou S, Kotsia I, Baird A, Gagne C, Shao C and Hu G
    2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). vol. 00, 4587-4598.  
    18-06-2024
  • Explaining models relating objects and privacy
    Xompero A, Bontonou M, Arbona J-M, Benetos E and Cavallaro A
    3rd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2024 Seattle Convention Center, Seattle WA, USA 18 Jun 2024
    18-06-2024
  • CUE-Net: Violence Detection Video Analytics with Spatial Cropping, Enhanced UniformerV2 and Modified Efficient Additive Attention
    Senadeera DC, Yang X, Kollias D and Slabaugh G
    2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). vol. 00, 4888-4897.  
    18-06-2024
  • Open-vocabulary object 6D pose estimation
    Corsetti J, Boscaini D, Oh C, Cavallaro A and Poiesi F
    IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024
    17-06-2024
  • Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation
    Kim J, Oh C, Do H, Kim S and Sohn K
    IEEE/CVF International Conference on Computer Vision and Pattern Recognition 2024
    17-06-2024
  • Automatic Generation of Expressive Piano Miniatures
    Colton S, Bradshaw L, Banar B and Bhandari K
    International Conference on Computational Creativity (ICCC) Sweden 17 Jun 2024 - 21 Jun 2024
    17-06-2024
  • MusiLingo: bridging music and text with pre-trained language models for music captioning and query response
    Deng Z, Ma Y, Liu Y, Guo R, Zhang G, Chen W, Huang W and Benetos E
    2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024) Mexico City, Mexico 16 Jun 2024 - 21 Jun 2024., 3643-3655.  
    16-06-2024
  • Time-of-arrival Estimation and Phase Unwrapping of Head-related Transfer Functions With Integer Linear Programming
    Yu C-Y, Pauwels J and Fazekas G
    Audio Engineering Society 156th Convention Madrid, Spain 15 Jun 2024 - 17 Jun 2024
    15-06-2024
  • Ensuring UAV Safety: A Vision-Only and Real-Time Framework for Collision Avoidance Through Object Detection, Tracking, and Distance Estimation
    Karampinis V, Arsenos A, Filippopoulos O, Petrongonas E, Skliros C, Kollias D, Kollias S and Voulodimos A
    2024 International Conference on Unmanned Aircraft Systems (ICUAS). vol. 00, 1072-1079.  
    07-06-2024
  • Common Corruptions for Enhancing and Evaluating Robustness in Air-to-Air Visual Object Detection
    Arsenos A, Karampinis V, Petrongonas E, Skliros C, Kollias D, Kollias S and Voulodimos A
    IEEE Robotics and Automation Letters, Institute of Electrical and Electronics Engineers
    03-06-2024
  • The effect of risk communication on consumers’ risk perception, risk tolerance and utility of smart and non-smart home appliances
    Hunte JL, Neil M, Fenton NE, Osman M and Bechlivanidis C
    Safety Science, Elsevier vol. 174 
    01-06-2024
  • CSP2023: 28 Digital Health Technology - Narrowing or Widening the Digital Divide? Learning From Validation of a Musculoskeletal Digital Assessment Tool (DART)
    Lowe C, Browne M, Marsh W and Morrissey D
    Physiotherapy, Elsevier vol. 123, e86-e87.  
    01-06-2024
  • A Self-Attention Deep Neural Network Regressor for real time blood glucose estimation in paediatric population using physiological signals
    Haleem MS, Cisuelo O, Andellini M, Castaldo R, Angelini M, Ritrovato M, Schiaffini R, Franzese M and Pecchia L
    Biomedical Signal Processing and Control, Elsevier vol. 92 
    01-06-2024
  • Bridging the Gap: Protocol Towards Fair and Consistent Affect Analysis
    Hu G, Papadopoulou E, Kollias D, Tzouveli P, Wei J and Yang X
    2024 IEEE 18th International Conference on Automatic Face and Gesture Recognition (FG). vol. 00, 1-9.  
    31-05-2024
  • Covid-19 Computer-Aided Diagnosis through AI-Assisted CT Imaging Analysis: Deploying a Medical AI System
    Gerogiannis D, Arsenos A, Kollias D, Nikitopoulos D and Kollias S
    2024 IEEE International Symposium on Biomedical Imaging (ISBI). vol. 00, 1-4.  
    30-05-2024
  • Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer
    Zhao Z, Cao Y, Gong S and Patras I
     
    29-05-2024
  • COVID‐19 Detection from Computed Tomography Images Using Slice Processing Techniques and a Modified Xception Classifier
    Morani K, Ayana EK, Kollias D and Unay D
    International Journal of Biomedical Imaging, Hindawi vol. 2024 (1) 
    24-05-2024
  • PyTAG: Tabletop Games for Multi-Agent Reinforcement Learning
    Balla M, Long G, Goodman J, Gaina R and Perez-Liebana D
    IEEE Transactions on Games, Institute of Electrical and Electronics Engineers, 1-10.
    22-05-2024
  • WavCraft: audio editing and generation with large language models
    Liang J, Zhang H, Liu H, Cao Y, Kong Q, Liu X, Wang W, Plumbley MD, Phan H and Benetos E
    ICLR 2024 Workshop on LLM Agents Vienna, Austria 11 May 2024
    11-05-2024
  • Thinking with Sound: Exploring the Experience of Listening to an Ultrasonic Art Installation
    Robson N, McPherson A and Bryan-Kinns N
    Proceedings of the CHI Conference on Human Factors in Computing Systems., 1-14.  
    11-05-2024
  • Entangling Entanglement: A Diffractive Dialogue on HCI and Musical Interactions
    Morrison L and McPherson A
    Proceedings of the CHI Conference on Human Factors in Computing Systems., 1-17.  
    11-05-2024
  • Introducing the TISMIR Education Track: What, Why, How?
    Müller M, Dixon S, Volk A, Sturm BLT, Rao P and Gotham M
    Transactions of the International Society for Music Information Retrieval, Ubiquity Press vol. 7 (1), 85-98.
    09-05-2024
  • MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
    Li Y, Yuan R, Zhang G, Ma Y, Chen X, Yin H, Xiao C, Lin C, Ragni A, Benetos E, Gyenge N, Dannenberg R, Liu R, Chen W, Xia G, Shi Y, Huang W, Wang Z, Guo Y and Fu J
    International Conference on Learning Representations (ICLR) Vienna, Austria 7 May 2024 - 11 May 2024
    07-05-2024
  • Parameter Reduction of Kernel-Based Video Frame Interpolation Methods Using Multiple Encoders
    Khalifeh I, Murn L and Izquierdo E
    IEEE Journal on Emerging and Selected Topics in Circuits and Systems, Institute of Electrical and Electronics Engineers (IEEE) vol. 14 (2), 245-260.
    30-04-2024
  • Unsupervised Pitch-Timbre Disentanglement of Musical Instruments Using a Jacobian Disentangled Sequential Autoencoder
    Luo Y-J, Ewert S and Dixon S
    ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 1036-1040.  
    19-04-2024
  • Uncertainty-Guided Contrastive Learning For Single Source Domain Generalisation
    Arsenos A, Kollias D, Petrongonas E, Skliros C and Kollias S
    ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 6935-6939.  
    19-04-2024
  • Syncfusion: Multimodal Onset-Synchronized Video-to-Audio Foley Synthesis
    Comunità M, Gramaccioni RF, Postolache E, Rodolà E, Comminiello D and Reiss JD
    ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 936-940.  
    19-04-2024
  • SSFE-M: A Self-Supervised Feature Extraction Model for Enhanced Camera Calibration
    Zhang N and Izquierdo E
    IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers (IEEE) vol. 31, 1179-1183.
    16-04-2024
  • Posterior Variance-Parameterised Gaussian Dropout: Improving Disentangled Sequential Autoencoders for Zero-Shot Voice Conversion
    Luo Y-J and Dixon S
    ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 14 Apr 2024 - 19 Apr 2024., 11676-11680.  
    14-04-2024
  • MERTech: instrument playing technique detection using self-supervised pretrained model with multi-task finetuning
    Li D, Ma Y, Wei W, Kong Q, Wu Y, Che M, Xia F, Benetos E and Li W
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024., 521-525.  
    14-04-2024
  • Learning from taxonomy: multi-label few-shot classification for everyday sound recognition
    Liang J, Phan QH and Benetos E
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024., 771-775.  
    14-04-2024
  • High Resolution Guitar Transcription via Domain Adaptation
    Riley JX, Edwards D and Dixon S
    International Conference on Acoustics, Speech and Signal Processing Seoul, South Korea 14 Apr 2024 - 19 Apr 2024
    14-04-2024
  • Generalized multi-source inference for text conditioned music diffusion models
    Postolache E, Mariani G, Cosmo L, Benetos E and Rodola E
    IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024., 6980-6984.  
    14-04-2024
  • Faster Person Re-Identification: One-Shot-Filter and Coarse-to-Fine Search
    Wang G, Huang X, Gong S, Zhang J and Gao W
    IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers (IEEE) vol. 46 (5), 3013-3030.
    03-04-2024
  • Learning by Erasing: Conditional Entropy Based Transferable Out-of-Distribution Detection
    Xing M, Feng Z, Su Y and Oh C
    AAAI Conference on Artificial Intelligence 2024
    24-03-2024
  • Distribution Matching for Multi-Task Learning of Classification Tasks: A Large-Scale Study on Faces & Beyond
    Kollias D, Sharmanska V and Zafeiriou S
    Proceedings of the AAAI Conference on Artificial Intelligence. vol. 38 (3), 2813-2821.  
    24-03-2024
  • HRTF Upsampling With a Generative Adversarial Network Using a Gnomonic Equiangular Projection
    Hogg AOT, Jenkins M, Liu H, Squires I, Cooper SJ and Picinali L
    IEEE/ACM Transactions on Audio Speech and Language Processing, Institute of Electrical and Electronics Engineers (IEEE) vol. 32, 2085-2099.
    11-03-2024
  • Document structure-driven investigative information retrieval
    Ketola T and Roelleke T
    Information Systems, Elsevier vol. 121, 102315-102315.  
    01-03-2024
  • Auditory imagery ability influences accuracy when singing with altered auditory feedback
    Reed CN, Pearce M and McPherson A
    Musicae Scientiae, Sage Publications 
    15-02-2024
  • Exploring User Perspectives on Brief Reflective Questioning Activities for Stress Management: Mixed Methods Study
    Bhattacharjee A, Chen P, Mandal A, Hsu A, O'Leary K, Mariakakis A and Williams JJ
    JMIR Formative Research, JMIR Publications vol. 8
    08-02-2024
  • A Data-Driven Analysis of Robust Automatic Piano Transcription
    Edwards D, Dixon S, Benetos E, Maezawa A and Kusaka Y
    IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers vol. 31, 681-685.
    08-02-2024
  • Test-time adaptation for 6D pose tracking
    Tian L, Oh C and Cavallaro A
    Pattern Recognition, Elsevier BV, 110390-110390.
    01-02-2024
  • A critical analysis of image-based camera pose estimation techniques
    Xu M, Wang Y, Xu B, Zhang J, Ren J, Huang Z, Poslad S and Xu P
    Neurocomputing, Elsevier vol. 570 
    01-02-2024
  • YourMT3+: Multi-Instrument Music Transcription with Enhanced Transformer Architectures and Cross-Dataset STEM Augmentation
    Chang S, Benetos E, Kirchhoff H and Dixon S
    Institute of Electrical and Electronics Engineers (IEEE) vol. 00, 1-6.
    25-01-2024
  • Composer Style-Specific Symbolic Music Generation using Vector Quantized Discrete Diffusion Models
    Zhang J, Fazekas G and Saitis C
    2024 IEEE 34th International Workshop on Machine Learning for Signal Processing (MLSP). vol. 00, 1-6.  
    25-01-2024
  • The Role of Communication and Reference Songs in the Mixing Process: Insights from Professional Mix Engineers
    Vanka S, Safi M, Rolland J-B and Fazekas G
    Journal of the Audio Engineering Society, Audio Engineering Society
    20-01-2024
  • Robust Facial Reactions Generation: An Emotion-Aware Framework with Modality Compensation
    Hu G, Wei J, Song S, Kollias D, Yang X, Sun Z and Kaloidas O
    2024 IEEE International Joint Conference on Biometrics (IJCB). vol. 00, 1-10.  
    18-01-2024
  • Exploring the impact of transfer learning on GAN-based HRTF upsampling
    Hogg A, Liu H, Jenkins M and Picinali L
    Proceedings of the 10th Convention of the European Acoustics Association Forum Acusticum 2023., 2323-2328.  
    17-01-2024
  • ATGNN: audio tagging graph neural network
    Singh S, Steinmetz C, Benetos E, Phan QH and Stowell D
    IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers vol. 31, 825-829.
    17-01-2024
  • Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization
    Zheng M, Gong S, Jin H, Peng Y and Liu Y
    Annual Meeting of the Association for Computational Linguistics. vol. 1, 14197-14209.  
    12-01-2024
  • A review of differentiable digital signal processing for music and speech synthesis
    Hayes B, Shier J, Fazekas G, McPherson A and Saitis C
    Frontiers in Signal Processing, Frontiers vol. 3 
    11-01-2024
  • Spectrogram-based approach with convolutional neural network for human activity classification
    Sassi M, Haleem MS and Pecchia L
    Mediterranean Conference on Medical and Biological Engineering and Computing International Conference on Medical and Biological Engineering MEDICON 2023, CMBEBIH 2023: MEDICON’23 and CMBEBIH’23 14 Sep 2023 - 16 Sep 2023
    04-01-2024
  • Spectrogram-Driven Convolutional Neural Network for Real-Time Non-invasive Hyperglycaemia Detection in Paediatric Type-1 Diabetes via Wearable Sensors
    Cisuelo O, Haleem MS, Hattersley J and Pecchia L
    MEDICON: Mediterranean Conference on Medical and Biological Engineering and Computing, CMBEBIH: International Conference on Medical and Biological Engineering 14 Sep 2023 - 16 Sep 2023
    04-01-2024
  • Wavelet-based network for high dynamic range imaging
    Dai T, Li W, Cao X, Liu J, Jia X, Leonardis A, Yan Y and Yuan S
    Computer Vision and Image Understanding, Elsevier vol. 238 
    01-01-2024
  • VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning.
    Xenos A, Foteinopoulou NM, Ntinou I, Patras I and Tzimiropoulos G
    CoRR vol. abs/2404.07078
    01-01-2024
  • Towards Efficient Modelling of String Dynamics: A Comparison of State Space and Koopman Based Deep Learning Methods
    Diaz R, De La Vega Martin C and Sandler M
    Proceedings of the International Conference on Digital Audio Effects, DAFx., 200-207.  
    01-01-2024
  • Self-Supervised Facial Representation Learning with Facial Region Awareness
    Gao Z and Patras I
    Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition., 2081-2092.  
    01-01-2024
  • Real-time Timbre Remapping with Differentiable DSP
    Shier J, Saitis C, Robertson A and McPherson A
    Proceedings of the International Conference on New Interfaces for Musical Expression
    01-01-2024
  • Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization.
    Oldfield J, Georgopoulos M, Chrysos GG, Tzelepis C, Panagakis Y, Nicolaou MA, Deng J and Patras I
    CoRR vol. abs/2402.12550
    01-01-2024
  • Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection
    Liang J, Nolasco I, Ghani B, Phan H, Benetos E and Stowell D
    European Signal Processing Conference., 1257-1261.  
    01-01-2024
  • MOAB: Multi-Modal Outer Arithmetic Block For Fusion Of Histopathological Images And Genetic Data For Brain Tumor Grading.
    Alwazzan O, Khan A, Patras I and Slabaugh GG
    CoRR vol. abs/2403.06349
    01-01-2024
  • MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance.
    Meng D, Tzelepis C, Patras I and Tzimiropoulos G
    CoRR vol. abs/2409.11010
    01-01-2024
  • LAFS: Landmark-Based Facial Self-Supervised Learning for Face Recognition.
    Sun Z, Feng C, Patras I and Tzimiropoulos G
    CVPR., 1639-1649.  
    01-01-2024
  • LAFS: Landmark-Based Facial Self-Supervised Learning for Face Recognition
    Sun Z, Feng C, Patras I and Tzimiropoulos G
    Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition., 1639-1649.  
    01-01-2024
  • Investigating Adversarial Policy Learning for Robust Agents in Automated Driving Highway Simulations
    Pighetti A, Bellotti F, Oh C, Lazzaroni L, Forneris L, Fresta M and Berta R
    Lecture Notes in Electrical Engineering. vol. 1110, 124-129.  
    01-01-2024
  • Improving Fairness using Vision-Language Driven Image Augmentation.
    D'Incà M, Tzelepis C, Patras I and Sebe N
    WACV., 4683-4692.  
    01-01-2024
  • Identification of major hemorrhage in trauma patients in the prehospital setting: diagnostic accuracy and impact on outcome
    Wohlgemut JM, Pisirir E, Stoner RS, Kyrimi E, Christian M, Hurst T, Marsh W, Perkins ZB and Tai NRM
    Trauma Surgery & Acute Care Open, BMJ vol. 9 (1)
    01-01-2024
  • Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization.
    Zhang Z, Liu Z and Patras I
    CoRR vol. abs/2408.04983
    01-01-2024
  • Foundation Models for Music: A Survey.
    Ma Y, Øland A, Ragni A, Sette BMD, Saitis C, Donahue C, Lin C, Plachouras C, Benetos E, Quinton E, Shatri E, Morreale F, Zhang G, Fazekas G, Xia G, Zhang H, Manco I, Huang J, Guinot J, Lin L, et al.
    CoRR vol. abs/2408.14340
    01-01-2024
  • FashionSD-X: Multimodal Fashion Garment Synthesis using Latent Diffusion.
    Singh AK and Patras I
    CoRR vol. abs/2404.18591
    01-01-2024
  • FOAA: Flattened Outer Arithmetic Attention for Multimodal Tumor Classification.
    Alwazzan O, Patras I and Slabaugh GG
    ISBI., 1-5.  
    01-01-2024
  • Efficient Vision-Language pre-training via domain-specific learning for human activities
    Bulat A, Ouali Y, Guerrero R, Martinez B and Tzimiropoulos G
    Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing., 7978-8000.  
    01-01-2024
  • Discerning real from synthetic: analysis and perceptual evaluation of sound effects
    Garcia N, Zong Y and Reiss J
    2024 6th International Conference on Audio for Games., 87-94.  
    01-01-2024
  • Differentiable All-Pole Filters for Time-Varying Audio Systems
    Yu CY, Mitcheltree C, Carson A, Bilbao S, Reiss JD and Fazekas G
    Proceedings of the International Conference on Digital Audio Effects, DAFx., 345-352.  
    01-01-2024
  • CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition.
    Sun Z, Song S, Patras I and Tzimiropoulos G
    CoRR vol. abs/2409.18876
    01-01-2024
  • CLIPCleaner: Cleaning Noisy Labels with CLIP.
    Feng C, Tzimiropoulos G and Patras I
    ACM Multimedia., 876-885. Editors: Cai J, Kankanhalli MS, Prabhakaran B, Boll S, Subramanian R, Zheng L, Singh VK, César P, Xie L and Xu D. 
    01-01-2024
  • Are CLIP features all you need for Universal Synthetic Image Origin Attribution?
    Cioni D, Tzelepis C, Seidenari L and Patras I
    CoRR vol. abs/2408.09153
    01-01-2024
  • A hybrid Bayesian network for medical device risk assessment and management
    Hunte JL, Neil M and Fenton NE
    Reliability Engineering & System Safety, Elsevier vol. 241 
    01-01-2024
  • A machine learning method to evaluate and improve sound effects synthesis model design
    Zong Y, Garcia-Sihuay N and Reiss J
    2024 6th International Conference on Audio for Games., 11-19.  
    01-01-2024