Publications
- SpikePR: Position Regression with Deep Spiking Neural Network
Huang Z, Zeng Y, Poslad S and Gu F
IEEE Sensors Journal, Institute of Electrical and Electronics Engineers (IEEE) vol. PP (99), 1-1.
27-12-2024 - DMRN+19: Digital Music Research Network One-day Workshop 2024
Dixon S, Guinot J and Yusuf F
DMRN+19: Digital Music Research Network One-day Workshop 2024 Arts Two - QMUL (Queen Mary University of London); London E1 4NS, UK. 17 Dec 2024. Editors: Bort A.
17-12-2024 - Using GPT-4 to guide causal machine learning.
Constantinou AC, Kitson NK and Zanga A
Expert Systems With Applications, Elsevier
12-12-2024 - Shifting Ambiguity, Collapsing Indeterminacy: Designing with Data as Baradian Apparatus
Reed CN, Benito AL, Caspe F and McPherson AP
ACM Transactions on Computer-Human Interaction, Association for Computing Machinery (ACM) vol. 31 (6), 1-41.
06-12-2024 - Pitch-aware generative pretraining improves multi-pitch estimation with scarce data
Pilataki M, Mauch M and Dixon S
Proceedings of the 6th ACM International Conference on Multimedia in Asia., 1-8.
03-12-2024 - Classification of spontaneous and scripted speech for multilingual audio
Elisha S, McDowell A, Beguerisse-Díaz M and Benetos E
IEEE Spoken Language Technology Workshop 2024 Macao, China 2 Dec 2024 - 5 Dec 2024.
02-12-2024 - S2Reg: Structure-semantics collaborative point cloud registration
Xu Z, Gao X, Jiang X, Cheng S, Zhang Q and Li W
Pattern Recognition, Elsevier
01-12-2024 - Robotic Grasping and Manipulation Competition at the 2024 IEEE/RAS International Conference on Robotics and Automation [Competitions]
Sun Y, Calli B, Kimble K, wyffels F, De Gusseme V-L, Hang K, D'Avella S, Xompero A, Cavallaro A, Roa MA, Avendano J and Mavrommati A
IEEE Robotics & Automation Magazine, Institute of Electrical and Electronics Engineers (IEEE) vol. 31 (4), 174-185.
01-12-2024 - Guest Editorial: Special Issue on Human Centered AI in Game Evaluation
Denisova A, Perez-Liebana D, Volz V, Frommel J and Asadi S
IEEE Transactions on Games, Institute of Electrical and Electronics Engineers (IEEE) vol. 16 (4), 742-745.
01-12-2024 - Evaluating impact of movement on diabetes via artificial intelligence and smart devices systematic literature review
Rotbei S, Tseng WH, Merino-Barbancho B, Haleem MS, Montesinos L, Pecchia L, Fico G and Botta A
Expert Systems With Applications, Elsevier vol. 257
01-12-2024 - Development of a User-Friendly Pipeline for Constructing Atrial Models at Scale: Importance of the End-User for Clinical Uptake
Bevis L, Misghina S, Rauseo E, Lopez Barrera C, Plank G, Vigmond E, Loewe A, Karabelas E, Solis-Lemus JA, Niederer S, Petersen S, Slabaugh G, Mathur A and Roney C
Computing in Cardiology 2024 (CinC24) Karlsruhe, Germany 11 Sep 2024 - 9 Dec 2024. vol. 51
01-12-2024 - Clinical features, myocardial injury and systolic impairment in acute myocarditis.
Shyam-Sundar V, Slabaugh G, Mohiddin SA, Petersen SE and Aung N
Open Heart, BMJ vol. 11 (2), e002901-e002901.
01-12-2024 - Incremental Object 6D Pose Estimation
Tian L, Sorrenti A, Pang YL, Bellitto G, Palazzo S, Spampinato C and Oh C
International Conference on Pattern Recognition (ICPR) 1 Dec 2024.
29-11-2024 - Incremental Object 6D Pose Estimation
Tian L, Sorrenti A, Pang YL, Bellitto G, Palazzo S, Spampinato C and Oh C
International Conference on Pattern Recognition.
29-11-2024 - RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF
Catley-Chandar S, Shaw R, Slabaugh G and Pérez-Pellitero E
European Conference on Computer Vision (2024). vol. 15070, 54-71.
28-11-2024 - Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing
Maniadis Metaxas I, Tzimiropoulos G and Patras I
Lecture Notes in Computer Science, Springer Nature vol. 15090, 436-454.
23-11-2024 - Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation
Shatri E and Fazekas G
International Conference on Knowledge Discovery and Information Retrieval Porto, Portugal 17 Nov 2024 - 19 Nov 2024.
19-11-2024 - Developing DIY Solar-Powered, Off-Grid Audio Streamers for Forest Soundscapes: Progress and Challenges
Marino L and Xambo Sedo A
CHIME Annual One-day Music and HCI Conference 2024 The Open University, in Milton Keynes, UK 2 Dec 2024.
18-11-2024 - ‘Journeys in the Dark’ - Towards Game Master AI in Complex Board Games
Best T, Lucas S and Gaina R
Artificial Intelligence and Interactive Digital Entertainment.
15-11-2024 - Presenting predictions and performance of probabilistic models for clinical decision support in trauma care
Alptekin C, Wohlgemut JM, Perkins ZB, Marsh W, Tai NRM and Yet B
International Journal of Medical Informatics, Elsevier vol. 194
14-11-2024 - Diff-MSTC: A Mixing Style Transfer Prototype for Cubase
Vanka S, Hannink L, Rolland J-B and Fazekas G
International Society for Music Information Retrieval San Francisco 10 Nov 2024 - 15 Nov 2024.
11-11-2024 - ST-ITO: Controlling audio effects for style transfer with inference-time optimization
Steinmetz C, Singh S, Comunità M, Ibnyahya I, Yuan S, Benetos E and Reiss J
25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.
10-11-2024 - Proceedings of the 25th International Society for Music Information Retrieval Conference
Guinot J, Fazekas G and Quinton E
The 25th International Society for Music Information Retrieval Conference San Francisco, USA 9 Nov 2024 - 15 Nov 2024.
10-11-2024 - MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models
Weck B, Manco I, Benetos E, Quinton E, Fazekas G and Bogdanov D
25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.
10-11-2024 - I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognition
Vasilakis I, Bittner R and Pauwels J
25th International Society for Music Information Retrieval (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.
10-11-2024 - Diff-MST: Differentiable Mixing Style Transfer
Vanka S, Steinmetz C, Rolland J-B, Reiss J and Fazekas G
International Society of Music Information Retrieval San Francisco 10 Nov 2024 - 14 Nov 2024.
10-11-2024 - ComposerX: Multi-Agent Symbolic Music Composition with LLMs
Deng Q, Yang Q, Yuan R, Huang Y, Wang Y, Liu X, Tian Z, Pan J, Zhang G, Lin H, Li Y, Ma Y, Fu J, Lin C, Benetos E, Wang W, Xia G, Xue W and Guo Y
25th International Society for Music Information Retrieval Conference (ISMIR), San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.
10-11-2024 - Can LLMs Reason in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation
Zhou Z, Wu Y, Wu Z, Zhang X, Yuan R, Ma Y, Wang L, Benetos E, Xue W and Guo Y
25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.
10-11-2024 - A Secure Image Watermarking Framework with Statistical Guarantees via Adversarial Attacks on Secret Key Networks
Chen F, Lin W, Liu Z and Chan AB
Lecture Notes in Computer Science. vol. 15098, 428-445.
10-11-2024 - Introduction to the Special Issue on Realistic Synthetic Data: Generation, Learning, Evaluation
Ionescu B, Patras I, Müller H and Del Bimbo A
ACM Transactions on Multimedia Computing Communications and Applications, Association for Computing Machinery (ACM)
09-11-2024 - Actually I Can Count My Blessings: User-Centered Design of an Application to Promote Gratitude Among Young Adults
Bhattacharjee A, Gong Z, Wang B, Luckcock TJ, Watson E, Abellan EA, Gutman L, Hsu A and Williams JJ
Proceedings of the ACM on Human-Computer Interaction. vol. 8 (CSCW2), 1-29.
07-11-2024 - Editorial: Variable autonomy for human-robot teaming
Theodorou A, Chiou M, Lacerda B and Rothfuß S
Frontiers in Robotics and AI, Frontiers vol. 11
06-11-2024 - A multimodal understanding of the role of sound and music in gendered toy marketing
Marinelli L, Lucht P and Saitis C
PLOS ONE, Public Library of Science (PLOS) vol. 19 (11)
06-11-2024 - PhenoGemini: Enhancing Molecular Diagnoses of Mendelian Disorders through Identifying Twin Patients with Large Language Models
Chen Z, Cai J, Liu P, Yang Y, Zhao S, Li G, Xu K, Niu Y, Hospedales T, Qiu G, Wu Z, Zhang TJ and Wu N
ASHG Annual Meeting Denver 5 Nov 2024 - 9 Nov 2024.
05-11-2024 - A scoping review, novel taxonomy and catalogue of implementation frameworks for clinical decision support systems
Wohlgemut JM, Pisirir E, Stoner RS, Perkins ZB, Marsh W, Tai NRM and Kyrimi E
BMC Medical Informatics and Decision Making, Springer Nature vol. 24 (1)
01-11-2024 - The impact of multimorbidity on cardiac remodelling in the UK Biobank
Shyam-Sundar V, Nicholls H, Chadalavada S, Vargas J, Slabaugh G, Mohiddin S, Petersen S and Aung N
ESC 2024. vol. 45 (Supplement_1), ehae666.248-ehae666.248.
28-10-2024 - MRAC Track 1: 2nd Workshop on Multimodal, Generative and Responsible Affective Computing
Ghosh S, Cai Z, Dhall A, Kollias D, Goecke R and Gedeon T
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing., 1-6.
28-10-2024 - MRAC '24 Chairs' Welcome
Tao J, Ghosh S, Lian Z, Cai Z, Schuller BW, Dhall A, Zhao G, Kollias D, Cambria E, Goecke R and Gedeon T
MRAC 2024 - Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing
28-10-2024 - Diagnostic and prognostic value of ECG-predicted hypertension mediated left ventricular hypertrophy using machine learning
Naderi H, Ramirez J, Van Duijvenboden S, Pujadas ER, Aung N, Wang L, Chamling B, Dorr M, Markus MRP, Lekadir K, Petersen SE and Munroe PB
European Heart Journal. vol. 45 (Supplement_1)
28-10-2024 - CLIPCleaner: Cleaning Noisy Labels with CLIP
Feng C, Tzimiropoulos G and Patras I
Proceedings of the 32nd ACM International Conference on Multimedia., 876-885.
28-10-2024 - 1M-Deepfakes Detection Challenge
Cai Z, Dhall A, Ghosh S, Hayat M, Kollias D, Stefanov K and Tariq U
Proceedings of the 32nd ACM International Conference on Multimedia., 11355-11359.
28-10-2024 - Multi-Signal Informed Attention for Beat and Downbeat Detection
Bolt J, Pauwels J and Fazekas G
2024 IEEE 5th International Symposium on the Internet of Sounds (IS2). vol. 00, 1-7.
02-10-2024 - Artificial intelligence in respiratory care: perspectives on critical opportunities and challenges
Drummond D, Adejumo I, Hansen K, Poberezhets V, Slabaugh G and Hui CY
Breathe, European Respiratory Society (ERS) vol. 20 (3)
01-10-2024 - Improving image de-raining using reference-guided transformers
Ye Z, Cho J and Oh C
IEEE International Conference on Image Processing 27 Oct 2024 - 30 Oct 2024.
27-09-2024 - YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation
Chang SK, Benetos E, Kirchhoff H and Dixon S
IEEE International Workshop on Machine Learning for Signal Processing (MLSP) London, UK 22 Sep 2024 - 25 Sep 2024.
22-09-2024 - Building Sketch-to-Sound Mapping with Unsupervised Feature Extraction and Interactive Machine Learning
Zheng S, Del Sette BM, Saitis C, Xambo Sedo A and Bryan-Kinns N
New Interfaces for Musical Expression Utrecht, The Netherlands 4 Sep 2024 - 6 Sep 2024.
04-09-2024 - Leveraging Real Electric Guitar Tones and Effects to Improve Robustness in Guitar Tablature Transcription Modeling
Roman Guzman I
27th International Conference on Digital Audio Effects (DAFx24).
03-09-2024 - Differentiable All-pole Filters for Time-varying Audio Systems
Yu C-Y, Mitcheltree C, Carson A, Bilbao S, Reiss J and Fazekas G
International Conference on Digital Audio Effects 2024 Guildford, Surrey, UK 3 Sep 2024 - 7 Sep 2024.
03-09-2024 - Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis
Yu C-Y and Fazekas G
INTERSPEECH 2024 Kos Island, Greece 1 Sep 2024 - 5 Sep 2024.
01-09-2024 - A Multi‐Criteria Decision Support Tool for Shared Decision Making in Clinical Consultation
Şakar CT, Keith‐Jopp C, Yet B, Joyner C, Hill A, Roberts J, Marsh W and Morrissey D
Journal of Multi-Criteria Decision Analysis, Wiley vol. 31 (5-6)
01-09-2024 - Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model
Huang J and Benetos E
32nd European Signal Processing Conference (EUSIPCO) Lyon, France 26 Aug 2024 - 30 Aug 2024., 146-150.
26-08-2024 - ChatMusician: Understanding and Generating Music Intrinsically with LLM
Yuan R, Lin H, Wang Y, Tian Z, Wu S, Shen T, Zhang G, Wu Y, Liu C, Zhou Z, Ma Z, Xue L, Wang Z, Liu Q, Zheng T, Li Y, Ma Y, Liang Y, Chi X, Liu R, et al.
62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) Bangkok, Thailand 11 Aug 2024 - 16 Aug 2024.
11-08-2024 - MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models
Zhang Y, Ikemiya Y, Xia G, Murata N, Martinez M, Liao W, Mitsufuji Y and Dixon S
International Joint Conference on Artificial Intelligence Jeju, Korea 3 Aug 2024 - 8 Aug 2024.
03-08-2024 - What characteristics of clinical decision support system implementations lead to adoption for regular use? A scoping review
Hill A, Morrissey D and Marsh W
BMJ Health & Care Informatics, BMJ vol. 31 (1)
01-08-2024 - One-Shot Neural Face Reenactment via Finding Directions in GAN's Latent Space.
Bounareli S, Tzelepis C, Argyriou V, Patras I and Tzimiropoulos G
Int. J. Comput. Vis. vol. 132, 3324-3354.
01-08-2024 - Evaluation of a Musculoskeletal Digital Assessment Routing Tool (DART): Crossover Noninferiority Randomized Pilot Trial
Lowe C, Sephton R, Marsh W and Morrissey D
JMIR Formative Research, JMIR Publications vol. 8
30-07-2024 - DExter: Learning and Controlling Performance Expression with Diffusion Models
Zhang H, Chowdhury S, Cancino-Chacón CE, Liang J, Dixon S and Widmer G
Applied Sciences, MDPI vol. 14 (15), 1-17.
26-07-2024 - Automatic Detection of Moral Values in Music Lyrics
Preniqi V, Ghinassi I, Ive J, Kalimeri K and Saitis C
25th International Society for Music Information Retrieval Conference 10 Nov 2024 - 14 Nov 2024.
26-07-2024 - Musician-AI partnership mediated by emotionally-aware smart musical instruments
Turchet L, Stefani D and Pauwels J
International Journal of Human-Computer Studies, Elsevier vol. 191, 103340-103340.
23-07-2024 - Bayesian networks may allow better performance and usability than logistic regression
Wohlgemut JM, Pisirir E, Stoner RS, Kyrimi E, Yet B, Marsh W, Perkins ZB and Tai NRM
Critical Care, Springer Nature vol. 28 (1)
11-07-2024 - Temporal Analysis of Emotion Perception in Film Music: Insights from the FME-24 Dataset
Crocker R and Fazekas G
Sound and Music Computing 2024 ESMAE, Porto, Portugal 4 Jul 2024 - 6 Jul 2024.
06-07-2024 - Simulating Piano Performance Mistakes for Music Learning
Morsi A, Zhang H, Maezawa A, Dixon S and Serra X
Sound and Music Computing Conference 4 Jul 2024 - 6 Jul 2024.
06-07-2024 - Reconstructing the Charlie Parker Omnibook using an audio-to-score automatic transcription pipeline
Riley J and Dixon S
Sound and Music Computing Conference Porto 4 Jul 2024 - 6 Jul 2024.
04-07-2024 - Can Machine Learning Assist in Diagnosis of Primary Immune Thrombocytopenia? A Feasibility Study
Miah H, Kollias D, Pedone GL, Provan D and Chen F
Diagnostics, MDPI vol. 14 (13)
26-06-2024 - The 6th Affective Behavior Analysis in-the-wild (ABAW) Competition
Kollias D, Tzirakis P, Cowen A, Zafeiriou S, Kotsia I, Baird A, Gagne C, Shao C and Hu G
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). vol. 00, 4587-4598.
18-06-2024 - Explaining models relating objects and privacy
Xompero A, Bontonou M, Arbona J-M, Benetos E and Cavallaro A
3rd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2024 Seattle Convention Center, Seattle WA, USA 18 Jun 2024.
18-06-2024 - CUE-Net: Violence Detection Video Analytics with Spatial Cropping, Enhanced UniformerV2 and Modified Efficient Additive Attention
Senadeera DC, Yang X, Kollias D and Slabaugh G
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). vol. 00, 4888-4897.
18-06-2024 - Open-vocabulary object 6D pose estimation
Corsetti J, Boscaini D, Oh C, Cavallaro A and Poiesi F
IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024.
17-06-2024 - Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation
Kim J, Oh C, Do H, Kim S and Sohn K
IEEE/CVF International Conference on Computer Vision and Pattern Recognition 2024.
17-06-2024 - Automatic Generation of Expressive Piano Miniatures
Colton S, Bradshaw L, Banar B and Bhandari K
International Conference on Computational Creativity (ICCC) Sweden 17 Jun 2024 - 21 Jun 2024.
17-06-2024 - MusiLingo: bridging music and text with pre-trained language models for music captioning and query response
Deng Z, Ma Y, Liu Y, Guo R, Zhang G, Chen W, Huang W and Benetos E
2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024) Mexico City, Mexico 16 Jun 2024 - 21 Jun 2024., 3643-3655.
16-06-2024 - Time-of-arrival Estimation and Phase Unwrapping of Head-related Transfer Functions With Integer Linear Programming
Yu C-Y, Pauwels J and Fazekas G
Audio Engineering Society 156th Convention Madrid, Spain 15 Jun 2024 - 17 Jun 2024.
15-06-2024 - Ensuring UAV Safety: A Vision-Only and Real-Time Framework for Collision Avoidance Through Object Detection, Tracking, and Distance Estimation
Karampinis V, Arsenos A, Filippopoulos O, Petrongonas E, Skliros C, Kollias D, Kollias S and Voulodimos A
2024 International Conference on Unmanned Aircraft Systems (ICUAS). vol. 00, 1072-1079.
07-06-2024 - Common Corruptions for Enhancing and Evaluating Robustness in Air-to-Air Visual Object Detection
Arsenos A, Karampinis V, Petrongonas E, Skliros C, Kollias D, Kollias S and Voulodimos A
IEEE Robotics and Automation Letters, Institute of Electrical and Electronics Engineers
03-06-2024 - The effect of risk communication on consumers’ risk perception, risk tolerance and utility of smart and non-smart home appliances
Hunte JL, Neil M, Fenton NE, Osman M and Bechlivanidis C
Safety Science, Elsevier vol. 174
01-06-2024 - CSP2023: 28 Digital Health Technology - Narrowing or Widening the Digital Divide? Learning From Validation of a Musculoskeletal Digital Assessment Tool (DART)
Lowe C, Browne M, Marsh W and Morrissey D
Physiotherapy, Elsevier vol. 123, e86-e87.
01-06-2024 - A Self-Attention Deep Neural Network Regressor for real time blood glucose estimation in paediatric population using physiological signals
Haleem MS, Cisuelo O, Andellini M, Castaldo R, Angelini M, Ritrovato M, Schiaffini R, Franzese M and Pecchia L
Biomedical Signal Processing and Control, Elsevier vol. 92
01-06-2024 - Bridging the Gap: Protocol Towards Fair and Consistent Affect Analysis
Hu G, Papadopoulou E, Kollias D, Tzouveli P, Wei J and Yang X
2024 IEEE 18th International Conference on Automatic Face and Gesture Recognition (FG). vol. 00, 1-9.
31-05-2024 - Covid-19 Computer-Aided Diagnosis through AI-Assisted CT Imaging Analysis: Deploying a Medical AI System
Gerogiannis D, Arsenos A, Kollias D, Nikitopoulos D and Kollias S
2024 IEEE International Symposium on Biomedical Imaging (ISBI). vol. 00, 1-4.
30-05-2024 - Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer
Zhao Z, Cao Y, Gong S and Patras I
29-05-2024 - COVID‐19 Detection from Computed Tomography Images Using Slice Processing Techniques and a Modified Xception Classifier
Morani K, Ayana EK, Kollias D and Unay D
International Journal of Biomedical Imaging, Hindawi vol. 2024 (1)
24-05-2024 - PyTAG: Tabletop Games for Multi-Agent Reinforcement Learning
Balla M, Long G, Goodman J, Gaina R and Perez-Liebana D
IEEE Transactions on Games, Institute of Electrical and Electronics Engineers, 1-10.
22-05-2024 - WavCraft: audio editing and generation with large language models
Liang J, Zhang H, Liu H, Cao Y, Kong Q, Liu X, Wang W, Plumbley MD, Phan H and Benetos E
ICLR 2024 Workshop on LLM Agents Vienna, Austria 11 May 2024.
11-05-2024 - Thinking with Sound: Exploring the Experience of Listening to an Ultrasonic Art Installation
Robson N, McPherson A and Bryan-Kinns N
Proceedings of the CHI Conference on Human Factors in Computing Systems., 1-14.
11-05-2024 - Entangling Entanglement: A Diffractive Dialogue on HCI and Musical Interactions
Morrison L and McPherson A
Proceedings of the CHI Conference on Human Factors in Computing Systems., 1-17.
11-05-2024 - Introducing the TISMIR Education Track: What, Why, How?
Müller M, Dixon S, Volk A, Sturm BLT, Rao P and Gotham M
Transactions of the International Society for Music Information Retrieval, Ubiquity Press vol. 7 (1), 85-98.
09-05-2024 - MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
Li Y, Yuan R, Zhang G, Ma Y, Chen X, Yin H, Xiao C, Lin C, Ragni A, Benetos E, Gyenge N, Dannenberg R, Liu R, Chen W, Xia G, Shi Y, Huang W, Wang Z, Guo Y and Fu J
International Conference on Learning Representations (ICLR) Vienna, Austria 7 May 2024 - 11 May 2024.
07-05-2024 - Parameter Reduction of Kernel-Based Video Frame Interpolation Methods Using Multiple Encoders
Khalifeh I, Murn L and Izquierdo E
IEEE Journal on Emerging and Selected Topics in Circuits and Systems, Institute of Electrical and Electronics Engineers (IEEE) vol. 14 (2), 245-260.
30-04-2024 - Unsupervised Pitch-Timbre Disentanglement of Musical Instruments Using a Jacobian Disentangled Sequential Autoencoder
Luo Y-J, Ewert S and Dixon S
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 1036-1040.
19-04-2024 - Uncertainty-Guided Contrastive Learning For Single Source Domain Generalisation
Arsenos A, Kollias D, Petrongonas E, Skliros C and Kollias S
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 6935-6939.
19-04-2024 - Syncfusion: Multimodal Onset-Synchronized Video-to-Audio Foley Synthesis
Comunità M, Gramaccioni RF, Postolache E, Rodolà E, Comminiello D and Reiss JD
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 936-940.
19-04-2024 - SSFE-M: A Self-Supervised Feature Extraction Model for Enhanced Camera Calibration
Zhang N and Izquierdo E
IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers (IEEE) vol. 31, 1179-1183.
16-04-2024 - Posterior Variance-Parameterised Gaussian Dropout: Improving Disentangled Sequential Autoencoders for Zero-Shot Voice Conversion
Luo Y-J and Dixon S
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 14 Apr 2024 - 19 Apr 2024., 11676-11680.
14-04-2024 - MERTech: instrument playing technique detection using self-supervised pretrained model with multi-task finetuning
Li D, Ma Y, Wei W, Kong Q, Wu Y, Che M, Xia F, Benetos E and Li W
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024., 521-525.
14-04-2024 - Learning from taxonomy: multi-label few-shot classification for everyday sound recognition
Liang J, Phan QH and Benetos E
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024., 771-775.
14-04-2024 - High Resolution Guitar Transcription via Domain Adaptation
Riley JX, Edwards D and Dixon S
International Conference on Acoustics, Speech and Signal Processing Seoul, South Korea 14 Apr 2024 - 19 Apr 2024.
14-04-2024 - Generalized multi-source inference for text conditioned music diffusion models
Postolache E, Mariani G, Cosmo L, Benetos E and Rodola E
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024., 6980-6984.
14-04-2024 - Faster Person Re-Identification: One-Shot-Filter and Coarse-to-Fine Search
Wang G, Huang X, Gong S, Zhang J and Gao W
IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers (IEEE) vol. 46 (5), 3013-3030.
03-04-2024 - Learning by Erasing: Conditional Entropy Based Transferable Out-of-Distribution Detection
Xing M, Feng Z, Su Y and Oh C
AAAI Conference on Artificial Intelligence 2024.
24-03-2024 - Distribution Matching for Multi-Task Learning of Classification Tasks: A Large-Scale Study on Faces & Beyond
Kollias D, Sharmanska V and Zafeiriou S
Proceedings of the AAAI Conference on Artificial Intelligence. vol. 38 (3), 2813-2821.
24-03-2024 - HRTF Upsampling With a Generative Adversarial Network Using a Gnomonic Equiangular Projection
Hogg AOT, Jenkins M, Liu H, Squires I, Cooper SJ and Picinali L
IEEE/ACM Transactions on Audio Speech and Language Processing, Institute of Electrical and Electronics Engineers (IEEE) vol. 32, 2085-2099.
11-03-2024 - Document structure-driven investigative information retrieval
Ketola T and Roelleke T
Information Systems, Elsevier vol. 121, 102315-102315.
01-03-2024 - Auditory imagery ability influences accuracy when singing with altered auditory feedback
Reed CN, Pearce M and McPherson A
Musicae Scientiae, Sage Publications
15-02-2024 - Exploring User Perspectives on Brief Reflective Questioning Activities for Stress Management: Mixed Methods Study
Bhattacharjee A, Chen P, Mandal A, Hsu A, O'Leary K, Mariakakis A and Williams JJ
JMIR Formative Research, JMIR Publications vol. 8
08-02-2024 - A Data-Driven Analysis of Robust Automatic Piano Transcription
Edwards D, Dixon S, Benetos E, Maezawa A and Kusaka Y
IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers vol. 31, 681-685.
08-02-2024 - Test-time adaptation for 6D pose tracking
Tian L, Oh C and Cavallaro A
Pattern Recognition, Elsevier BV, 110390-110390.
01-02-2024 - A critical analysis of image-based camera pose estimation techniques
Xu M, Wang Y, Xu B, Zhang J, Ren J, Huang Z, Poslad S and Xu P
Neurocomputing, Elsevier vol. 570
01-02-2024 - YourMT3+: Multi-Instrument Music Transcription with Enhanced Transformer Architectures and Cross-Dataset STEM Augmentation
Chang S, Benetos E, Kirchhoff H and Dixon S
, Institute of Electrical and Electronics Engineers (IEEE) vol. 00, 1-6.
25-01-2024 - Composer Style-Specific Symbolic Music Generation using Vector Quantized Discrete Diffusion Models
Zhang J, Fazekas G and Saitis C
2024 IEEE 34th International Workshop on Machine Learning for Signal Processing (MLSP). vol. 00, 1-6.
25-01-2024 - The Role of Communication and Reference Songs in the Mixing Process: Insights from Professional Mix Engineers
Vanka S, Safi M, Rolland J-B and Fazekas G
Journal of the Audio Engineering Society, Audio Engineering Society
20-01-2024 - Robust Facial Reactions Generation: An Emotion-Aware Framework with Modality Compensation
Hu G, Wei J, Song S, Kollias D, Yang X, Sun Z and Kaloidas O
2024 IEEE International Joint Conference on Biometrics (IJCB). vol. 00, 1-10.
18-01-2024 - Exploring the impact of transfer learning on GAN-based HRTF upsampling
Hogg A, Liu H, Jenkins M and Picinali L
Proceedings of the 10th Convention of the European Acoustics Association Forum Acusticum 2023., 2323-2328.
17-01-2024 - ATGNN: audio tagging graph neural network
Singh S, Steinmetz C, Benetos E, Phan QH and Stowell D
IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers vol. 31, 825-829.
17-01-2024 - Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization
Zheng M, Gong S, Jin H, Peng Y and Liu Y
Annual Meeting of the Association for Computational Linguistics. vol. 1, 14197-14209.
12-01-2024 - A review of differentiable digital signal processing for music and speech synthesis
Hayes B, Shier J, Fazekas G, McPherson A and Saitis C
Frontiers in Signal Processing, Frontiers vol. 3
11-01-2024 - Spectrogram-based approach with convolutional neural network for human activity classification
Sassi M, Haleem MS and Pecchia L
Mediterranean Conference on Medical and Biological Engineering and Computing International Conference on Medical and Biological Engineering MEDICON 2023, CMBEBIH 2023: MEDICON’23 and CMBEBIH’23 14 Sep 2023 - 16 Sep 2023.
04-01-2024 - Spectrogram-Driven Convolutional Neural Network for Real-Time Non-invasive Hyperglycaemia Detection in Paediatric Type-1 Diabetes via Wearable Sensors
Cisuelo O, Haleem MS, Hattersley J and Pecchia L
MEDICON: Mediterranean Conference on Medical and Biological Engineering and Computing, CMBEBIH: International Conference on Medical and Biological Engineering 14 Sep 2023 - 16 Sep 2023.
04-01-2024 - Wavelet-based network for high dynamic range imaging
Dai T, Li W, Cao X, Liu J, Jia X, Leonardis A, Yan Y and Yuan S
Computer Vision and Image Understanding, Elsevier vol. 238
01-01-2024 - VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning.
Xenos A, Foteinopoulou NM, Ntinou I, Patras I and Tzimiropoulos G
CoRR vol. abs/2404.07078
01-01-2024 - Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model
Huang J and Benetos E
European Signal Processing Conference, 146-150.
01-01-2024 - Towards Efficient Modelling of String Dynamics: A Comparison of State Space and Koopman Based Deep Learning Methods
Diaz R, De La Vega Martin C and Sandler M
Proceedings of the International Conference on Digital Audio Effects, DAFx., 200-207.
01-01-2024 - Self-Supervised Facial Representation Learning with Facial Region Awareness.
Gao Z and Patras I
CVPR., 2081-2092.
01-01-2024 - Self-Supervised Facial Representation Learning with Facial Region Awareness
Gao Z and Patras I
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition., 2081-2092.
01-01-2024 - Real-time Timbre Remapping with Differentiable DSP
Shier J, Saitis C, Robertson A and McPherson A
Proceedings of the International Conference on New Interfaces for Musical Expression.
01-01-2024 - Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization.
Oldfield J, Georgopoulos M, Chrysos GG, Tzelepis C, Panagakis Y, Nicolaou MA, Deng J and Patras I
CoRR vol. abs/2402.12550
01-01-2024 - Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection
Liang J, Nolasco I, Ghani B, Phan H, Benetos E and Stowell D
European Signal Processing Conference., 1257-1261.
01-01-2024 - MOAB: Multi-Modal Outer Arithmetic Block For Fusion Of Histopathological Images And Genetic Data For Brain Tumor Grading.
Alwazzan O, Khan A, Patras I and Slabaugh GG
CoRR vol. abs/2403.06349
01-01-2024 - MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance.
Meng D, Tzelepis C, Patras I and Tzimiropoulos G
CoRR vol. abs/2409.11010
01-01-2024 - LAFS: Landmark-Based Facial Self-Supervised Learning for Face Recognition.
Sun Z, Feng C, Patras I and Tzimiropoulos G
CVPR., 1639-1649.
01-01-2024 - LAFS: Landmark-Based Facial Self-Supervised Learning for Face Recognition
Sun Z, Feng C, Patras I and Tzimiropoulos G
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition., 1639-1649.
01-01-2024 - Investigating Adversarial Policy Learning for Robust Agents in Automated Driving Highway Simulations
Pighetti A, Bellotti F, Oh C, Lazzaroni L, Forneris L, Fresta M and Berta R
Lecture Notes in Electrical Engineering. vol. 1110, 124-129.
01-01-2024 - Improving Fairness using Vision-Language Driven Image Augmentation.
D'Incà M, Tzelepis C, Patras I and Sebe N
WACV., 4683-4692.
01-01-2024 - Identification of major hemorrhage in trauma patients in the prehospital setting: diagnostic accuracy and impact on outcome
Wohlgemut JM, Pisirir E, Stoner RS, Kyrimi E, Christian M, Hurst T, Marsh W, Perkins ZB and Tai NRM
Trauma Surgery & Acute Care Open, BMJ vol. 9 (1)
01-01-2024 - Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization.
Zhang Z, Liu Z and Patras I
CoRR vol. abs/2408.04983
01-01-2024 - Foundation Models for Music: A Survey.
Ma Y, Øland A, Ragni A, Sette BMD, Saitis C, Donahue C, Lin C, Plachouras C, Benetos E, Quinton E, Shatri E, Morreale F, Zhang G, Fazekas G, Xia G, Zhang H, Manco I, Huang J, Guinot J, Lin L, et al.
CoRR vol. abs/2408.14340
01-01-2024 - FashionSD-X: Multimodal Fashion Garment Synthesis using Latent Diffusion.
Singh AK and Patras I
CoRR vol. abs/2404.18591
01-01-2024 - FOAA: Flattened Outer Arithmetic Attention for Multimodal Tumor Classification.
Alwazzan O, Patras I and Slabaugh GG
ISBI., 1-5.
01-01-2024 - Efficient Vision-Language pre-training via domain-specific learning for human activities
Bulat A, Ouali Y, Guerrero R, Martinez B and Tzimiropoulos G
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing., 7978-8000.
01-01-2024 - Discerning real from synthetic: analysis and perceptual evaluation of sound effects
Garcia N, Zong Y and Reiss J
2024 6th International Conference on Audio for Games., 87-94.
01-01-2024 - Differentiable All-Pole Filters for Time-Varying Audio Systems
Yu CY, Mitcheltree C, Carson A, Bilbao S, Reiss JD and Fazekas G
Proceedings of the International Conference on Digital Audio Effects, DAFx., 345-352.
01-01-2024 - CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition.
Sun Z, Song S, Patras I and Tzimiropoulos G
CoRR. vol. abs/2409.18876
01-01-2024 - CLIPCleaner: Cleaning Noisy Labels with CLIP.
Feng C, Tzimiropoulos G and Patras I
ACM Multimedia., 876-885. Editors: Cai J, Kankanhalli MS, Prabhakaran B, Boll S, Subramanian R, Zheng L, Singh VK, César P, Xie L and Xu D.
01-01-2024 - Are CLIP features all you need for Universal Synthetic Image Origin Attribution?
Cioni D, Tzelepis C, Seidenari L and Patras I
CoRR vol. abs/2408.09153
01-01-2024 - A hybrid Bayesian network for medical device risk assessment and management
Hunte JL, Neil M and Fenton NE
Reliability Engineering & System Safety, Elsevier vol. 241
01-01-2024 - A Machine learning method to evaluate and improve sound effects synthesis model design
Zong Y, Garcia-Sihuay N and Reiss J
2024 6th International Conference on Audio for Games., 11-19.
01-01-2024