Publications

...

SpikePR: Position Regression with Deep Spiking Neural Network
Huang Z, Zeng Y, Poslad S and Gu F
IEEE Sensors Journal, Institute of Electrical and Electronics Engineers (IEEE) vol. PP (99), 1-1.

DOI 10.1109/jsen.2024.3520666

27-12-2024

DMRN+19: Digital Music Research Network One-day Workshop 2024
Dixon S, Guinot J and Yusuf F
DMRN+19: Digital Music Research Network One-day Workshop 2024 Arts Two - QMUL (Queen Mary University of London); London E1 4NS, UK. 17 Dec 2024. Editors: Bort A.
17-12-2024

Using GPT-4 to guide causal machine learning.
Constantinou AC, Kitson NK and Zanga A
Expert Systems With Applications, Elsevier

DOI 10.1016/j.eswa.2024.126120

QMRO

12-12-2024

Shifting Ambiguity, Collapsing Indeterminacy: Designing with Data as Baradian Apparatus
Reed CN, Benito AL, Caspe F and McPherson AP
ACM Transactions on Computer-Human Interaction, Association for Computing Machinery (ACM) vol. 31 (6), 1-41.

DOI 10.1145/3689043

QMRO

06-12-2024

Pitch-aware generative pretraining improves multi-pitch estimation with scarce data
Pilataki M, Mauch M and Dixon S
Proceedings of the 6th ACM International Conference on Multimedia in Asia., 1-8.

DOI 10.1145/3696409.3700202

03-12-2024

Classification of spontaneous and scripted speech for multilingual audio
Elisha S, McDowell A, Beguerisse-Díaz M and Benetos E
IEEE Spoken Language Technology Workshop 2024 Macao, China 2 Dec 2024 - 5 Dec 2024.

QMRO

02-12-2024

S²Reg: Structure-semantics collaborative point cloud registration
Xu Z, Gao X, Jiang X, Cheng S, Zhang Q and Li W
Pattern Recognition, Elsevier

DOI 10.1016/j.patcog.2024.111290

01-12-2024

Robotic Grasping and Manipulation Competition at the 2024 IEEE/RAS International Conference on Robotics and Automation [Competitions]
Sun Y, Calli B, Kimble K, wyffels F, De Gusseme V-L, Hang K, D'Avella S, Xompero A, Cavallaro A, Roa MA, Avendano J and Mavrommati A
IEEE Robotics & Automation Magazine, Institute of Electrical and Electronics Engineers (IEEE) vol. 31 (4), 174-185.

DOI 10.1109/mra.2024.3481609

01-12-2024

Guest Editorial: Special Issue on Human Centered AI in Game Evaluation
Denisova A, Perez-Liebana D, Volz V, Frommel J and Asadi S
IEEE Transactions on Games, Institute of Electrical and Electronics Engineers (IEEE) vol. 16 (4), 742-745.

DOI 10.1109/tg.2024.3507232

01-12-2024

Evaluating impact of movement on diabetes via artificial intelligence and smart devices systematic literature review
Rotbei S, Tseng WH, Merino-Barbancho B, Haleem MS, Montesinos L, Pecchia L, Fico G and Botta A
Expert Systems With Applications, Elsevier vol. 257

DOI 10.1016/j.eswa.2024.125058

QMRO

01-12-2024

Development of a User-Friendly Pipeline for Constructing Atrial Models at Scale: Importance of the End-User for Clinical Uptake
Bevis L, Misghina S, Rauseo E, Lopez Barrera C, Plank G, Vigmond E, Loewe A, Karabelas E, Solis-Lemus JA, Niederer S, Petersen S, Slabaugh G, Mathur A and Roney C
Computing in Cardiology 2024 (CinC24) Karlsruhe, Germany 11 Sep 2024 - 9 Dec 2024. vol. 51

DOI 10.22489/CinC.2024.014

01-12-2024

Clinical features, myocardial injury and systolic impairment in acute myocarditis.
Shyam-Sundar V, Slabaugh G, Mohiddin SA, Petersen SE and Aung N
Open Heart, BMJ vol. 11 (2), e002901-e002901.

DOI 10.1136/openhrt-2024-002901

QMRO

01-12-2024

Incremental Object 6D Pose Estimation
Tian L, Sorrenti A, Pang YL, Bellitto G, Palazzo S, Spampinato C and Oh C
International Conference on Pattern Recognition (ICPR) 1 Dec 2024.

DOI 10.1007/978-3-031-78395-1_22

QMRO

29-11-2024

RoGUENeRF: A Robust Geometry-Consistent Universal Enhancer for NeRF
Catley-Chandar S, Shaw R, Slabaugh G and Pérez-Pellitero E
European Conference on Computer Vision (2024). vol. 15070, 54-71.

DOI 10.1007/978-3-031-73254-6_4

28-11-2024

Efficient Unsupervised Visual Representation Learning with Explicit Cluster Balancing
Maniadis Metaxas I, Tzimiropoulos G and Patras I
Lecture Notes in Computer Science, Springer Nature vol. 15090, 436-454.

DOI 10.1007/978-3-031-73411-3_25

23-11-2024

Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation
Shatri E and Fazekas G
International Conference on Knowledge Discovery and Information Retrieval Porto, Portugal 17 Nov 2024 - 19 Nov 2024.

DOI 10.5220/0012947500003838

QMRO

19-11-2024

Developing DIY Solar-Powered, Off-Grid Audio Streamers for Forest Soundscapes: Progress and Challenges
Marino L and Xambo Sedo A
CHIME Annual One-day Music and HCI Conference 2024 The Open University, in Milton Keynes, UK 2 Dec 2024.
18-11-2024

‘Journeys in the Dark’ - Towards Game Master AI in Complex Board Games
Best T, Lucas S and Gaina R
Artificial Intelligence and Interactive Digital Entertainment.

DOI 10.1609/aiide.v20i1.31861

QMRO

15-11-2024

Presenting predictions and performance of probabilistic models for clinical decision support in trauma care
Alptekin C, Wohlgemut JM, Perkins ZB, Marsh W, Tai NRM and Yet B
International Journal of Medical Informatics, Elsevier vol. 194

DOI 10.1016/j.ijmedinf.2024.105702

14-11-2024

Diff-MSTC: A Mixing Style Transfer Prototype for Cubase
Vanka S, Hannink L, Rolland J-B and Fazekas G
International Society for Music Information Retrieval San Francisco 10 Nov 2024 - 15 Nov 2024.
11-11-2024

ST-ITO: Controlling audio effects for style transfer with inference-time optimization
Steinmetz C, Singh S, Comunità M, Ibnyahya I, Yuan S, Benetos E and Reiss J
25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.

DOI 10.48550/arxiv.2410.21233

QMRO

10-11-2024

Proceedings of the 25th International Society for Music Information Retrieval Conference
Guinot J, Fazekas G and Quinton E
The 25th International Society for Music Information Retrieval Conference San Francisco, USA 9 Nov 2024 - 15 Nov 2024.
10-11-2024

MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models
Weck B, Manco I, Benetos E, Quinton E, Fazekas G and Bogdanov D
25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.

DOI 10.48550/arxiv.2408.01337

QMRO

10-11-2024

I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognition
Vasilakis I, Bittner R and Pauwels J
25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.

QMRO

10-11-2024

Diff-MST: Differentiable Mixing Style Transfer
Vanka S, Steinmetz C, Rolland J-B, Reiss J and Fazekas G
International Society for Music Information Retrieval San Francisco 10 Nov 2024 - 14 Nov 2024.

QMRO

10-11-2024

ComposerX: Multi-Agent Symbolic Music Composition with LLMs
Deng Q, Yang Q, Yuan R, Huang Y, Wang Y, Liu X, Tian Z, Pan J, Zhang G, Lin H, Li Y, Ma Y, Fu J, Lin C, Benetos E, Wang W, Xia G, Xue W and Guo Y
25th International Society for Music Information Retrieval Conference (ISMIR), San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.

DOI 10.48550/arxiv.2404.18081

QMRO

10-11-2024

Can LLMs Reason in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation
Zhou Z, Wu Y, Wu Z, Zhang X, Yuan R, Ma Y, Wang L, Benetos E, Xue W and Guo Y
25th International Society for Music Information Retrieval Conference (ISMIR) San Francisco, CA, USA 10 Nov 2024 - 14 Nov 2024.

DOI 10.48550/arxiv.2407.21531

QMRO

10-11-2024

A Secure Image Watermarking Framework with Statistical Guarantees via Adversarial Attacks on Secret Key Networks
Chen F, Lin W, Liu Z and Chan AB
Lecture Notes in Computer Science. vol. 15098, 428-445.

DOI 10.1007/978-3-031-73661-2_24

QMRO

10-11-2024

Introduction to the Special Issue on Realistic Synthetic Data: Generation, Learning, Evaluation
Ionescu B, Patras I, Müller H and Del Bimbo A
ACM Transactions on Multimedia Computing Communications and Applications, Association for Computing Machinery (ACM)

DOI 10.1145/3703593

09-11-2024

Actually I Can Count My Blessings: User-Centered Design of an Application to Promote Gratitude Among Young Adults
Bhattacharjee A, Gong Z, Wang B, Luckcock TJ, Watson E, Abellan EA, Gutman L, Hsu A and Williams JJ
Proceedings of the ACM on Human-Computer Interaction. vol. 8 (CSCW2), 1-29.

DOI 10.1145/3686936

QMRO

07-11-2024

Editorial: Variable autonomy for human-robot teaming
Theodorou A, Chiou M, Lacerda B and Rothfuß S
Frontiers in Robotics and AI, Frontiers vol. 11

DOI 10.3389/frobt.2024.1465183

QMRO

06-11-2024

A multimodal understanding of the role of sound and music in gendered toy marketing
Marinelli L, Lucht P and Saitis C
PLOS ONE, Public Library of Science (PLOS) vol. 19 (11)

DOI 10.1371/journal.pone.0311876

QMRO

06-11-2024

PhenoGemini: Enhancing Molecular Diagnoses of Mendelian Disorders through Identifying Twin Patients with Large Language Models
Chen Z, Cai J, Liu P, Yang Y, Zhao S, Li G, Xu K, Niu Y, Hospedales T, Qiu G, Wu Z, Zhang TJ and Wu N
ASHG Annual Meeting Denver 5 Nov 2024 - 9 Nov 2024.
05-11-2024

A scoping review, novel taxonomy and catalogue of implementation frameworks for clinical decision support systems
Wohlgemut JM, Pisirir E, Stoner RS, Perkins ZB, Marsh W, Tai NRM and Kyrimi E
BMC Medical Informatics and Decision Making, Springer Nature vol. 24 (1)

DOI 10.1186/s12911-024-02739-1

01-11-2024

The impact of multimorbidity on cardiac remodelling in the UK Biobank
Shyam-Sundar V, Nicholls H, Chadalavada S, Vargas J, Slabaugh G, Mohiddin S, Petersen S and Aung N
ESC 2024. vol. 45 (Supplement_1), ehae666.248-ehae666.248.

DOI 10.1093/eurheartj/ehae666.248

QMRO

28-10-2024

MRAC Track 1: 2nd Workshop on Multimodal, Generative and Responsible Affective Computing
Ghosh S, Cai Z, Dhall A, Kollias D, Goecke R and Gedeon T
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing., 1-6.

DOI 10.1145/3689092.3690042

28-10-2024

MRAC '24 Chairs' Welcome
Tao J, Ghosh S, Lian Z, Cai Z, Schuller BW, Dhall A, Zhao G, Kollias D, Cambria E, Goecke R and Gedeon T
MRAC 2024 - Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing
28-10-2024

Diagnostic and prognostic value of ECG-predicted hypertension mediated left ventricular hypertrophy using machine learning
Naderi H, Ramirez J, Van Duijvenboden S, Pujadas ER, Aung N, Wang L, Chamling B, Dorr M, Markus MRP, Lekadir K, Petersen SE and Munroe PB
European Heart Journal. vol. 45 (Supplement_1)

DOI 10.1093/eurheartj/ehae666.2524

QMRO

28-10-2024

CLIPCleaner: Cleaning Noisy Labels with CLIP
Feng C, Tzimiropoulos G and Patras I
Proceedings of the 32nd ACM International Conference on Multimedia., 876-885.

DOI 10.1145/3664647.3680664

QMRO

28-10-2024

1M-Deepfakes Detection Challenge
Cai Z, Dhall A, Ghosh S, Hayat M, Kollias D, Stefanov K and Tariq U
Proceedings of the 32nd ACM International Conference on Multimedia., 11355-11359.

DOI 10.1145/3664647.3689145

28-10-2024

Multi-Signal Informed Attention for Beat and Downbeat Detection
Bolt J, Pauwels J and Fazekas G
2024 IEEE 5th International Symposium on the Internet of Sounds (IS2). vol. 00, 1-7.

DOI 10.1109/is262782.2024.10704128

QMRO

02-10-2024

Artificial intelligence in respiratory care: perspectives on critical opportunities and challenges
Drummond D, Adejumo I, Hansen K, Poberezhets V, Slabaugh G and Hui CY
Breathe, European Respiratory Society (ERS) vol. 20 (3)

DOI 10.1183/20734735.0189-2023

01-10-2024

Improving image de-raining using reference-guided transformers
Ye Z, Cho J and Oh C
IEEE International Conference on Image Processing 27 Oct 2024 - 30 Oct 2024.

DOI 10.1109/icip51287.2024.10648113

QMRO

27-09-2024

YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation
Chang SK, Benetos E, Kirchhoff H and Dixon S
IEEE International Workshop on Machine Learning for Signal Processing (MLSP) London, UK 22 Sep 2024 - 25 Sep 2024.

DOI 10.1109/MLSP58920.2024.10734819

QMRO

22-09-2024

Building Sketch-to-Sound Mapping with Unsupervised Feature Extraction and Interactive Machine Learning
Zheng S, Del Sette BM, Saitis C, Xambo Sedo A and Bryan-Kinns N
New Interfaces for Musical Expression Utrecht, The Netherlands 4 Sep 2024 - 6 Sep 2024.

DOI 10.5281/zenodo.13904959

QMRO

04-09-2024

Leveraging Real Electric Guitar Tones and Effects to Improve Robustness in Guitar Tablature Transcription Modeling
Roman Guzman I
27th International Conference on Digital Audio Effects (DAFx24).
03-09-2024

Differentiable All-pole Filters for Time-varying Audio Systems
Yu C-Y, Mitcheltree C, Carson A, Bilbao S, Reiss J and Fazekas G
International Conference on Digital Audio Effects 2024 Guildford, Surrey, UK 3 Sep 2024 - 7 Sep 2024.

DOI 10.48550/arXiv.2404.07970

QMRO

03-09-2024

Differentiable Time-Varying Linear Prediction in the Context of End-to-End Analysis-by-Synthesis
Yu C-Y and Fazekas G
INTERSPEECH 2024 Kos Island, Greece 1 Sep 2024 - 5 Sep 2024.

DOI 10.21437/Interspeech.2024-1187

QMRO

01-09-2024

A Multi‐Criteria Decision Support Tool for Shared Decision Making in Clinical Consultation
Şakar CT, Keith‐Jopp C, Yet B, Joyner C, Hill A, Roberts J, Marsh W and Morrissey D
Journal of Multi-Criteria Decision Analysis, Wiley vol. 31 (5-6)

DOI 10.1002/mcda.70001

01-09-2024

Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model
Huang J and Benetos E
32nd European Signal Processing Conference (EUSIPCO) Lyon, France 26 Aug 2024 - 30 Aug 2024., 146-150.

DOI 10.23919/eusipco63174.2024.10715045

QMRO

26-08-2024

ChatMusician: Understanding and Generating Music Intrinsically with LLM
Yuan R, Lin H, Wang Y, Tian Z, Wu S, Shen T, Zhang G, Wu Y, Liu C, Zhou Z, Ma Z, Xue L, Wang Z, Liu Q, Zheng T, Li Y, Ma Y, Liang Y, Chi X, Liu R, et al.
62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024) Bangkok, Thailand 11 Aug 2024 - 16 Aug 2024.

DOI 10.18653/v1/2024.findings-acl.373

QMRO

11-08-2024

MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models
Zhang Y, Ikemiya Y, Xia G, Murata N, Martinez M, Liao W, Mitsufuji Y and Dixon S
International Joint Conference on Artificial Intelligence Jeju, Korea 3 Aug 2024 - 8 Aug 2024.

DOI 10.24963/ijcai.2024/864

QMRO

03-08-2024

What characteristics of clinical decision support system implementations lead to adoption for regular use? A scoping review
Hill A, Morrissey D and Marsh W
BMJ Health & Care Informatics, BMJ vol. 31 (1)

DOI 10.1136/bmjhci-2024-101046

QMRO

01-08-2024

One-Shot Neural Face Reenactment via Finding Directions in GAN's Latent Space.
Bounareli S, Tzelepis C, Argyriou V, Patras I and Tzimiropoulos G
Int. J. Comput. Vis. vol. 132, 3324-3354.
01-08-2024

Evaluation of a Musculoskeletal Digital Assessment Routing Tool (DART): Crossover Noninferiority Randomized Pilot Trial
Lowe C, Sephton R, Marsh W and Morrissey D
JMIR Formative Research, JMIR Publications vol. 8

DOI 10.2196/56715

QMRO

30-07-2024

DExter: Learning and Controlling Performance Expression with Diffusion Models
Zhang H, Chowdhury S, Cancino-Chacón CE, Liang J, Dixon S and Widmer G
Applied Sciences, MDPI vol. 14 (15), 1-17.

DOI 10.3390/app14156543

QMRO

26-07-2024

Automatic Detection of Moral Values in Music Lyrics
Preniqi V, Ghinassi I, Ive J, Kalimeri K and Saitis C
25th International Society for Music Information Retrieval Conference 10 Nov 2024 - 14 Nov 2024.

DOI 10.48550/arxiv.2407.18787

QMRO

26-07-2024

Musician-AI partnership mediated by emotionally-aware smart musical instruments
Turchet L, Stefani D and Pauwels J
International Journal of Human-Computer Studies, Elsevier vol. 191, 103340-103340.

DOI 10.1016/j.ijhcs.2024.103340

QMRO

23-07-2024

Bayesian networks may allow better performance and usability than logistic regression
Wohlgemut JM, Pisirir E, Stoner RS, Kyrimi E, Yet B, Marsh W, Perkins ZB and Tai NRM
Critical Care, Springer Nature vol. 28 (1)

DOI 10.1186/s13054-024-05015-w

QMRO

11-07-2024

Temporal Analysis of Emotion Perception in Film Music: Insights from the FME-24 Dataset
Crocker R and Fazekas G
Sound and Music Computing 2024 ESMAE, Porto, Portugal 4 Jul 2024 - 6 Jul 2024.
06-07-2024

Simulating Piano Performance Mistakes for Music Learning
Morsi A, Zhang H, Maezawa A, Dixon S and Serra X
Sound and Music Computing Conference 4 Jul 2024 - 6 Jul 2024.

QMRO

06-07-2024

Reconstructing the Charlie Parker Omnibook using an audio-to-score automatic transcription pipeline
Riley J and Dixon S
Sound and Music Computing Conference Porto 4 Jul 2024 - 6 Jul 2024.

QMRO

04-07-2024

Can Machine Learning Assist in Diagnosis of Primary Immune Thrombocytopenia? A Feasibility Study
Miah H, Kollias D, Pedone GL, Provan D and Chen F
Diagnostics, MDPI vol. 14 (13)

DOI 10.3390/diagnostics14131352

26-06-2024

The 6th Affective Behavior Analysis in-the-wild (ABAW) Competition
Kollias D, Tzirakis P, Cowen A, Zafeiriou S, Kotsia I, Baird A, Gagne C, Shao C and Hu G
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). vol. 00, 4587-4598.

DOI 10.1109/cvprw63382.2024.00461

18-06-2024

Explaining models relating objects and privacy
Xompero A, Bontonou M, Arbona J-M, Benetos E and Cavallaro A
3rd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2024 Seattle Convention Center, Seattle WA, USA 18 Jun 2024.

DOI 10.48550/arXiv.2405.01646

QMRO

18-06-2024

CUE-Net: Violence Detection Video Analytics with Spatial Cropping, Enhanced UniformerV2 and Modified Efficient Additive Attention
Senadeera DC, Yang X, Kollias D and Slabaugh G
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). vol. 00, 4888-4897.

DOI 10.1109/cvprw63382.2024.00493

QMRO

18-06-2024

Open-vocabulary object 6D pose estimation
Corsetti J, Boscaini D, Oh C, Cavallaro A and Poiesi F
IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024.

QMRO

17-06-2024

Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation
Kim J, Oh C, Do H, Kim S and Sohn K
IEEE/CVF International Conference on Computer Vision and Pattern Recognition 2024.

QMRO

17-06-2024

Automatic Generation of Expressive Piano Miniatures
Colton S, Bradshaw L, Banar B and Bhandari K
International Conference on Computational Creativity (ICCC) Sweden 17 Jun 2024 - 21 Jun 2024.

QMRO

17-06-2024

MusiLingo: bridging music and text with pre-trained language models for music captioning and query response
Deng Z, Ma Y, Liu Y, Guo R, Zhang G, Chen W, Huang W and Benetos E
2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024) Mexico City, Mexico 16 Jun 2024 - 21 Jun 2024., 3643-3655.

DOI 10.18653/v1/2024.findings-naacl.231

QMRO

16-06-2024

Time-of-arrival Estimation and Phase Unwrapping of Head-related Transfer Functions With Integer Linear Programming
Yu C-Y, Pauwels J and Fazekas G
Audio Engineering Society 156th Convention Madrid, Spain 15 Jun 2024 - 17 Jun 2024.

QMRO

15-06-2024

Ensuring UAV Safety: A Vision-Only and Real-Time Framework for Collision Avoidance Through Object Detection, Tracking, and Distance Estimation
Karampinis V, Arsenos A, Filippopoulos O, Petrongonas E, Skliros C, Kollias D, Kollias S and Voulodimos A
2024 International Conference on Unmanned Aircraft Systems (ICUAS). vol. 00, 1072-1079.

DOI 10.1109/icuas60882.2024.10556937

07-06-2024

Common Corruptions for Enhancing and Evaluating Robustness in Air-to-Air Visual Object Detection
Arsenos A, Karampinis V, Petrongonas E, Skliros C, Kollias D, Kollias S and Voulodimos A
IEEE Robotics and Automation Letters, Institute of Electrical and Electronics Engineers

DOI 10.1109/LRA.2024.3408485

QMRO

03-06-2024

The effect of risk communication on consumers’ risk perception, risk tolerance and utility of smart and non-smart home appliances
Hunte JL, Neil M, Fenton NE, Osman M and Bechlivanidis C
Safety Science, Elsevier vol. 174

DOI 10.1016/j.ssci.2024.106464

01-06-2024

CSP2023: 28 Digital Health Technology - Narrowing or Widening the Digital Divide? Learning From Validation of a Musculoskeletal Digital Assessment Tool (DART)
Lowe C, Browne M, Marsh W and Morrissey D
Physiotherapy, Elsevier vol. 123, e86-e87.

DOI 10.1016/j.physio.2024.04.105

QMRO

01-06-2024

A Self-Attention Deep Neural Network Regressor for real time blood glucose estimation in paediatric population using physiological signals
Haleem MS, Cisuelo O, Andellini M, Castaldo R, Angelini M, Ritrovato M, Schiaffini R, Franzese M and Pecchia L
Biomedical Signal Processing and Control, Elsevier vol. 92

DOI 10.1016/j.bspc.2024.106065

QMRO

01-06-2024

Bridging the Gap: Protocol Towards Fair and Consistent Affect Analysis
Hu G, Papadopoulou E, Kollias D, Tzouveli P, Wei J and Yang X
2024 IEEE 18th International Conference on Automatic Face and Gesture Recognition (FG). vol. 00, 1-9.

DOI 10.1109/fg59268.2024.10582033

31-05-2024

Covid-19 Computer-Aided Diagnosis through AI-Assisted CT Imaging Analysis: Deploying a Medical AI System
Gerogiannis D, Arsenos A, Kollias D, Nikitopoulos D and Kollias S
2024 IEEE International Symposium on Biomedical Imaging (ISBI). vol. 00, 1-4.

DOI 10.1109/isbi56570.2024.10635484

30-05-2024

Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer
Zhao Z, Cao Y, Gong S and Patras I

DOI 10.48550/arxiv.2405.19100

QMRO

29-05-2024

COVID‐19 Detection from Computed Tomography Images Using Slice Processing Techniques and a Modified Xception Classifier
Morani K, Ayana EK, Kollias D and Unay D
International Journal of Biomedical Imaging, Hindawi vol. 2024 (1)

DOI 10.1155/2024/9962839

QMRO

24-05-2024

PyTAG: Tabletop Games for Multi-Agent Reinforcement Learning
Balla M, Long G, Goodman J, Gaina R and Perez-Liebana D
IEEE Transactions on Games, Institute of Electrical and Electronics Engineers, 1-10.

DOI 10.1109/TG.2024.3404133

QMRO

22-05-2024

WavCraft: audio editing and generation with large language models
Liang J, Zhang H, Liu H, Cao Y, Kong Q, Liu X, Wang W, Plumbley MD, Phan H and Benetos E
ICLR 2024 Workshop on LLM Agents Vienna, Austria 11 May 2024.

DOI 10.48550/arxiv.2403.09527

QMRO

11-05-2024

Thinking with Sound: Exploring the Experience of Listening to an Ultrasonic Art Installation
Robson N, McPherson A and Bryan-Kinns N
Proceedings of the CHI Conference on Human Factors in Computing Systems., 1-14.

DOI 10.1145/3613904.3642616

11-05-2024

Entangling Entanglement: A Diffractive Dialogue on HCI and Musical Interactions
Morrison L and McPherson A
Proceedings of the CHI Conference on Human Factors in Computing Systems., 1-17.

DOI 10.1145/3613904.3642171

11-05-2024

Introducing the TISMIR Education Track: What, Why, How?
Müller M, Dixon S, Volk A, Sturm BLT, Rao P and Gotham M
Transactions of the International Society for Music Information Retrieval, Ubiquity Press vol. 7 (1), 85-98.

DOI 10.5334/tismir.199

QMRO

09-05-2024

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
Li Y, Yuan R, Zhang G, Ma Y, Chen X, Yin H, Xiao C, Lin C, Ragni A, Benetos E, Gyenge N, Dannenberg R, Liu R, Chen W, Xia G, Shi Y, Huang W, Wang Z, Guo Y and Fu J
International Conference on Learning Representations (ICLR) Vienna, Austria 7 May 2024 - 11 May 2024.

QMRO

07-05-2024

Parameter Reduction of Kernel-Based Video Frame Interpolation Methods Using Multiple Encoders
Khalifeh I, Murn L and Izquierdo E
IEEE Journal on Emerging and Selected Topics in Circuits and Systems, Institute of Electrical and Electronics Engineers (IEEE) vol. 14 (2), 245-260.

DOI 10.1109/jetcas.2024.3395418

30-04-2024

Unsupervised Pitch-Timbre Disentanglement of Musical Instruments Using a Jacobian Disentangled Sequential Autoencoder
Luo Y-J, Ewert S and Dixon S
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 1036-1040.

DOI 10.1109/icassp48485.2024.10447564

QMRO

19-04-2024

Uncertainty-Guided Contrastive Learning For Single Source Domain Generalisation
Arsenos A, Kollias D, Petrongonas E, Skliros C and Kollias S
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 6935-6939.

DOI 10.1109/icassp48485.2024.10448096

QMRO

19-04-2024

Syncfusion: Multimodal Onset-Synchronized Video-to-Audio Foley Synthesis
Comunità M, Gramaccioni RF, Postolache E, Rodolà E, Comminiello D and Reiss JD
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 936-940.

DOI 10.1109/icassp48485.2024.10447063

19-04-2024

SSFE-M: A Self-Supervised Feature Extraction Model for Enhanced Camera Calibration
Zhang N and Izquierdo E
IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers (IEEE) vol. 31, 1179-1183.

DOI 10.1109/lsp.2024.3389830

16-04-2024

Posterior Variance-Parameterised Gaussian Dropout: Improving Disentangled Sequential Autoencoders for Zero-Shot Voice Conversion
Luo Y-J and Dixon S
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 14 Apr 2024 - 19 Apr 2024., 11676-11680.

DOI 10.1109/icassp48485.2024.10447835

QMRO

14-04-2024

MERTech: instrument playing technique detection using self-supervised pretrained model with multi-task finetuning
Li D, Ma Y, Wei W, Kong Q, Wu Y, Che M, Xia F, Benetos E and Li W
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024., 521-525.

DOI 10.1109/ICASSP48485.2024.10447445

QMRO

14-04-2024

Learning from taxonomy: multi-label few-shot classification for everyday sound recognition
Liang J, Phan QH and Benetos E
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024., 771-775.

DOI 10.1109/ICASSP48485.2024.10446908

QMRO

14-04-2024

High Resolution Guitar Transcription via Domain Adaptation
Riley JX, Edwards D and Dixon S
International Conference on Acoustics, Speech and Signal Processing Seoul, South Korea 14 Apr 2024 - 19 Apr 2024.

DOI 10.1109/ICASSP48485.2024.10446182

QMRO

14-04-2024

Generalized multi-source inference for text conditioned music diffusion models
Postolache E, Mariani G, Cosmo L, Benetos E and Rodola E
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Seoul, Korea 14 Apr 2024 - 19 Apr 2024., 6980-6984.

DOI 10.1109/ICASSP48485.2024.10447122

QMRO

14-04-2024

Faster Person Re-Identification: One-Shot-Filter and Coarse-to-Fine Search
Wang G, Huang X, Gong S, Zhang J and Gao W
IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers (IEEE) vol. 46 (5), 3013-3030.

DOI 10.1109/tpami.2023.3340923

QMRO

03-04-2024

Learning by Erasing: Conditional Entropy Based Transferable Out-of-Distribution Detection
Xing M, Feng Z, Su Y and Oh C
AAAI Conference on Artificial Intelligence 2024.

DOI 10.1609/aaai.v38i6.28444

QMRO

24-03-2024

Distribution Matching for Multi-Task Learning of Classification Tasks: A Large-Scale Study on Faces & Beyond
Kollias D, Sharmanska V and Zafeiriou S
Proceedings of the AAAI Conference on Artificial Intelligence. vol. 38 (3), 2813-2821.

DOI 10.1609/aaai.v38i3.28061

QMRO

24-03-2024

HRTF Upsampling With a Generative Adversarial Network Using a Gnomonic Equiangular Projection
Hogg AOT, Jenkins M, Liu H, Squires I, Cooper SJ and Picinali L
IEEE/ACM Transactions on Audio Speech and Language Processing, Institute of Electrical and Electronics Engineers (IEEE) vol. 32, 2085-2099.

DOI 10.1109/taslp.2024.3375635

QMRO

11-03-2024

Document structure-driven investigative information retrieval
Ketola T and Roelleke T
Information Systems, Elsevier vol. 121, 102315-102315.

DOI 10.1016/j.is.2023.102315

QMRO

01-03-2024

Auditory imagery ability influences accuracy when singing with altered auditory feedback
Reed CN, Pearce M and McPherson A
Musicae Scientiae, Sage Publications

DOI 10.1177/10298649231223077

QMRO

15-02-2024

Exploring User Perspectives on Brief Reflective Questioning Activities for Stress Management: Mixed Methods Study
Bhattacharjee A, Chen P, Mandal A, Hsu A, O'Leary K, Mariakakis A and Williams JJ
JMIR Formative Research, JMIR Publications vol. 8

DOI 10.2196/47360

QMRO

08-02-2024

A Data-Driven Analysis of Robust Automatic Piano Transcription
Edwards D, Dixon S, Benetos E, Maezawa A and Kusaka Y
IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers vol. 31, 681-685.

DOI 10.1109/LSP.2024.3363646

QMRO

08-02-2024

Test-time adaptation for 6D pose tracking
Tian L, Oh C and Cavallaro A
Pattern Recognition, Elsevier BV, 110390-110390.

DOI 10.1016/j.patcog.2024.110390

QMRO

01-02-2024

A critical analysis of image-based camera pose estimation techniques
Xu M, Wang Y, Xu B, Zhang J, Ren J, Huang Z, Poslad S and Xu P
Neurocomputing, Elsevier vol. 570

DOI 10.1016/j.neucom.2023.127125

QMRO

01-02-2024

YourMT3+: Multi-Instrument Music Transcription with Enhanced Transformer Architectures and Cross-Dataset STEM Augmentation
Chang S, Benetos E, Kirchhoff H and Dixon S
Institute of Electrical and Electronics Engineers (IEEE) vol. 00, 1-6.

DOI 10.1109/mlsp58920.2024.10734819

25-01-2024

Composer Style-Specific Symbolic Music Generation using Vector Quantized Discrete Diffusion Models
Zhang J, Fazekas G and Saitis C
2024 IEEE 34th International Workshop on Machine Learning for Signal Processing (MLSP). vol. 00, 1-6.

DOI 10.1109/mlsp58920.2024.10734713

25-01-2024

The Role of Communication and Reference Songs in the Mixing Process: Insights from Professional Mix Engineers
Vanka S, Safi M, Rolland J-B and Fazekas G
Journal of the Audio Engineering Society, Audio Engineering Society

DOI 10.17743/jaes.2022.0123

QMRO

20-01-2024

Robust Facial Reactions Generation: An Emotion-Aware Framework with Modality Compensation
Hu G, Wei J, Song S, Kollias D, Yang X, Sun Z and Kaloidas O
2024 IEEE International Joint Conference on Biometrics (IJCB). vol. 00, 1-10.

DOI 10.1109/ijcb62174.2024.10744499

18-01-2024

Exploring the impact of transfer learning on GAN-based HRTF upsampling
Hogg A, Liu H, Jenkins M and Picinali L
Proceedings of the 10th Convention of the European Acoustics Association Forum Acusticum 2023., 2323-2328.

DOI 10.61782/fa.2023.0266

17-01-2024

ATGNN: audio tagging graph neural network
Singh S, Steinmetz C, Benetos E, Phan QH and Stowell D
IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers vol. 31, 825-829.

DOI 10.1109/LSP.2024.3352514

QMRO

17-01-2024

Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization
Zheng M, Gong S, Jin H, Peng Y and Liu Y
Annual Meeting of the Association for Computational Linguistics. vol. 1, 14197-14209.

QMRO

12-01-2024

A review of differentiable digital signal processing for music and speech synthesis
Hayes B, Shier J, Fazekas G, McPherson A and Saitis C
Frontiers in Signal Processing, Frontiers vol. 3

DOI 10.3389/frsip.2023.1284100

QMRO

11-01-2024

Spectrogram-based approach with convolutional neural network for human activity classification
Sassi M, Haleem MS and Pecchia L
MEDICON: Mediterranean Conference on Medical and Biological Engineering and Computing, CMBEBIH: International Conference on Medical and Biological Engineering 14 Sep 2023 - 16 Sep 2023.

DOI 10.1007/978-3-031-49068-2_40

QMRO

04-01-2024

Spectrogram-Driven Convolutional Neural Network for Real-Time Non-invasive Hyperglycaemia Detection in Paediatric Type-1 Diabetes via Wearable Sensors
Cisuelo O, Haleem MS, Hattersley J and Pecchia L
MEDICON: Mediterranean Conference on Medical and Biological Engineering and Computing, CMBEBIH: International Conference on Medical and Biological Engineering 14 Sep 2023 - 16 Sep 2023.

DOI 10.1007/978-3-031-49068-2_39

QMRO

04-01-2024

Wavelet-based network for high dynamic range imaging
Dai T, Li W, Cao X, Liu J, Jia X, Leonardis A, Yan Y and Yuan S
Computer Vision and Image Understanding, Elsevier vol. 238

DOI 10.1016/j.cviu.2023.103881

01-01-2024

VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning
Xenos A, Foteinopoulou NM, Ntinou I, Patras I and Tzimiropoulos G
CoRR vol. abs/2404.07078
01-01-2024

Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model
Huang J and Benetos E
European Signal Processing Conference, 146-150.
01-01-2024

Towards Efficient Modelling of String Dynamics: A Comparison of State Space and Koopman Based Deep Learning Methods
Diaz R, De La Vega Martin C and Sandler M
Proceedings of the International Conference on Digital Audio Effects, DAFx, 200-207.

QMRO

01-01-2024

Self-Supervised Facial Representation Learning with Facial Region Awareness
Gao Z and Patras I
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2081-2092.

DOI 10.1109/CVPR52733.2024.00203

QMRO

01-01-2024

Real-time Timbre Remapping with Differentiable DSP
Shier J, Saitis C, Robertson A and McPherson A
Proceedings of the International Conference on New Interfaces for Musical Expression.

QMRO

01-01-2024

Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
Oldfield J, Georgopoulos M, Chrysos GG, Tzelepis C, Panagakis Y, Nicolaou MA, Deng J and Patras I
CoRR vol. abs/2402.12550
01-01-2024

Mind the Domain Gap: a Systematic Analysis on Bioacoustic Sound Event Detection
Liang J, Nolasco I, Ghani B, Phan H, Benetos E and Stowell D
European Signal Processing Conference, 1257-1261.
01-01-2024

MOAB: Multi-Modal Outer Arithmetic Block For Fusion Of Histopathological Images And Genetic Data For Brain Tumor Grading
Alwazzan O, Khan A, Patras I and Slabaugh GG
CoRR vol. abs/2403.06349
01-01-2024

MM2Latent: Text-to-facial image generation and editing in GANs with multimodal assistance
Meng D, Tzelepis C, Patras I and Tzimiropoulos G
CoRR vol. abs/2409.11010
01-01-2024

LAFS: Landmark-Based Facial Self-Supervised Learning for Face Recognition
Sun Z, Feng C, Patras I and Tzimiropoulos G
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1639-1649.

DOI 10.1109/CVPR52733.2024.00162

QMRO

01-01-2024

Investigating Adversarial Policy Learning for Robust Agents in Automated Driving Highway Simulations
Pighetti A, Bellotti F, Oh C, Lazzaroni L, Forneris L, Fresta M and Berta R
Lecture Notes in Electrical Engineering. vol. 1110, 124-129.

DOI 10.1007/978-3-031-48121-5_18

QMRO

01-01-2024

Improving Fairness using Vision-Language Driven Image Augmentation
D'Incà M, Tzelepis C, Patras I and Sebe N
WACV, 4683-4692.
01-01-2024

Identification of major hemorrhage in trauma patients in the prehospital setting: diagnostic accuracy and impact on outcome
Wohlgemut JM, Pisirir E, Stoner RS, Kyrimi E, Christian M, Hurst T, Marsh W, Perkins ZB and Tai NRM
Trauma Surgery & Acute Care Open, BMJ vol. 9 (1)

DOI 10.1136/tsaco-2023-001214

QMRO

01-01-2024

Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization
Zhang Z, Liu Z and Patras I
CoRR vol. abs/2408.04983
01-01-2024

Foundation Models for Music: A Survey
Ma Y, Øland A, Ragni A, Sette BMD, Saitis C, Donahue C, Lin C, Plachouras C, Benetos E, Quinton E, Shatri E, Morreale F, Zhang G, Fazekas G, Xia G, Zhang H, Manco I, Huang J, Guinot J, Lin L, et al.
CoRR vol. abs/2408.14340

QMRO

01-01-2024

FashionSD-X: Multimodal Fashion Garment Synthesis using Latent Diffusion
Singh AK and Patras I
CoRR vol. abs/2404.18591
01-01-2024

FOAA: Flattened Outer Arithmetic Attention for Multimodal Tumor Classification
Alwazzan O, Patras I and Slabaugh GG
ISBI, 1-5.
01-01-2024

Efficient Vision-Language pre-training via domain-specific learning for human activities
Bulat A, Ouali Y, Guerrero R, Martinez B and Tzimiropoulos G
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 7978-8000.

DOI 10.18653/v1/2024.emnlp-main.454

QMRO

01-01-2024

Discerning real from synthetic: analysis and perceptual evaluation of sound effects
Garcia N, Zong Y and Reiss J
2024 6th International Conference on Audio for Games, 87-94.
01-01-2024

Differentiable All-Pole Filters for Time-Varying Audio Systems
Yu CY, Mitcheltree C, Carson A, Bilbao S, Reiss JD and Fazekas G
Proceedings of the International Conference on Digital Audio Effects, DAFx, 345-352.
01-01-2024

CemiFace: Center-based Semi-hard Synthetic Face Generation for Face Recognition
Sun Z, Song S, Patras I and Tzimiropoulos G
CoRR vol. abs/2409.18876

QMRO

01-01-2024

CLIPCleaner: Cleaning Noisy Labels with CLIP
Feng C, Tzimiropoulos G and Patras I
ACM Multimedia, 876-885. Editors: Cai J, Kankanhalli MS, Prabhakaran B, Boll S, Subramanian R, Zheng L, Singh VK, César P, Xie L and Xu D.
01-01-2024

Are CLIP features all you need for Universal Synthetic Image Origin Attribution?
Cioni D, Tzelepis C, Seidenari L and Patras I
CoRR vol. abs/2408.09153
01-01-2024

A hybrid Bayesian network for medical device risk assessment and management
Hunte JL, Neil M and Fenton NE
Reliability Engineering & System Safety, Elsevier vol. 241

DOI 10.1016/j.ress.2023.109630

01-01-2024

A Machine learning method to evaluate and improve sound effects synthesis model design
Zong Y, Garcia-Sihuay N and Reiss J
2024 6th International Conference on Audio for Games, 11-19.

QMRO

01-01-2024