Dr Iran Roman

Lecturer

School of Electronic Engineering and Computer Science
Queen Mary University of London

i.roman@qmul.ac.uk

Centre for Multimodal AI
Centre for Human-Centred Computing

Research
Publications

Research

Theoretical neuroscience, Machine Perception, Artificial Intelligence

Interests

Iran R. Roman is a Lecturer at the School of Electrical Engineering and Computer Science of Queen Mary University of London. Within Queen Mary, he is a member of the Center for Multimodal AI, Center for Digital Music, Center for Human-Centered Computing, the Computer Vision group, and the Cognitive Science group.

His research area is machine perception, with the goal of creating algorithms that allow computers to perceive environments as living agents do. To this end, Iran has developed algorithms that leverage multimodal signals to sense, identify, and track objects in the real world. These algorithms draw inspiration from the neural mechanisms that allow living organisms to carry out similar tasks. Iran’s work has found applications in products at companies such as Apple, Tesla, Raytheon/BBN, and Plantronics. His research has been funded by the US National Science Foundation (NSF), the US Defense Advanced Research Projects Agency (DARPA), and the Howard Hughes Medical Institute (HHMI).

On the academic service side, he serves as reviewer for IEEE ICASSP, IEEE MLSP, ISMIR, eLife, and Music & Science. Iran is also a volunteer professor for the National Autonomous University of Mexico, and the organizer of the annual Deep Learning for Music Information Retrieval workshop at the Center for Computer Research in Music and Acoustics at Stanford University.

Publications

Publications of specific relevance to the Centre for Multimodal AI

2025

Guitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords and Scales Using a Diverse Array of Hardware
Pedroza H, Abreu W, Corey R and Roman I
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025.
06-04-2025

2024

HuBar: A Visual Analytics Tool to Explore Human Behavior Based on fNIRS in AR Guidance Systems
Castelo S, Rulff J, Solunke P, McGowan E, Wu G, Roman I, Lopez R, Steers B, Sun Q, Bello J, Feest B, Middleton M, Mckendrick R and Silva C
Ieee Transactions on Visualization and Computer Graphics, Institute of Electrical and Electronics Engineers (Ieee) vol. 31 (1), 119-129.

DOI 10.1109/tvcg.2024.3456388

09-09-2024

LEVERAGING REAL ELECTRIC GUITAR TONES AND EFFECTS TO IMPROVE ROBUSTNESS IN GUITAR TABLATURE TRANSCRIPTION MODELING
Roman Guzman I
27th International Conference on Digital Audio Effects (DAFx24).
03-09-2024

Robust DoA Estimation from Deep Acoustic Imaging
Roman AS, Roman IR and Bello JP
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 1321-1325.

DOI 10.1109/icassp48485.2024.10447883

19-04-2024

Spatial Scaper: A Library to Simulate and Augment Soundscapes for Sound Event Localization and Detection in Realistic Rooms
Roman IR, Ick C, Ding S, Roman AS, McFee B and Bello JP
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 1221-1225.

DOI 10.1109/icassp48485.2024.10446118

19-04-2024

2023

: Visualization of AI-Assisted Task Guidance in AR
Castelo S, Rulff J, McGowan E, Steers B, Wu G, Chen S, Roman I, Lopez R, Brewer E, Zhao C, Qian J, Cho K, He H, Sun Q, Vo H, Bello J, Krone M and Silva C
Ieee Transactions on Visualization and Computer Graphics, Institute of Electrical and Electronics Engineers (Ieee) vol. 30 (1), 1313-1323.

DOI 10.1109/tvcg.2023.3327396

02-11-2023

Sound Source Distance Estimation in Diverse and Dynamic Acoustic Conditions
Kushwaha SS, Roman IR, Fuentes M and Bello JP
2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). vol. 00, 1-5.

DOI 10.1109/waspaa58266.2023.10248194

25-10-2023

Exploring Approaches to Multi-Task Automatic Synthesizer Programming
Faronbi D, Roman I and Bello JP
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). vol. 00, 1-5.

DOI 10.1109/icassp49357.2023.10095540

10-06-2023

Hebbian learning with elasticity explains how the spontaneous motor tempo affects music performance synchronization
Roman IR, Roman AS, Kim JC and Large EW
Plos Computational Biology, Public Library of Science (Plos) vol. 19 (6)

DOI 10.1371/journal.pcbi.1011154

07-06-2023

F0 analysis of Ghanaian pop singing reveals progressive alignment with equal temperament over the past three decades: a case study
Roman Guzman I
20th Sound and Music Computing Conference, SMC 2023.
01-06-2023

Dynamic models for musical rhythm perception and coordination
Large EW, Roman I, Kim JC, Cannon J, Pazdera JK, Trainor LJ, Rinzel J and Bose A
Frontiers in Computational Neuroscience, Frontiers vol. 17

DOI 10.3389/fncom.2023.1151895

17-05-2023

2022

Reconstructing room scales with a single sound for augmented reality displays
Liang BS, Liang AS, Roman I, Weiss T, Duinkharjav B, Bello JP and Sun Q
Journal of Information Display, Taylor & Francis vol. 24 (1), 1-12.

DOI 10.1080/15980316.2022.2145377

15-11-2022

Analyzing the effect of equal-angle spatial discretization on sound event localization and detection
Roman Guzman I
Detection and Classification of Acoustic Scenes and Events 2022.
03-11-2022

2021

micarraylib: Software for Reproducible Aggregation, Standardization, and Signal Processing of Microphone Array Datasets.
Roman Guzman I and Bello J
Detection and Classification of Acoustic Scenes and Events 2021.
15-11-2021