Dr Iran Roman
Lecturer
School of Electronic Engineering and Computer Science
Queen Mary University of London
Queen Mary University of London
Research
Theoretical neuroscience, Machine Perception, Artificial Intelligence
Interests
Iran R. Roman is a Lecturer at the School of Electrical Engineering and Computer Science of Queen Mary University of London. Within Queen Mary, he is a member of the Center for Multimodal AI, Center for Digital Music, Center for Human-Centered Computing, the Computer Vision group, and the Cognitive Science group.His research area is machine perception, with the goal of creating algorithms that allow computers to perceive environments as living agents do. To this end, Iran has developed algorithms that leverage multimodal signals to sense, identify, and track objects in the real world. These algorithms draw inspiration from the neural mechanisms that allow living organisms to carry out similar tasks. Iran’s work has found applications in products at companies such as Apple, Tesla, Raytheon/BBN, and Plantronics. His research has been funded by the US National Science Foundation (NSF), the US Defense Advanced Research Projects Agency (DARPA), and the Howard Hughes Medical Institute (HHMI).
On the academic service side, he serves as reviewer for IEEE ICASSP, IEEE MLSP, ISMIR, eLife, and Music & Science. Iran is also a volunteer professor for the National Autonomous University of Mexico, and the organizer of the annual Deep Learning for Music Information Retrieval workshop at the Center for Computer Research in Music and Acoustics at Stanford University.
Publications
Publications of specific relevance to the Centre for Multimodal AI
2025
Pedroza H, Abreu W, Corey R and Roman I (2025). Guitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords and Scales Using a Diverse Array of Hardware. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025.
06-04-2025
06-04-2025
2024
Castelo S, Rulff J, Solunke P, McGowan E, Wu G, Roman I, Lopez R, Steers B, Sun Q, Bello J, Feest B, Middleton M, Mckendrick R and Silva C (2024). HuBar: A Visual Analytics Tool to Explore Human Behavior Based on fNIRS in AR Guidance Systems. IEEE Transactions on Visualization and Computer Graphics, Institute of Electrical and Electronics Engineers (IEEE) vol. 31 (1), 119-129.
09-09-2024
09-09-2024
Roman Guzman I (2024). LEVERAGING REAL ELECTRIC GUITAR TONES AND EFFECTS TO IMPROVE ROBUSTNESS IN GUITAR TABLATURE TRANSCRIPTION MODELING. 27th International Conference on Digital Audio Effects (DAFx24).
03-09-2024
03-09-2024
Roman AS, Roman IR and Bello JP (2024). Robust DoA Estimation from Deep Acoustic Imaging. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
19-04-2024
19-04-2024
Roman IR, Ick C, Ding S, Roman AS, McFee B and Bello JP (2024). Spatial Scaper: A Library to Simulate and Augment Soundscapes for Sound Event Localization and Detection in Realistic Rooms. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
19-04-2024
19-04-2024
2023
Castelo S, Rulff J, McGowan E, Steers B, Wu G, Chen S, Roman I, Lopez R, Brewer E, Zhao C, Qian J, Cho K, He H, Sun Q, Vo H, Bello J, Krone M and Silva C (2023). : Visualization of AI-Assisted Task Guidance in AR. IEEE Transactions on Visualization and Computer Graphics, Institute of Electrical and Electronics Engineers (IEEE) vol. 30 (1), 1313-1323.
02-11-2023
02-11-2023
Kushwaha SS, Roman IR, Fuentes M and Bello JP (2023). Sound Source Distance Estimation in Diverse and Dynamic Acoustic Conditions. 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).
25-10-2023
25-10-2023
Faronbi D, Roman I and Bello JP (2023). Exploring Approaches to Multi-Task Automatic Synthesizer Programming. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
10-06-2023
10-06-2023
Roman IR, Roman AS, Kim JC and Large EW (2023). Hebbian learning with elasticity explains how the spontaneous motor tempo affects music performance synchronization. PLOS Computational Biology, Public Library of Science (PLoS) vol. 19 (6)
07-06-2023
07-06-2023
Roman Guzman I (2023). F0 analysis of Ghanaian pop singing reveals progressive alignment with equal temperament over the past three decades: a case study. 20th Sound and Music Computing Conference, SMC 2023.
01-06-2023
01-06-2023
Large EW, Roman I, Kim JC, Cannon J, Pazdera JK, Trainor LJ, Rinzel J and Bose A (2023). Dynamic models for musical rhythm perception and coordination. Frontiers in Computational Neuroscience, Frontiers vol. 17
17-05-2023
17-05-2023
2022
Liang BS, Liang AS, Roman I, Weiss T, Duinkharjav B, Bello JP and Sun Q (2022). Reconstructing room scales with a single sound for augmented reality displays. Journal of Information Display, Taylor & Francis vol. 24 (1), 1-12.
15-11-2022
15-11-2022
Roman Guzman I (2022). Analyzing the effect of equal-angle spatial discretization on sound event localization and detection. Detection and Classification of Acoustic Scenes and Events 2022.
03-11-2022
03-11-2022
2021
Roman Guzman I and Bello J (2021). micarraylib: Software for Reproducible Aggregation, Standardization, and Signal Processing of Microphone Array Datasets. Detection and Classification of Acoustic Scenes and Events 2021.
15-11-2021
15-11-2021