Selected Publications and Patents

Publications

* corresponding author; equal contributions

2026

Bian, K., Guo, X., Ruan, L., Chen, B., Shen, Y., Dang, T., Jia, H.

Pocket-Dentist: Efficient Multimodal QA for On-Device Dental Image Understanding

ICML 2026 Workshop EMMQA

Xue, X., Yu, Y., Gao, Y., Wang, J., Chen, B., Ruan, L., Dang, T., Jia, H.

Efficient Multimodal Clinical Question Answering for Pulmonary Embolism Risk Assessment

ICML 2026 Workshop EMMQA

Yu, X., Dong, J., Honorio, J., Ghosh, A., Jia, H., Dang, T.

Disentangling Reasoning in Large Audio-Language Models for Ambiguous Emotion Prediction

INTERSPEECH 2026

Xiao, Y., Mahmudi, A., Thieberger, N., Ambikairajah, E., Holden, E. J., Dang, T.

Continual Adaptation for Pacific Indigenous Speech Recognition

INTERSPEECH 2026

Sun, J., Xiao, Y., Chung, S. K., Hu, Q., Huang, G., Holden, E. J., Dang, T.

Activation Steering for Accent Adaptation in Large Audio Language Models

INTERSPEECH 2026

Ding, H., Xiao, Y., Dong, J., Dang, T.

ImKWS: Test-Time Adaptation for Keyword Spotting with Class Imbalance

INTERSPEECH 2026

Chung, S. K., Dong, J., Hu, Q., Sun, J., Huang, G., Jia, H., Dang, T.

Localizing and Editing Knowledge in Large Audio-Language Models

INTERSPEECH 2026

Chen, Y., Xiao, Y., Yin, H., Liu, X., Huang, J., Dang, T.

PolyBench: A Benchmark for Compositional Reasoning in Polyphonic Audio

INTERSPEECH 2026

Yin, H., Xiao, Y., Das, R. K., Bai, J., Dang, T.

The First Environmental Sound Deepfake Detection Challenge: Benchmarking Robustness, Evaluation, and Insights

INTERSPEECH 2026

Xue, X., Lu, J., Gao, Y., Huang, G., Dang, T., Jia, H.

Edge–Cloud Collaborative Speech Emotion Captioning via Token-Level Speculative Decoding in Audio-Language Models

INTERSPEECH 2026

Adelson, T., Dang, T., Sethu, V.

Beyond Deep Learning: Speech Segmentation and Phones Classification with Neural Assemblies

INTERSPEECH 2026 (Long Paper Track)

Guo, X., Zhao, C., Jia, H., Dang, T., Huang, G., Zheng, X., Gao, Y.

Adaptive Federated Fine-Tuning of Self-Supervised Speech Representations

INTERSPEECH 2026

Lin, Z., Bai, Z., Bai, J., Li, Z., Dang, T., Huang, G., Chen, J., Benesty, J.

BG-CRNN: Boundary-Guided Dynamic Attention for Sound Event Detection in Complex Scenarios

INTERSPEECH 2026

Mahmudi, A., Dang, T., Vylomova, E., Thieberger, N.

Easper: An Accessible ASR Pipeline for Language Documentation

INTERSPEECH 2026

Yang, J., Zhang, S., Deng, Y., Li, P., Dang, T., Huang, G., Chen, J., Benesty, J.

A Fusion-Aware Two-Stage Framework for Mispronunciation Detection and Diagnosis in Low-Resource Modern Standard Arabic

INTERSPEECH 2026

Mylvaganam, P., Dang, T., Ambikairajah, E., Sethu, V., Wu, J.

Hybrid Continual Learning for Low-Resource Australian Aboriginal Language Identification

INTERSPEECH 2026

Mylvaganam, P., Ambikairajah, E., Dang, T., Sethu, V., Szalay, T.

Which Languages Transfer Best to Warlpiri? A Similarity-Based Study for Low-Resource ASR

INTERSPEECH 2026

Xiang, J., Wu, J., Dang, T., Sethu, V., Ambikairajah, E.

Revisiting Delay Compensation via Feature-Level Temporal Accumulation in Continuous Emotion Recognition

INTERSPEECH 2026

Wang, S., Bailey, J., Dang, T.

A Geometric Perspective on Composable Emotion Steering in Text-to-Speech Models

ICML 2026 ML for Audio Workshop

Chen, D., Hu, Q., Xiao, Y., Dang, T.*, Jia, H.

Titans-as-a-Layer: Test-Time Memory for Conversational Speech Emotion Recognition

ICML 2026 ML for Audio Workshop

Yin, H., Xiao, Y., Kwon, Y., Dang, T., Choi, J.

Focus Then Listen: An Empirical Study of Plug-and-Play Audio Enhancer for Noise-Robust Large Audio Language Models

ICML 2026 ML for Audio Workshop

Huang, J., Dang, T., Almoznino, G., Capurro, D.

Fine-tuning Large Language Models on Serialized EHRs for Generating Synthetic Longitudinal EHRs

AMIA 2026 Annual Symposium (Poster)

Liu, M., Lu, F., Wang, M., Zhou, J., Liu, L., Dang, T., Chi, L., Kumar, D. K., Ma, J., Xia, F.

Spider-Brain: Spike-inspired Graph Learning of Asynchronous and Hierarchical Brain Dynamics

IEEE Transactions on Cognitive and Developmental Systems

Chen, S., Dang, T., Qian, M., Liang, H., Bedada, D. T., Louw, Q. A., Moore, A., Cardinal, R. N., Ford, T. J., Jiang, F.

Machine learning and natural language processing for the identification of potential mental disorders among school-age children: a prospective birth cohort study

BMC Medicine

Zhang, X., Yin, H., Xiao, Y., Zhang, L., Dang, T., Das, R. K., Li, M.

Overview of ESDD2: Environment-Aware Speech and Sound

ICME 2026

Wang, S., Tan, S., Liu, S., Huang, G., Jia, H., Bailey, J., Dang, T.

CoCoEmo: Composable and Controllable Human-Like Emotional TTS via Activation Steering

ICML 2026

Yin, P., Huang, J., Xu, Z., Capurro, D., Conway, M., Dang, T.

X-FEMR: A Token-level Explainable Approach for Electronic Health Records Foundation Models using Transformer

IJCAI 2026

Xue, X., Zhang, T., Kostakos, V., Dang, T., Jia, H.

Multi-Task Mental Health Detection with Large Language Models under Class Imbalance

MobiSys Workshop EIFCOM 2026

Hu, C., Pham, H., Dang, T., Li, J., Balan, R., Ma, D.

Poster: Turning Budget Earphones into Hi-Fi with Hardware-Aware Learning

SenSys 2026

Liu, M., Wang, C., Dong, Q., Ren, J., Dang, T., Saikrishna, V., Xia, F.

Multi-Scale Diffusion for Bio-topological Representation Learning on Multimodal Brain Graphs

ACM Transactions on Intelligent Systems and Technology 2026

Dong, J., Jia, H., Dang, T.

Test Time Adaptation for Speech Emotion Recognition

ICASSP 2026

Zhang, W., Jin, H., Wang, S., Wei, Z., Dang, T.

Scaling Ambiguity: Augmenting Human Annotation in Speech Emotion Recognition with Audio-Language Models

ICASSP 2026

🏆 Outstanding student paper award

Dang, T., Chatterjee, S., Jia, H., Wu, Y., Salim, F., Kawsar, F.

AdaNODEs: Test Time Adaptation for Time Series Forecasting Using Neural ODEs

ICASSP 2026

Yin, H., Xiao, Y., Das, R., Bai, J., Dang, T.

Environmental Sound Deepfake Detection Challenge: An Overview

ICASSP 2026

Hu, C., Pham, H., Dang, T., Li, J., Balan, R., Ma, D.

From Cheap to Chic: Enhancing Music Playback Quality of Budget Earphones via Hardware-Aware Learning

SenSys 2026

2025

Dong, J., Jia, H., Chatterjee, S., Ghosh, A., Bailey, J., Dang, T.

E-BATS: Efficient Backpropagation-Free Test-Time Adaptation for Speech Foundation Models

NeurIPS 2025

Gao, Y., Scamarcia, M., Fernandez-Marques, J., Naseri, M., Ng, C., Stripelis, D., Li, Z., Shen, T., Bai, J., Chen, D., Zhang, Z., Hu, R., Song, I., KangYoon, L., Jia, H., Dang, T., Wang, J., Liu, Z., Beutel, D., Lyu, L., Lane, N.

FlowerTune: A Cross-Domain Benchmark for Federated Fine-Tuning of Large Language Models

NeurIPS 2025 Datasets and Benchmarks

Wang, X., Dang, T., Zhang, X., Kostakos, V., Witbrock, M. J., Jia, H.

HealthSLM-Bench: Benchmarking Small Language Models for Mobile and Wearable Healthcare Monitoring

NeurIPS 2025 GenAI4Health Workshop

Zhang, S., Jia, H., Li, S., Dang, T., Hu, Y., Yi, X., Li, H.

Position: Human-Robot Interaction in Embodied Intelligence Demands a Shift From Static Privacy Controls to Dynamic Learning

NeurIPS 2025 LAW Workshop

Wu, J., Dang, T., Sethu, V., Ambikairajah, E.

How many raters do we need? Analyses of uncertainty in estimating ambiguity-aware emotion labels

IEEE Transactions on Affective Computing

Liu, M., Wang, C., Chen, L., Le, N., Tewari, N., Dang, T., Ma, J., Xia, F.

Structure Matters: Brain Graph Augmentation via Learnable Edge Masking for Data-efficient Psychiatric Diagnosis

AJCAI 2025

Liu, M., Zhu, M., Dong, Q.,Dang, T.*, Ma, J., Ren, J., Xia, F.

Data-Efficient Psychiatric Disorder Detection via Self-supervised Learning on Frequency-enhanced Brain Networks

ACM Transactions on Computing for Healthcare

Xiao, Y., Dang, T., Das, R.

RawTFNet: A Lightweight CNN Architecture for Speech Anti-spoofing

APSIPA ASC 2025

Jia, H., Fu, S., Xia, F., Kostakos, V., Dang, T.

Beyond Scale: Small Language Models are Comparable to GPT-4 in Mental Health Understanding

ACII 2025 LBR

Wei, X., Dang, T.†*, Al-Naimi, K., Liu, Y., Kawsar, F., Montanari, A.

Listening to the Mind: Earable Acoustic Sensing of Cognitive Load

Companion of ACM UbiComp/ISWC 2025

Meadia Coverage

Zhang, S., Ma, Y., Hu, Y., Dang, T., Jia, H., Yi, X., Li, H.

From Patient Burdens to User Agency: Designing for Real-Time Protection Support in Online Health Consultations

Companion of ACM UbiComp/ISWC 2025

Jia, H., Chatterjee, S., Keikhosrokiani, P., Dang, T.

WellComp 2025: Eighth International Workshop on Computing and Software Systems for Well-Being

Companion of ACM UbiComp/ISWC 2025

Tang, X., Huang, J., Lin, Y., Dang, T., Cheng, J.

Speech Emotion Recognition Via CNN-Transformer and Multidimensional Attention Mechanism

Speech Communication

Vavaroutas, S., Dang, T., Rocheteau, E., Mascolo, C.

SQUIREDL: Sparse Sequence-to-Sequence Uncertainty Estimation in Evidential Deep Learning

ACM Transactions on Computing for Healthcare

Hong, X., Gong, Y., Sethu, V., Dang, T.

AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models

ICASSP 2025

Quan, J., Al-Naimi, K., Wei, X., Liu, Y., Montanari, A., Dang, T.

Cognitive Load Monitoring via Earable Acoustic Sensing

ICASSP 2025

Meadia Coverage

2024

Jia, H., Kwon, Y., Orsino, A., Dang, T., Talia, D., Mascolo, C.

TinyTTA: Efficient Test-time Adaptation via Early-exit Ensembles on Edge Devices

NeurIPS 2024

Wang, X., Dang, T., Kostakos, V., Jia, H.

Efficient and Personalized Mobile Health Event Prediction via Small Language Models

MobiCom Workshop EIFCom 2024

Wu, J., Dang, T., Sethu, V., Ambikairajah, E.

Emotion Recognition Systems Must Embrace Ambiguity

ACII Satellite Workshop EASE 2024

Wu, Y., Dang, T., Spathis, D., Jia, H., Mascolo, C.

StatioCL: Contrastive Learning for Time Series via Non-Stationary and Temporal Contrast

ACM International Conference on Information and Knowledge Management (CIKM) 2024

Hu, Y., Zhang, S., Dang, T., Jia, H., Salim, FD., Hu, W., Quigley, AJ.

Exploring Large-Scale Language Models to Evaluate EEG-Based Multimodal Data for Mental Health

UbiComp Workshop WellComp 2024

Wu, J., Dang, T., Sethu, V., Ambikairajah, E.

Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction

INTERSPEECH 2024

Shahid, I., Al-Naimi, K., Dang, T., Liu, Y., Kawsar, F., Montanari, A.

Towards Enabling DPOAE Estimation on Single-Speaker Earbuds

ICASSP 2024

Romero, J., Ferlini, A., Spathis, D., Dang, T., Farrahi, K., Kawsar, F., Montanari, A.

OptiBreathe: An Earable-based PPG System for Continuous Respiration Rate, Breathing Phase, and Tidal Volume Monitoring

HotMobile 2024

Xia, T., Dang, T., Han, J., Qendro, L., Mascolo, C.

Uncertainty-aware Health Diagnostics via Class-balanced Evidential Deep Learning

IEEE Journal of Biomedical and Health Informatics

Demirel, BU., Dang, T., Al-Naimi, K., Kawsar, F., Montanari, A.

Unobtrusive Air Leakage Estimation for Earables with In-ear Microphones

UbiComp, 2024

2023

Wu, J., Dang, T., Sethu, V., Ambikairajah, E.

Belief Mismatch Coefficient (BMC): A Novel Interpretable Measure of Prediction Accuracy for Ambiguous Emotion States.

Affective Computing and Intelligent Interaction (ACII), 2023.

🏆 Best paper award

Dang, T., Ghosh, A., Spathis, D., Mascolo, C.

Human-centered AI for mobile health sensing: challenges and opportunities

Royal Society Open Science

2023

Special selection

Dang, T., Han, J., Xia, T., Bondareva, E., Brown, C., Chauhan, J., Grammenos, A., Spathis, D., Cicuta, P., Mascolo, C.

Conditional Neural ODE Processes for Individual Disease Progression Forecasting: A Case Study on COVID-19

ACM SIGKDD on Knowledge Discovery and Data Mining (KDD) 2023.

[Promotion video]

Butkow, K., Dang, T., Ferlini, A., Ma, D., Mascolo, C.

hEARt: Motion-resilient Heart Rate Monitoring with In-ear Microphones

IEEE International Conference on Pervasive Computing and Communications (PerCom) 2023

Wickramasinghe, B., Ambikairajah, E., Sethu, V., Epps, J., Li, H., Dang, T.

DNN controlled adaptive front-end for replay attack detection systems

Speech Communication

154, 102973, 2023.

Wu, J., Dang, T., Sethu, V., Ambikairajah, E.

From Interval to Ordinal: A HMM based Approach for Emotion Label Conversion

Interspeech 2023

Dang, T., Dimitriadis, A., Wu, J., Sethu, V., Ambikairajah, E.

Constrained dynamical neural ode for time series modelling: A case study on continuous emotion prediction.

ICASSP, 2023.

[Poster]

🏆 Top 3% paper award

Han, J., Montagna, M., Grammenos, A., Xia, T., Bondareva, E., Brown, C., Chauhan, J., Dang, T., Spathis, D., Floto, A., Cicuta, P., Mascolo, C.

Evaluating Listening Performance for COVID-19 Detection between Clinicians and Machine Learning: A Comparative Study

Journal of Medical Internet Research, 2023

Wu, J., Dang, T., Sethu, V., Ambikairajah, E.

Multimodal Affect Models: An investigation of relative salience of audio and visual cues for emotion prediction

Frontiers in Computer Science, 2021.

Hu, C., Ma, X., Ma, D., Dang, T.

Lightweight and Non-invasive User Authentication on Earables

HotMobile 2023

2022 and before

Dang, T., Han, J., Xia, T., Spathis, D., Bondareva, E., Brown, C., Chauhan, J., Grammenos, A., Hasthanasombat, A., Floto, A., Cicuta, P., Mascolo, C.

Exploring longitudinal cough, breath, and voice data for COVID-19 progression prediction via sequential deep learning: model development and validation.

Journal of Medical Internet Research, 2023

Media Coverage

Xia, T., Han, J., Qendro, L., Dang, T., Mascolo, C.

Hybrid-EDL: Improving Evidential Deep Learning for Uncertainty Quantification on Imbalanced Data

NeurIPS Workshop TSRML 2022

Wu, J., Dang, T., Sethu, V., Epps, J., Ambikairajah, E.

A Novel Sequential Monte Carlo Framework for Predicting Ambiguous Emotion States

ICASSP 2022

Han, J., Xia, T., Spathis, D., Bondareva, E., Brown, C., Chauhan, J., Dang, T., Grammenos, A., Hasthanasombat, A., Floto, A., Cicuta, P., Mascolo, C.

Sounds of COVID-19: exploring realistic performance of audio-based digital testing.

NPJ digital medicine, 2022

Media Coverage

Xia, T., Spathis, D., Ch, J., Grammenos, A., Han, J., Hasthanasombat, A., Bondareva, E., Dang, T., Floto, A., Cicuta, P., Mascolo, C.

COVID-19 Sounds: A Large-Scale Audio Dataset for Digital COVID-19 Detection

NeurIPS Datasets and Benchmarks Track, 2021

Xia, T., Han, J., Qendro, L., Dang, T., Mascolo, C.

Uncertainty-Aware COVID-19 Detection from Imbalanced Sound Data

Interspeech 2021

B., D., Dang, T., Sethu, V., Ambikairajah, E., Fernando, S.

A Novel Bag-of-Optimised-Clusters Front-End for Speech based Continuous Emotion Prediction

Affective Computing and Intelligent Interaction(ACII), 2019

Ouyang, A., Dang, T., Sethu, V., Ambikairajah, E.

Speech Based Emotion Prediction: Can a Linear Model Work?

Interspeech 2019

Dang, T., Sethu, V., Ambikairajah, E.

Compensation techniques for speaker variability in continuous emotion prediction

IEEE Transaction on Affective Computing

2018.

Dang, T., Sethu, V., Ambikairajah, E.

Dynamic multi-rater Gaussian Mixture Regression incorporating temporal dependencies of emotion uncertainty using kalman filters

ICASSP 2018

Dang, T., Sethu, V., Epps, J., Ambikairajah, E.

An investigation of Emotion Prediction Uncertainty Using Gaussian Mixture Regression

Interspeech 2017

Dang, T., Stasak, B., Huang, Z., Jayawardena, S., Atcheson, M., Hayat, M., Le, P., Sethu, V., Goecke, R., Epps, J.

Investigating Word affect Features and Fusion of Probabilistic Predictions Incorporating Uncertainty in AVEC 2017

the 7th Annual Workshop on Audio/Visual Emotion Challenge, ACM Multimedia, 2017

Dang, T., Sethu, V., Ambikairajah, E.

Factor Analysis Based Speaker Normalisation for Continuous Emotion Prediction

Interspeech 2016

Huang, Z., Stasak, B., Dang, T., Wataraka Gamage, K., Le, P., Sethu, V., Epps, J.

Staircase Regression in OA RVM, Data Selection and Gender Dependency in AVEC 2016

In Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, ACM Multimedia, 2015

Huang, Z., Dang, T., Cummins, N., Stasak, B., Le, P., Sethu, V., Epps, J.

An investigation of annotation delay compensation and output-associative fusion for multi-modal continuous emotion prediction

the 5th International Workshop on Audio/Visual Emotion Challenge, ACM Multimedia, 2015

Patents

Thangarajan, A., Al-Naimi, K., Dang, T., Ferlini, A., Liu, Y., Montanari, A.

Power Saving

U.S. Patent Application No. 19/056,213, filed 2025.

Dang, T., Al-Naimi, K., Thangarajan, A., Liu, Y., Ferlini, A., Montanari, A.

Selecting Candidate Devices

U.S. Patent Application No. 18/915,223, filed 2024.

Al-Naimi, K., Montanari, A., Ferlini, A., Dang, T., Demirel, B. U.

Cancellation of Ultrasonic Signals

U.S. Patent Application No. 18/676,687, filed 2024.