Publications

You can also find my articles on my Google Scholar profile.

Papers


Speculative Decoding for Multimodal Models: A Survey

Published in Preprint; target venue: Transactions on Machine Learning Research (TMLR), 2026

A survey of speculative decoding techniques for accelerating inference in multimodal models.

Recommended citation: Zhang, Y., Wang, Y., Hsieh, Y., Wang, X., Zhang, P., Yang, Z., Ma, J., Zhao, Z., Zheng, B., Chan, H.T., Li, J., Liu, X., Gao, K., Liu, R., Zhang, J., Li, J., Wan, Z., Zhang, Z., Xiong, J., Zhu, S., Cao, H., & Shen, H. (2026). "Speculative Decoding for Multimodal Models: A Survey." Preprint; target venue: Transactions on Machine Learning Research (TMLR). [Under Review.]

A Cloud–Edge System for Multimodal Clinical Screening in Resource-Constrained Rural Settings

Published in Submitted to Machine Learning for Healthcare (MLHC) 2026, 2026

A cloud–edge ML system for multimodal clinical screening in rural settings with limited compute and connectivity.

Recommended citation: Chan, H.T., Wu, C., Liu, X., Zhao, Z., Zheng, B., Mao, Z., Nakayama, L.F., Morley, M.G., Shen, L., & Chen, J. (2026). "A Cloud–Edge System for Multimodal Clinical Screening in Resource-Constrained Rural Settings." Submitted to Machine Learning for Healthcare (MLHC) 2026. [First author; Under Review.]