[1] Lai, Yuzhi, Mario Radke, et al. "Intuitive Multi-modal Human-Robot Interaction via Posture and Voice." Robotics, Computer Vision and Intelligent Systems (ROBOVIS 2024), edited by J. Filipe and J. Röning. Communications in Computer and Information Science, vol. 2077. Springer, Cham (2024). doi.org/10.1007/978-3-031-59057-3_28
[2] Lai, Yuzhi, et al. "NVP-HRI: Zero shot natural voice and posture-based human-robot interaction via large language model." Expert Systems with Applications (2025): 126360.
[3] Lai, Yuzhi, et al. "Natural multimodal fusion-based human–robot interaction: Application with voice and deictic posture via large language model." IEEE Robotics & Automation Magazine (2025).