AI & ML interests
We focus on Natural Language Processing and Multimodal Learning, exploring generative AI across different modalities.
Papers
Beyond APIs: Probing the Limits of MLLMs in Physical Tool Use
Optical Reasoning: Rethinking Images as an Expressive Reasoning Medium Beyond Text