Research
My interests revolve around the convergence of natural language processing and computer vision, with a focus on gaining insights from human cognition. I am enthusiastic about exploring language grounding within multimodal contexts and investigating the linguistic and cognitive characteristics of models.
Representative papers are highlighted.
(*: Equal Contribution)
|