Haruto Yoshida

1st-year master’s student

About Me

Haruto Yoshida is a 1st-year master’s student at Tohoku NLP Group. His research interests include artificial intelligence, natural language processing, and computer vision. He aims to develop a model that integrates visual and linguistic information so that both can be handled in the same way.

Interests
  • Artificial Intelligence
  • Natural Language Processing
  • Computer Vision
Education
  • Bachelor of Engineering

    Tohoku University

My Research

I am conducting research on Vision and Language as part of the Tohoku NLP Group. My main focus is on the automatic generation of diagrams and the interpretation of diagrams by multimodal large language models (MLLMs). I am also working on the evaluation of generated videos.

If you’re interested, feel free to reach out!

Recent Publications
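(2025). ASCII Challenge: Can LLMs Become Painters? In NLP2025.
(2025). Sketch2Diagram: Diagram Generation from Visual Instructions. In NLP2025.
(2025). Analysis of the Internal Representations of Large Vision-Language Models Toward Diagram Understanding. In NLP2025.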
(2024). How Well Do Vision Models Encode Diagram Attributes? In ACL2024 SRW.