Haruto Yoshida
Haruto Yoshida

1st-year master’s student

About Me

Haruto Yoshida is a 1st-year master’s student at Tohoku NLP Group. His research interests include artificial intelligence, natural language processing and computer vision. He aims to develop a model that integrates visual and linguistic information, enabling them to be handled in the same way.

Interests
  • Artificial Intelligence
  • Natural Language Processing
  • Computer Vision
Education
  • Bachelor of Engineering

    Tohoku university

My Research

I am conducting research on Vision and Language as part of the Tohoku NLP Group. My main focus is on the automatic generation of diagrams and the interpretation of diagrams by multimodal large language models (MLLMs). Additionally, I am also working on research related to the evaluation of generated videos.

If you’re interested, feel free to reach out!

Featured Publications
Recent Publications
(2024). How Well Do Vision Models Encode Diagram Attributes?. In ACL2024 SRW.
(2024). 自然画像で学習された画像埋め込みにダイアグラムを特徴づける情報は含まれているか?. In NLP2024.
(2023). テキストに基づくダイアグラム生成タスクの提案. In YANS2023.