Haruto Yoshida is a 1st-year master’s student at Tohoku NLP Group. His research interests include artificial intelligence, natural language processing and computer vision. He aims to develop a model that integrates visual and linguistic information, enabling them to be handled in the same way.
Bachelor of Engineering
Tohoku university
I am conducting research on Vision and Language as part of the Tohoku NLP Group. My main focus is on the automatic generation of diagrams and the interpretation of diagrams by multimodal large language models (MLLMs). Additionally, I am also working on research related to the evaluation of generated videos.
If you’re interested, feel free to reach out!