About this role
About the Job

We are looking for a highly motivated and accomplished Research Assistant to join our team and contribute significantly to our ongoing research project, "Understanding and Generating Interleaved Image-Text Persuasion". The successful candidate will collaborate closely with our research team on the development of multimodal AI. This role offers an opportunity to engage in cutting-edge research in multimodal learning and to have a substantial influence on the field.

What You’ll Do

• Conduct literature reviews on multimodal persuasion, visual communication, and the evaluation of large vision-language models.
• Support the design and development of datasets and evaluation frameworks for understanding interactions between images and text.
• Assist in evaluating and enhancing multimodal models for analysing and generating interleaved image-text content.
• Summarize experimental results and contribute to the preparation of research reports, presentations, and academic publications.

Who We’re Looking For

• Bachelor's degree with a background in computer science, data science, artificial intelligence, or a related field.
• Strong interest in multimodal AI and vision-language models.
• Basic experience with Python and machine learning frameworks such as PyTorch is preferred.
• Familiarity with natural language processing, computer vision, or large language models is an advantage.
• Good analytical, communication, and organizational skills.
• Ability to work independently while collaborating effectively with a research team.