Roberto Amoroso

Roberto Amoroso

ELLIS PhD Student, AI & Computer Vision
International Doctorate in ICT

NVIDIA

LMU Munich

AImageLab UNIMORE

About me

My name is Roberto Amoroso. I am an ELLIS PhD student enrolled in the International Doctorate in ICT program at the AImageLab research group of the University of Modena and Reggio Emilia 🇮🇹, under the supervision of Prof. Rita Cucchiara and Prof. Lorenzo Baraldi. I am engaged in studying and developing novel Deep Learning and Computer Vision techniques.

I am currently a Machine Learning Engineering Intern at NVIDIA in Munich, Germany 🇩🇪, working on Multimodal Video Understanding for Autonomous Vehicles.

I was a PhD Intern at LMU - Ludwig-Maximilians-Universität of Munich, in Germany 🇩🇪, focusing on Multimodal LLM for Video Question Answering and Open-vocabulary Image Segmentation, under the co-supervision of Prof. Volker Tresp.

Prior to joining AImageLab, I was Research Scholar at the Networking Research Group in Saint Louis, USA 🇺🇸, working on Super-resolution techniques applied to Internet traffic matrices.

My primary areas of research are Open-vocabulary Segmentation and Multimodal Video Understanding. In addition, I have also conducted research on the pre-training and optimization of Transformer-based architecture for image classification, self-supervised learning, deepfake detection of synthetic images, and the development of image watermarking systems for artwork protection.

Feel free to reach me out if you have any questions or curiosities! :)

Interests
  • Computer Vision
  • Deep Learning
  • Machine Learning
  • Open-vocabulary Image Segmentation
  • Multimodal Video Understanding
Education
  • ELLIS PhD in AI and Computer Vision, 2024

    University of Modena and Reggio Emilia

  • MS in Artificial Intelligence, 2020

    University of Modena and Reggio Emilia

  • BS in Computer Engineering, 2018

    University of Modena and Reggio Emilia

Recent News

All news »

  • [Oct. 2024] Our paper “Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries” has been accepted @ WACV 2025

  • [Oct. 2024] Our paper “Video Search: A Large-scale Video-Text Retrieval System for AV” has been accepted @ NTECH 2024 NVIDIA conference

  • [Feb. 2024] Started a new position as Machine Learning Engineering Intern @ NVIDIA in Munich, Germany 🇩🇪, working on Multimodal Video Understanding for Autonomous Vehicles

  • [Sep. 2023] I attended the ELLIS Doctoral Symposium 2023 (EDS2023) in Helsinki, Finland 🇫🇮

  • [Jun. 2023] Started my PhD Internship @ LMU in Munich, Germany 🇩🇪, working on Multimodal Video Perception under the co-supervision of Prof. Volker Tresp

  • [Mar. 2022] I gave a short course about “AI & HPC for Industries” on AI, computer vision, and HPC organized by CINECA and Leonardo AI Labs as part of the EuroCC Italy project

  • [Feb. 2021] I joined the AImageLab group at UNIMORE in Italy 🇮🇹 as a Research Fellow

All news »

Experience

 
 
 
 
 
NVIDIA
Machine Learning Engineering Intern
Jan 2024 – Present Munich, Germany 🇩🇪
Research activity focused on the engineering, development, and deployment of Multimodal Video Understanding techniques for Autonomous Vehicles.
 
 
 
 
 
LMU @ Ludwig-Maximilians-Universität of Munich
PhD Intern
Jun 2023 – Nov 2023 Munich, Germany 🇩🇪
Research activity focused on the development of novel Multimodal LLM for Video Question Answering and Open-vocabulary Image Segmentation techniques, under the co-supervision of Prof. Volker Tresp.
 
 
 
 
 
AImageLab @ University of Modena and Reggio Emilia
ELLIS PhD Student | International Doctorate in ICT
Nov 2021 – Present Modena, Italy 🇮🇹
  • The European Laboratory for Learning and Intelligent Systems (ELLIS) supports cutting-edge machine learning research in Europe. ELLIS PhD students (<5% 2021 acceptance rate) are selected on the basis of academic achievement. My research activity is focused on multimodal machine learning, image segmentation, image classification, self-supervised learning, video retrieval, and video question answering.
  • 1st in the ranking of student candidates for the International Doctorate in ICT.
 
 
 
 
 
AImageLab @ University of Modena and Reggio Emilia
Research Fellow
Feb 2021 – Nov 2021 Modena, Italy 🇮🇹
Research activity under the supervision of Prof. Rita Cucchiara and Prof. Lorenzo Baraldi, aimed at the study, analysis, and development of novel Computer Vision and Deep Learning techniques.
 
 
 
 
 
CINI - Consorzio Interuniversitario Nazionale per l’Informatica
Research Engineer
Nov 2020 – Jan 2021 Modena, Italy 🇮🇹
Development of a web platform for the management of data concerning the activities of European research centers, as part of the HumanE-AI-NET project, funded by the EU Framework Programme for Research and Innovation Horizon 2020.
 
 
 
 
 
Saint Louis University
Research Scholar
Mar 2020 – Sep 2020 St. Louis, USA 🇺🇸
  • Conducted research to develop my MS thesis, winner of the Best Poster Award at CoNEXT 2020.
 
 
 
 
 
AGH Akademia Górniczo-Hutnicza
Erasmus+
Sep 2019 – Feb 2020 Krakow, Poland 🇵🇱
  • Completed the following courses: Advanced Python Programming | Computer Vision | Cybersecurity and Cryptography | Programming in Javascript | Mobile App Development

Honors and Awards

  • [Sep. 2024] Outstanding Reviewer Award @ ECCV 2024

  • [Jul. 2022] ICVSS 2022 Reading Group Competition Award sponsored by Amazon Web Services (AWS) @ ICVSS 2022 for participation in the reading group led by Prof. Dr. Stefano Soatto (leading science for AI Applications at AWS and Professor at the University of California Los Angeles)

Publications

Please see my Google Scholar for the complete publication list.
Quickly discover relevant content by filtering publications.

Professional Activities

Reviewer

  • International Conference on Computer Vision and Pattern Recognition (CVPR)
  • International Conference on Computer Vision (ICCV)
  • European Conference on Computer Vision (ECCV)
  • Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
  • Association for the Advancement of Artificial Intelligence (AAAI)
  • IEEE Transactions on Multimedia (TMM)
  • Pattern Recognition Letters (PRL)
  • ACM Multimedia (ACMMM)
  • International Conference on Pattern Recognition (ICPR)

Contact