Roberto Amoroso

Roberto Amoroso

ELLIS PhD | AI & Computer Vision
International Doctorate in ICT

NVIDIA

LMU Munich

AImageLab UNIMORE

About me

Ciao! I am Roberto Amoroso, a Research Engineer at NVIDIA in Munich, Germany 🇩🇪, working on Multimodal Video Understanding for Autonomous Vehicles. I enjoy designing and implementing novel Deep Learning and Computer Vision techniques.

I completed my PhD through the ELLIS program and the International Doctorate in ICT at the AImageLab research group of the University of Modena and Reggio Emilia (UNIMORE) 🇮🇹, under the supervision of Prof. Rita Cucchiara and Prof. Lorenzo Baraldi.

During my PhD, I also completed a PhD internship at LMU - Ludwig-Maximilians-Universität of Munich, in Germany 🇩🇪, focusing on Multimodal LLM for Video Question Answering and Open-vocabulary Segmentation, under the co-supervision of Prof. Volker Tresp.

I was also a Research Scholar at the Networking Research Group in Saint Louis, USA 🇺🇸, working on Super-resolution techniques applied to Internet traffic matrices.

My primary areas of research are Multimodal Video Understanding and Open-vocabulary Segmentation. In addition, I have also conducted research on the pre-training and optimization of Transformer-based architecture for image classification, self-supervised learning, deepfake detection of synthetic images, and the development of image watermarking systems.

Feel free to reach me out if you have any questions or curiosities! :)

Interests
  • Computer Vision
  • Deep Learning
  • Machine Learning
  • Multimodal Video Understanding
  • Open-vocabulary Segmentation
Education
  • ELLIS PhD in AI and Computer Vision, 2024

    UNIMORE, Italy 🇮🇹 | LMU, Germany 🇩🇪 | NVIDIA, Germany 🇩🇪

  • MS in Artificial Intelligence, 2020

    UNIMORE, Italy 🇮🇹 | AGH, Poland 🇵🇱 | Saint Louis University, USA 🇺🇸

  • BS in Computer Engineering, 2018

    UNIMORE, Italy 🇮🇹

Recent News

All news »

  • [Oct. 2024] Our paper “Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries” has been accepted @ WACV 2025

  • [Oct. 2024] Our paper “Video Search: A Large-scale Video-Text Retrieval System for AV” has been accepted @ NTECH 2024 NVIDIA conference

  • [Feb. 2024] Started a new position as Machine Learning Engineering Intern @ NVIDIA in Munich, Germany 🇩🇪, working on Multimodal Video Understanding for Autonomous Vehicles

  • [Sep. 2023] I attended the ELLIS Doctoral Symposium 2023 (EDS2023) in Helsinki, Finland 🇫🇮

  • [Jun. 2023] Started my PhD Internship @ LMU in Munich, Germany 🇩🇪, working on Multimodal Video Perception under the co-supervision of Prof. Volker Tresp

  • [Mar. 2022] I gave a short course about “AI & HPC for Industries” on AI, computer vision, and HPC organized by CINECA and Leonardo AI Labs as part of the EuroCC Italy project

  • [Feb. 2021] I joined the AImageLab group at UNIMORE in Italy 🇮🇹 as a Research Fellow

All news »

Experience

 
 
 
 
 
NVIDIA
Machine Learning Engineering Intern
Jan 2024 – Present Munich, Germany 🇩🇪
Research activity focused on the engineering, development, and deployment of Multimodal Video Understanding techniques for Autonomous Vehicles.
 
 
 
 
 
LMU @ Ludwig-Maximilians-Universität of Munich
PhD Intern
Jun 2023 – Nov 2023 Munich, Germany 🇩🇪
Research activity focused on the development of novel Multimodal LLM for Video Question Answering and Open-vocabulary Image Segmentation techniques, under the co-supervision of Prof. Volker Tresp.
 
 
 
 
 
AImageLab @ University of Modena and Reggio Emilia
ELLIS PhD Student | International Doctorate in ICT
Nov 2021 – Oct 2024 Modena, Italy 🇮🇹
  • The European Laboratory for Learning and Intelligent Systems (ELLIS) supports cutting-edge machine learning research in Europe. ELLIS PhD students (<5% 2021 acceptance rate) are selected on the basis of academic achievement. My research activity is focused on multimodal machine learning, image segmentation, image classification, self-supervised learning, video retrieval, and video question answering.
  • 1st in the ranking of student candidates for the International Doctorate in ICT.
 
 
 
 
 
AImageLab @ University of Modena and Reggio Emilia
Research Fellow
Feb 2021 – Nov 2021 Modena, Italy 🇮🇹
Research activity under the supervision of Prof. Rita Cucchiara and Prof. Lorenzo Baraldi, aimed at the study, analysis, and development of novel Computer Vision and Deep Learning techniques.
 
 
 
 
 
CINI - Consorzio Interuniversitario Nazionale per l’Informatica
Research Engineer
Nov 2020 – Jan 2021 Modena, Italy 🇮🇹
Development of a web platform for the management of data concerning the activities of European research centers, as part of the HumanE-AI-NET project, funded by the EU Framework Programme for Research and Innovation Horizon 2020.
 
 
 
 
 
Saint Louis University
Research Scholar
Mar 2020 – Sep 2020 St. Louis, USA 🇺🇸
  • Conducted research to develop my MS thesis, winner of the Best Poster Award at CoNEXT 2020.
 
 
 
 
 
AGH Akademia Górniczo-Hutnicza
Erasmus+
Sep 2019 – Feb 2020 Krakow, Poland 🇵🇱
  • Completed the following courses: Advanced Python Programming | Computer Vision | Cybersecurity and Cryptography | Programming in Javascript | Mobile App Development

Honors and Awards

  • [Sep. 2024] Outstanding Reviewer Award @ ECCV 2024

  • [Jul. 2022] ICVSS 2022 Reading Group Competition Award sponsored by Amazon Web Services (AWS) @ ICVSS 2022 for participation in the reading group led by Prof. Dr. Stefano Soatto (leading science for AI Applications at AWS and Professor at the University of California Los Angeles)

Publications

Please see my Google Scholar for the complete publication list.
Quickly discover relevant content by filtering publications.

Professional Activities

Reviewer

  • International Conference on Computer Vision and Pattern Recognition (CVPR)
  • International Conference on Computer Vision (ICCV)
  • European Conference on Computer Vision (ECCV)
  • Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
  • Association for the Advancement of Artificial Intelligence (AAAI)
  • IEEE Transactions on Multimedia (TMM)
  • Pattern Recognition Letters (PRL)
  • ACM Multimedia (ACMMM)
  • International Conference on Pattern Recognition (ICPR)

Contact