My name is Roberto Amoroso. I am an ELLIS PhD student enrolled in the International Doctorate in ICT program at the AImageLab research group of the University of Modena and Reggio Emilia 🇮🇹, under the supervision of Prof. Rita Cucchiara and Prof. Lorenzo Baraldi. I am engaged in studying and developing novel Deep Learning and Computer Vision techniques.
I am currently a Machine Learning Engineering Intern at NVIDIA in Munich, Germany 🇩🇪, working on Multimodal Video Understanding for Autonomous Vehicles.
I was a PhD Intern at LMU - Ludwig-Maximilians-Universität of Munich, in Germany 🇩🇪, focusing on Multimodal LLM for Video Question Answering and Open-vocabulary Image Segmentation, under the co-supervision of Prof. Volker Tresp.
Prior to joining AImageLab, I was Research Scholar at the Networking Research Group in Saint Louis, USA 🇺🇸, working on Super-resolution techniques applied to Internet traffic matrices.
My primary areas of research are Open-vocabulary Segmentation and Multimodal Video Understanding. In addition, I have also conducted research on the pre-training and optimization of Transformer-based architecture for image classification, self-supervised learning, deepfake detection of synthetic images, and the development of image watermarking systems for artwork protection.
Feel free to reach me out if you have any questions or curiosities! :)
ELLIS PhD in AI and Computer Vision, 2024
University of Modena and Reggio Emilia
MS in Artificial Intelligence, 2020
University of Modena and Reggio Emilia
BS in Computer Engineering, 2018
University of Modena and Reggio Emilia
HumanE-AI-NET
project, funded by the EU Framework Programme for Research and Innovation Horizon 2020
.