Publications

* denotes equal contribution

An up-to-date list is available on Google Scholar.

2025

  1. ICLR
    Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion
    M. Mistretta*A. Baldrati*, L. Agnolucci*, M. Bertini, and A. Bagdanov
    In The Thirteenth International Conference on Learning Representations, 2025

2024

  1. ECCV
    Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation
    M. Mistretta*A. Baldrati*, M. Bertini, and A. Bagdanov
    In European Conference on Computer Vision, 2024
  2. arXiv
    iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval
    L. Agnolucci*A. Baldrati*, M. Bertini, and A. Del Bimbo
    2024
  3. arXiv
    Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing
    A. Baldrati*, D. Morelli*, M. Cornia, M. Bertini, and R. Cucchiara
    2024

2023

  1. ACM TOMM
    Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features
    A. Baldrati, M. Bertini, T. Uricchio, and A. Del Bimbo
    ACM Transactions on Multimedia Computing, Communications and Applications, 2023
  2. ACM MM
    LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On
    D. Morelli*A. Baldrati*, G. Cartella, M. Cornia, M. Bertini, and R. Cucchiara
    In Proceedings of the ACM International Conference on Multimedia, 2023
  3. ICCV
    Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing
    A. Baldrati*, D. Morelli*, G. Cartella, M. Cornia, M. Bertini, and R. Cucchiara
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Oct 2023
  4. ICCV
    Zero-Shot Composed Image Retrieval with Textual Inversion
    A. Baldrati*, L. Agnolucci*, M. Bertini, and A. Del Bimbo
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Oct 2023
  5. ICCV Workshop
    Mapping Memes to Words for Multimodal Hateful Meme Classification
    G. Burbi*A. Baldrati*, L. Agnolucci, M. Bertini, and A. Del Bimbo
    In Proceedings of the IEEE/CVF International Conference on Computer Vision, Oct 2023
  6. ICCV Workshop
    ECO: Ensembling Context Optimization for Vision-Language Models
    L. Agnolucci*A. Baldrati*, F. Todino, F. Becattini, M. Bertini, and A. Del Bimbo
    In Proceedings of the IEEE/CVF International Conference on Computer Vision, Oct 2023
  7. ICIAP
    OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data
    G. Cartella, A. Baldrati, D. Morelli, M. Cornia, M. Bertini, and R. Cucchiara
    In International Conference on Image Analysis and Processing, Oct 2023

2022

  1. CVPR Workshop
    Conditioned and composed image retrieval combining and partially fine-tuning clip-based features
    A. Baldrati, M. Bertini, T. Uricchio, and A. Del Bimbo
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Oct 2022
  2. CVPR Demo
    Effective conditioned and composed image retrieval combining clip-based features
    A. Baldrati, M. Bertini, T. Uricchio, and A. Del Bimbo
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Oct 2022

2021

  1. ACM MM Asia
    Conditioned image retrieval for fashion using contrastive learning and CLIP-based features
    A. Baldrati, M. Bertini, T. Uricchio, and A. Del Bimbo
    In ACM Multimedia Asia, Oct 2021