Multimodal transformers meet collaborative filtering

Leveraging hierarchical cross-attention transformers for recommender system (2021)