Published in Proceedings of the 2021 Workshop on Multi-Modal Pre-Training for Multimedia Understanding (MMPT '21), August 21, 2021, Taipei, Taiwan.