selected publications academic article Vision-Text Cross-Modal Fusion for Accurate Video Captioning. IEEE Access. 11:115477-115492. 2023 conference paper Conditional Cross Correlation Network for Video Question Answering. . 25-32. 2023 A Deep Learning-Based Approach for Camera Motion Classification. . 1-6. 2021