selected publications
academic article
- Vision-Text Cross-Modal Fusion for Accurate Video Captioning. IEEE Access. 11:115477-115492. 2023
- DEEP-AD: A Multimodal Temporal Video Segmentation Framework for Online Video Advertising. IEEE Access. 8:99582-99597. 2020
- DEEP-HEAR: A Multimodal Subtitle Positioning System Dedicated to Deaf and Hearing-Impaired People. IEEE Access. 7:88150-88162. 2019
- DEEP-SEE FACE: A Mobile Face Recognition System Dedicated to Visually Impaired People. IEEE Access. 6:51975-51985. 2018
- A computer vision-based perception system for visually impaired. Multimedia Tools and Applications. 76:11771-11807. 2016
- 3D Object Metamorphosis with Pseudo Metameshes. Advances in Electrical and Computer Engineering. 15:115-122. 2015
- Automatic Assistant for Better Mobility and Improved Cognition of Partially Sighted Persons. Advances in Electrical and Computer Engineering. 15:45-52. 2015
- Video Segmentation and Structuring for Indexing Applications. 2012
chapter
- Object Tracking Using Deep Convolutional Neural Networks and Visual Appearance Models. Lecture Notes in Computer Science. 114-125. 2017
- Automatic Segmentation of TV News into Stories Using Visual and Temporal Information. Lecture Notes in Computer Science. 648-660. 2016
- Using Computer Vision to See. Lecture Notes in Computer Science. 375-390. 2016
- Video Segmentation and Structuring for Indexing Applications. IGI Global eBooks. 205-225. 2013
- High Level Video Temporal Segmentation. Lecture Notes in Computer Science. 224-235. 2011
- Scene Change Detection with Temporally Constrained Clustering. ASME Press eBooks. 71-76. 2011
conference paper
- Conditional Cross Correlation Network for Video Question Answering. 25-32. 2023
- Automatic Alignment of Human Generated Transcripts to Speech Signals. 2022 E-Health and Bioengineering Conference (EHB). 1-4. 2022
- Audio-Video Fusion with Double Attention for Multimodal Emotion Recognition. 1-5. 2022
- Speech Emotion Recognition using GhostVLAD and Sentiment Metric Learning. 2021
- A Deep Learning-Based Approach for Camera Motion Classification. 1-6. 2021
- Transferring CT image biomarkers from fibrosing idiopathic interstitial pneumonia to COVID-19 analysis. Medical Imaging 2021: Computer-Aided Diagnosis. 2021
- Dynamic Subtitles: A Multimodal Video Accessibility Enhancement Dedicated to Deaf and Hearing Impaired Users. 2558-2566. 2019
- Face Recognition in Video Streams for Mobile Assistive Devices Dedicated to Visually Impaired. 137-142. 2018
- Single object tracking using offline trained deep regression networks. 1-6. 2017
- Seeing without sight: an automatic cognition system dedicated to blind and visually impaired people. HAL (Le Centre pour la Communication Scientifique Directe). 2017
- The Visual Object Tracking VOT2017 Challenge Results. 1949-1972. 2017
- Automatic extraction of story units from TV news. 2017 IEEE International Conference on Consumer Electronics (ICCE). 414-415. 2017
- TV News Retrieval Based on Story Segmentation and Concept Association. 327-334. 2016
- An Outdoor Cognition System Integrated on a Regular Smartphone Device. HAL (Le Centre pour la Communication Scientifique Directe). 2015
- An Obstacle Categorization System for Visually Impaired People. 147-154. 2015
- Efficient graph spanning structures for large database image retrieval. 594-598. 2015
- Real time static/dynamic obstacle detection for visually impaired persons. 2014 IEEE International Conference on Consumer Electronics (ICCE). 394-395. 2014
- Video Structuring: From Pixels to Visual Entities. HAL (Le Centre pour la Communication Scientifique Directe). 2012
- A complete framework for temporal video segmentation. 156-160. 2011
- Automatic Multilevel Temporal Video Structuring. 387-394. 2011
- A scale-space filtering-based shot detection algorithm. 000919-000923. 2010
- Determining Optimal Orbital Path of a Nanosatellite for Efficient Exploitation of the Solar Energy Captured. 128-133. 2009