Author
Contributions by role
Author 1
Bin Wang
School of Computer Science and Engineering, Northeastern University, Shenyang 110169, China
Summary
Bin Wang received the Ph.D. degree in computer science and technology from Northeastern University, Shenyang, China, in 2008. He is currently a Professor with the School of Computer Science and Engineering, Northeastern University. His research interests include big data management and knowledge engineering, database theory and technology, cloud computing, and data privacy preserving. More details about his research can be found at http://faculty.neu.edu.cn/wangbin.
Edited Journals
IECE Contributions

Free Access | Review Article | 12 June 2024 | Cited: 2
Bridging Modalities: A Survey of Cross-Modal Image-Text Retrieval
Chinese Journal of Information Fusion | Volume 1, Issue 1: 79-92, 2024 | DOI:10.62762/CJIF.2024.361895
Abstract
The rapid advancement of Internet technology, driven by social media and e-commerce platforms, has facilitated the generation and sharing of multimodal data, leading to increased interest in efficient cross-modal retrieval systems. Cross-modal image-text retrieval, encompassing tasks such as image query text (IqT) retrieval and text query image (TqI) retrieval, plays a crucial role in semantic searches across modalities. This paper presents a comprehensive survey of cross-modal image-text retrieval, addressing the limitations of previous studies that focused on single perspectives such as subspace learning or deep learning models. We categorize existing models into single-tower, dual-tower,... More >

Graphical Abstract
Bridging Modalities: A Survey of Cross-Modal Image-Text Retrieval