Abstract
People perceive the world through multi-modal (visual, auditory, tactile, verbal, etc.) information interaction. User behaviors and decisions in e-commerce are also the result of multi-modal (visual image, audio introduction, text content) information interaction. Taking multi-modal graphic and textual understanding as an example, different from the general field, multi-modal understanding in the e-commerce field requires more fine-grained semantic understanding, such as pattern, fabric, style, etc. This sharing will focus on the research and development of multi-modality in the field of e-commerce, and the application of multi-modality in search and promotion system.
About the Speakers
Dr. GAO Dehong received his bachelor's and master's degrees from Northwestern Polytechnical University. He earned a PhD degree in natural language processing from the School of Engineering, Hong Kong Polytechnic University. In 2014, he joined Alibaba and was responsible for algorithms in the field of search/recommendation/advertising. He has accumulated more than 10 years of practical experience in related research fields. In particular, a number of technologies in multi-modal and semantic search have been used by Alibaba and the industry. His main research interests are on search promotion, multi-modal pre-training (including graph, multilingualism, knowledge, etc.), federated learning, etc. He has published more than 20 papers in top international conferences and is a reviewer of top international conferences in a number of areas.