• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Wang, Y. (Wang, Y..) [1] | Zhuo, Y. (Zhuo, Y..) [2] | Wu, Y. (Wu, Y..) [3] | Chen, M. (Chen, M..) [4]

Indexed by:

Scopus PKU CSCD

Abstract:

Many fragmentation information is highly dispersed in different data sources, such as text, image, video and Web. They are characterized by structural disorder and content one-sided. Current researches implement the extraction, expression and understanding of multi-modal fragmentation information by constructing visual question answering (VQA) system. The VQA task is required to provide the correct answer to a given problem with a corresponding image. The aim of this paper is to design a complete framework and algorithm for image fragmentation information question answering under the basic background of visual question answering task. The main research includes image feature extraction, question text feature extraction, multi-modal feature fusion and answer reasoning. Deep neural network is constructed to extract features for representing images and problem information. Attention mechanism and variational inference method are combined to fusion two modal features of image and problem and reason answers. Experiment results show that the model can effectively extract and understand multi-modal fragmentation information, and improve the accuracy of VQA. © 2018, Science Press. All right reserved.

Keyword:

Artificial intelligence; Deep learning; Fragmented information; Neural network; Visual question answering (VQA)

Community:

  • [ 1 ] [Wang, Y.]College of Mathematics and Computer Science, Fuzhou University, Fuzhou, 350108, China
  • [ 2 ] [Zhuo, Y.]College of Mathematics and Computer Science, Fuzhou University, Fuzhou, 350108, China
  • [ 3 ] [Wu, Y.]College of Mathematics and Computer Science, Fuzhou University, Fuzhou, 350108, China
  • [ 4 ] [Chen, M.]College of Mathematics and Computer Science, Fuzhou University, Fuzhou, 350108, China

Reprint 's Address:

  • [Wu, Y.]College of Mathematics and Computer Science, Fuzhou UniversityChina

Show more details

Related Keywords:

Related Article:

Source :

Computer Research and Development

ISSN: 1000-1239

Year: 2018

Issue: 12

Volume: 55

Page: 2600-2610

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count: 5

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 0

Affiliated Colleges:

Online/Total:65/10070992
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1