The framework of the multimodal bi-direction guided attention
Frontiers STGATE: Spatial-temporal graph attention network with
Bidirectional LSTM with self-attention mechanism and multi-channel
Visual representation of the bidirectional LSTM and the self
See, hear, read: Leveraging multimodality with guided attention
Example of resultant images for given text description from
Multimodal Bi-direction Guided Attention Networks for Visual
Applied Sciences, Free Full-Text
Multimodal Bi-direction Guided Attention Networks for Visual
Sensors, Free Full-Text
Two VQA examples: Both the position feature and image feature are
Examples of images generated by proposed method for given textual
Frontiers A Multimodal Affinity Fusion Network for Predicting
Qualitative comparison of depth completion results on KITTI