Xinyu Zhang, Lingling Zhang, Xin Hu, Jun Liu, Shaowei Wang, Qianying Wang
As a knowledge carrier, the diagram is widely distributed in many aspects of human life, such as textbooks, architectural drawings, and documents. Different from natural images, representations of visual elements in the diagram are sparser, and similar visual representations can reflect dissimilar semantics. Thus, current methods fail to capture the visual elements with precise semantics. To address this issue, regarding the aligned visual and textual elements as pairs is the way to assign the precise semantics of textual elements to visual elements...
March 13, 2024: IEEE Transactions on Image Processing: a Publication of the IEEE Signal Processing Society