Abstract: A scene sketch is composed of multiple foreground and background objects and can directly and concisely express complex semantic information. It has a wide range of practical applications and has gradually become a research hotspot in computer vision and human-computer interaction. As a fundamental task in the semantic understanding of scene sketches, scene sketch semantic segmentation has rarely been studied. Most existing methods are adapted from semantic segmentation of natural images and cannot overcome the sparsity and abstraction of sketches. To address these problems, this study proposes a graph Transformer model that operates directly on sketch strokes. The model combines the temporal and spatial information of strokes to solve the semantic segmentation task for free-hand scene sketches. First, the vector scene sketch is constructed as a graph, with strokes as the nodes and the temporal and spatial correlations between strokes as the edges. The temporal-spatial global context of the strokes is then captured by an edge-enhanced Transformer module. Finally, the encoded temporal-spatial features are optimized through multi-class learning. Experimental results on the SFSD scene sketch dataset show that the proposed method effectively segments scene sketches using stroke temporal-spatial information and achieves excellent performance.
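To make the graph-construction step concrete, the following is a minimal, illustrative sketch of how strokes might be turned into graph nodes with temporal and spatial edges. The stroke representation (point arrays), the centroid-based spatial criterion, and the `spatial_thresh` parameter are assumptions for illustration, not the paper's actual formulation.

```python
import numpy as np

def build_stroke_graph(strokes, spatial_thresh=0.2):
    """Build an adjacency matrix over strokes.

    Temporal edges link consecutively drawn strokes; spatial edges link
    strokes whose centroids are closer than `spatial_thresh`.
    (Illustrative only; the criteria here are assumptions.)
    """
    n = len(strokes)
    centers = np.array([s.mean(axis=0) for s in strokes])  # stroke centroids
    adj = np.zeros((n, n), dtype=int)
    for i in range(n):
        if i + 1 < n:  # temporal edge: drawing order
            adj[i, i + 1] = adj[i + 1, i] = 1
        for j in range(i + 1, n):  # spatial edge: nearby centroids
            if np.linalg.norm(centers[i] - centers[j]) < spatial_thresh:
                adj[i, j] = adj[j, i] = 1
    return adj

# Toy example: three strokes as point arrays in the unit square;
# the first two overlap spatially, the third is far away.
strokes = [
    np.array([[0.10, 0.10], [0.15, 0.12]]),
    np.array([[0.12, 0.11], [0.14, 0.13]]),
    np.array([[0.90, 0.90], [0.95, 0.92]]),
]
A = build_stroke_graph(strokes)
```

The resulting adjacency matrix would then serve as the edge structure consumed by a graph Transformer, with per-stroke features attached to the nodes.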