Computer Science Faculty Publications

Segmenting Technical Drawing Figures in US Patents

Md Reshad Ul Hoque, Old Dominion UniversityFollow
Xin Wei, Old Dominion UniversityFollow
Muntabir Hasan Choudhury, Old Dominion UniversityFollow
Kehinde Ajayi, Old Dominion UniversityFollow
Martin Gryder, Old Dominion UniversityFollow
Jian Wu, Old Dominion UniversityFollow
Diane Oyen

Document Type

Conference Paper

Publication Date

2022

Publication Title

CEUR Workshop Proceedings- SDU 2022 Proceedings of the Workshop on Scientific Document Understanding

Volume

3164

Pages

1-6

Conference Name

SDU 2022: The AAAI-22 Workshop on Scientific Document Understanding, 1 March 2022, Virtual

Abstract

Image segmentation is the core computer vision problem for identifying objects within a scene. Segmentation is a challenging task because the prediction for each pixel label requires contextual information. Most recent research deals with the segmentation of natural images rather than drawings. However, there is very little research on sketched image segmentation. In this study, we introduce heuristic (point-shooting) and deep learning-based methods (U-Net, HR-Net, MedT, DETR) to segment technical drawings in US patent documents. Our proposed methods on the US Patent dataset achieved over 90% accuracy where transformer performs well with 97% segmentation accuracy, which is promising and computationally efficient. Our source codes and datasets are available at https://github.com/GoFigure-LANL/figure-segmentation.

Comments

Use permitted under a Creative Commons License Attribution 4.0 International (CC BY 4.0) License.

Conference landing page available at: http://ceur-ws.org/Vol-3164/

Original Publication Citation

Hoque, M. R. U., Wei, X., Choudhury, M. H., Ajayi, K., Gryder, M., Wu, J., & Oyen, D. (2022). CEUR Workshop Proceedings- SDU 2022 Proceedings of the Workshop on Scientific Document Understanding, 3164 (1-6). http://ceur-ws.org/Vol-3164/paper28.pdf

Repository Citation

ORCID

0000-0003-4055-2582 (Hoque), 0000-0002-9318-8844 (Choudhury), 0000-0002-5124-0739 (Ajayi), 0000-0003-0173-4463 (Wu)

Download

Included in

Computer Sciences Commons, Electrical and Computer Engineering Commons

COinS

Computer Science Faculty Publications

Segmenting Technical Drawing Figures in US Patents

Document Type

Publication Date

Publication Title

Volume

Pages

Conference Name

Abstract

Comments

Original Publication Citation

Repository Citation

ORCID

Included in

Search

Browse

Contribute

Links

Contact Us

Computer Science Faculty Publications

Segmenting Technical Drawing Figures in US Patents

Authors

Document Type

Publication Date

Publication Title

Volume

Pages

Conference Name

Abstract

Comments

Original Publication Citation

Repository Citation

ORCID

Included in

Share

Search

Browse

Contribute

Links

Contact Us