CORC  > 北京大学  > 信息科学技术学院
Graphic Composite Segmentation for PDF Documents with Complex Layouts
Xu, Canhui ; Tang, Zhi ; Tao, Xin ; Shi, Cao
2013
关键词PDF converter graphic segmentation complex layouts image based document analysis
英文摘要Converting the PDF books to re-flowable format has recently attracted various interests in the area of e-book reading. Robust graphic segmentation is highly desired for increasing the practicability of PDF converters. To cope with various layouts, a multi-layer concept is introduced to segment graphic composites including photographic images, drawings with text insets or surrounded with text elements. Both image based analysis and inherent digital born document advantages are exploited in this multi-layer based layout analysis method. By combining low-level page elements clustering applied on PDF documents and connected component analysis on synthetically generated PNG image document, graphic composites can be segmented for PDF documents with complex layouts. The experimental results on graphic composite segmentation of PDF document pages have shown satisfactory performance.; http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000326744400013&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=8e1609b174ce4e31116a60747a720701 ; Optics; Imaging Science & Photographic Technology; EI; CPCI-S(ISTP); 3
语种英语
DOI标识10.1117/12.2003705
内容类型其他
源URL[http://ir.pku.edu.cn/handle/20.500.11897/321170]  
专题信息科学技术学院
推荐引用方式
GB/T 7714
Xu, Canhui,Tang, Zhi,Tao, Xin,et al. Graphic Composite Segmentation for PDF Documents with Complex Layouts. 2013-01-01.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace