CORC  > 清华大学
基于凸包像素比特征的粘连汉字切分
魏湘辉 ; 马少平 ; WEI Xiang-hui ; MA Shao-ping
2010-06-09 ; 2010-06-09
关键词人工智能 模式识别 粘连汉字 汉字切分 背景细化 凸包 artificial intelligence pattern recognition touching chinese character segmentation of chinese character background-thinning convex hull TP391.41
其他题名Segmentation of Touching Chinese Character Based on Convex Hull Ratio Feature
中文摘要汉字切分正确与否直接影响了汉字识别系统的识别率 ,粘连汉字则是切分中的难点。本文将基于背景细化的切分方法应用于《四库全书》的两字符粘连汉字数据集 ,并针对其中切分路径选择问题 ,提出了一种新特征 -凸包像素比 ,反映了在不同切分路径下汉字结构变化的特性。实验结果表明该特征对多种分类器均能有效地提高切分路径选择的正确率。其中在使用基于高斯混合模型分类器时取得了 88 6 %正确率。; Accuracy of segmenting Chinese characters, especially touching characters, is essential for performance of a Chinese characters recognition system. The paper applied a background-thinning algorithm to segment two-touching Chinesecharacters that come from the dataset of four vaults. A new feature called convex hull ratio was proposed for selection of the best segmentation path, as this feature exploits the property on the balance of Chinese characters' structure. The experimental results show that segmentation accuracy improved consistently using the new feature when three different classifiers were experimented. And gaussian mixture model achieves the accuracy of 88.6%.; 自然科学基金资助项目 (6 0 2 2 30 0 4 ); 国家 86 3高科技资助项目 (2 0 0 1AA114 0 82 )
语种中文 ; 中文
内容类型期刊论文
源URL[http://hdl.handle.net/123456789/55532]  
专题清华大学
推荐引用方式
GB/T 7714
魏湘辉,马少平,WEI Xiang-hui,等. 基于凸包像素比特征的粘连汉字切分[J],2010, 2010.
APA 魏湘辉,马少平,WEI Xiang-hui,&MA Shao-ping.(2010).基于凸包像素比特征的粘连汉字切分..
MLA 魏湘辉,et al."基于凸包像素比特征的粘连汉字切分".(2010).
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace