Call for Papers
About the Journal
Editorial Board
Publication Ethics
Instructions for Authors
Announcements
Current Issue
Back Issues
Search for Articles
Categories
Back Issues
 

JCSE, vol. 8, no. 3, pp.137-148, September, 2014

DOI: http://dx.doi.org/10.5626/JCSE.2014.8.3.137

Classifying Articles in Chinese Wikipedia with Fine-Grained Named Entity Type

Jie Zhou, Bicheng Li, and Yongwang Tang
Zhengzhou Information Science and Technology Institute, Zhengzhou, China

Abstract: Named entity classification of Wikipedia articles is a fundamental research area that can be used to automatically build large-scale corpora of named entity recognition or to support other entity processing, such as entity linking, as auxiliary tasks. This paper describes a method of classifying named entities in Chinese Wikipedia with fine-grained types. We considered multi-faceted information in Chinese Wikipedia to construct four feature sets, designed different feature selection methods for each feature, and fused different features with a vector space using different strategies. Experimental results show that the explored feature sets and their combination can effectively improve the performance of named entity classification.

Keyword: Named entity classification; Chinese Wikipedia; Fine-grained; Feature selection; NER corpora

Full Paper:   246 Downloads, 2104 View

 
 
ⓒ Copyright 2010 KIISE – All Rights Reserved.    
Korean Institute of Information Scientists and Engineers (KIISE)   #401 Meorijae Bldg., 984-1 Bangbae 3-dong, Seo-cho-gu, Seoul 137-849, Korea
Phone: +82-2-588-9240    Fax: +82-2-521-1352    Homepage: http://jcse.kiise.org    Email: office@kiise.org