Call for Papers
About the Journal
Editorial Board
Publication Ethics
Instructions for Authors
Announcements
Current Issue
Back Issues
Search for Articles
Categories
Search for Articles
 

JCSE, vol. 8, no. 3, pp.137-148, 2014

DOI: http://dx.doi.org/10.5626/JCSE.2014.8.3.137

Classifying Articles in Chinese Wikipedia with Fine-Grained Named Entity Type

Jie Zhou, Bicheng Li, and Yongwang Tang
Zhengzhou Information Science and Technology Institute, Zhengzhou, China

Abstract: Named entity classification of Wikipedia articles is a fundamental research area that can be used to automatically build large-scale corpora of named entity recognition or to support other entity processing, such as entity linking, as auxiliary tasks. This paper describes a method of classifying named entities in Chinese Wikipedia with fine-grained types. We considered multi-faceted information in Chinese Wikipedia to construct four feature sets, designed different feature selection methods for each feature, and fused different features with a vector space using different strategies. Experimental results show that the explored feature sets and their combination can effectively improve the performance of named entity classification.

Keyword: Named entity classification; Chinese Wikipedia; Fine-grained; Feature selection; NER corpora

Full Paper:   247 Downloads, 2196 View

 
 
ⓒ Copyright 2010 KIISE – All Rights Reserved.    
Korean Institute of Information Scientists and Engineers (KIISE)   #401 Meorijae Bldg., 984-1 Bangbae 3-dong, Seo-cho-gu, Seoul 137-849, Korea
Phone: +82-2-588-9240    Fax: +82-2-521-1352    Homepage: http://jcse.kiise.org    Email: office@kiise.org