Abstract
In current organizations, valuable enterprise knowledge is often buried under rapidly expanding huge amount of unstructured information in the form of web pages, blogs, and other forms of human text communications. We present a novel unsupervised machine learning method called CORDER (COmmunity Relation Discovery by named Entity Recognition) to turn these unstructured data into structured information for knowledge management in these organizations. CORDER exploits named entity recognition and co-occurrence data to associate individuals in an organization with their expertise and associates. We discuss the problems associated with evaluating unsupervised learners and report our initial evaluation experiments in an expert evaluation, a quantitative benchmarking, and an application of CORDER in a social networking tool called BuddyFinder.
Original language | English |
---|---|
Pages (from-to) | 405-417 |
Number of pages | 13 |
Journal | Web Intelligence and Agent Systems |
Volume | 5 |
Issue number | 4 |
Publication status | Published - 2007 |
Keywords
- relation discovery
- named entity recognition
- clustering