Abstract
We describe a system, UNN-WePS for identifying individuals from web pages us- ing data from Semeval Task 13. Our sys- tem is based on using co-presence of per- son names to form seed clusters. These are then extended with pages that are deemed conceptually similar based on a lexical chaining analysis computed using Roget’s thesaurus. Finally, a single link hierarchical agglomerative clustering algorithm merges the enhanced clusters for individual entity recognition.
Original language | English |
---|---|
Publication status | Published - 23 Jun 2007 |
Event | SemEval 2007: 4th International Workshop on Semantic Evaluations - Prague, Czech Republic Duration: 23 Jun 2007 → … |
Workshop
Workshop | SemEval 2007: 4th International Workshop on Semantic Evaluations |
---|---|
Period | 23/06/07 → … |