Abstract
Performance of NLP systems can only be as good as the lexical resources they employ. By modelling the evolved structure of language, there is scope for morpho-semantic enrichment of these resources. A set of linguistically-informed morphological rules is formulated from the CatVar database, implemented in a Java model of WordNet and tested on suffixation and desuffixation. Overgeneration and undergeneration are measured and an approach to improving these by using multilingual resources is proposed.
Original language | English |
---|---|
Title of host publication | Natural Language Processing and Cognitive Science - Proceedings of the 6th International Workshop on Natural Language Processing and Cognitive Science - NLPCS 2009 In Conjunction with ICEIS 2009 |
Pages | 36-45 |
Number of pages | 10 |
Publication status | Published - 1 Dec 2009 |
Event | 6th International Workshop on Natural Language Processing and Cognitive Science - NLPCS 2009 In Conjunction with ICEIS 2009 - Milan, United Kingdom Duration: 1 May 2009 → 1 May 2009 |
Conference
Conference | 6th International Workshop on Natural Language Processing and Cognitive Science - NLPCS 2009 In Conjunction with ICEIS 2009 |
---|---|
Country/Territory | United Kingdom |
City | Milan |
Period | 1/05/09 → 1/05/09 |