A Hybrid Method for Extracting Turkish Part-Whole Relation Pairs from Corpus

Küçük Resim Yok

Tarih

2016

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

IEEE

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

Extraction of various semantic relation pairs from different sources (dictionary definitions, corpus etc.) with high accuracy is one of the most popular topics in natural language processing (NLP). In this study, a hybrid method is proposed to extract Turkish part-whole pairs from corpus. Corpus statistics, WordNet similarities and Word2Vec word vector similarities are used together in this study. Firstly, initial part-whole seeds are prepared and by using these seeds part-whole patterns are extracted from corpus. For each pattern, a reliability score is calculated and reliable patterns are selected to produce new pairs from corpus. Various reliability scores are used for new pairs. To measure success of method, 19 target whole words are selected and average 83% (first 10 pairs), 74% (first 20 pairs), 68% (first 30 pairs) precisions are obtained, respectively.

Açıklama

24th Signal Processing and Communication Application Conference (SIU) -- MAY 16-19, 2016 -- Zonguldak, TURKEY

Anahtar Kelimeler

Part-Whole Semantic Relationship, Pattern Based Method, Lexico-Syntactic Pattern, Semantic Dictionary, Natural Language Processing

Kaynak

2016 24th Signal Processing and Communication Application Conference (Siu)

WoS Q Değeri

N/A

Scopus Q Değeri

N/A

Cilt

Sayı

Künye