A Comparison of Different Approaches to Document Representation in Turkish Language
dc.contributor.author | Yıldırım, Savaş | |
dc.contributor.author | Yıldız, Tuğba | |
dc.date.accessioned | 2024-07-18T20:07:18Z | |
dc.date.available | 2024-07-18T20:07:18Z | |
dc.date.issued | 2018 | |
dc.department | İstanbul Bilgi Üniversitesi | en_US |
dc.description.abstract | Recently, deep learning methods have demonstrated state-of-the-art performance in numerous complex Natural Language Processing (NLP) problems. Easy accessibility of high-performance computing resources and open-source libraries makes Artificial Intelligence (AI) approaches more applicable for researchers. This sudden growth of available techniques has shaped and improved standards in the field of NLP. Thus, we find an opportunity to compare different approaches to document representation, owing to various open-source libraries and a large amount of research. We evaluate four different paradigms to represent documents: traditional bag-of-words approaches, topic modeling, embedding-based approaches, and deep learning. As the main contribution of this article, we aim at evaluating all these representation approaches with suitable machine learning algorithms for the document categorization problem in the Turkish language. The supervised architecture uses a benchmark dataset specifically prepared for this language. Within the architecture, we evaluate the representation approaches with corresponding machine learning algorithms such as Support Vector Machine (SVM), multinomial Naive Bayes (m-NB), and so forth. We conduct a variety of experiments and present successful results for Turkish document categorization. We also observed that traditional approaches still achieve results comparable to Neural Network models in terms of document classification. | en_US |
dc.identifier.endpage | 576 | en_US |
dc.identifier.issn | 1300-7688 | |
dc.identifier.issn | 1308-6529 | |
dc.identifier.issue | 2 | en_US |
dc.identifier.startpage | 569 | en_US |
dc.identifier.trdizinid | 323160 | en_US |
dc.identifier.uri | https://search.trdizin.gov.tr/yayin/detay/323160 | |
dc.identifier.uri | https://hdl.handle.net/11411/5892 | |
dc.identifier.volume | 22 | en_US |
dc.indekslendigikaynak | TR-Dizin | en_US |
dc.language.iso | en | en_US |
dc.relation.ispartof | Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi | en_US |
dc.relation.publicationcategory | Article - National Refereed Journal - Institutional Faculty Member | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.title | A Comparison of Different Approaches to Document Representation in Turkish Language | en_US |
dc.type | Article | en_US |