A Comparison of Different Approaches to Document Representation in Turkish Language

dc.contributor.author: Yıldırım, Savaş
dc.contributor.author: Yıldız, Tuğba
dc.date.accessioned: 2024-07-18T20:07:18Z
dc.date.available: 2024-07-18T20:07:18Z
dc.date.issued: 2018
dc.department: İstanbul Bilgi Üniversitesi [en_US]
dc.description.abstract: Recently, deep learning methods have demonstrated state-of-the-art performance in numerous complex Natural Language Processing (NLP) problems. Easy accessibility of high-performance computing resources and open-source libraries makes Artificial Intelligence (AI) approaches more applicable for researchers. This sudden growth of available techniques has shaped and improved standards in the field of NLP. Thus, we find an opportunity to compare different approaches to document representation, owing to various open-source libraries and a large amount of research. We evaluate four different paradigms to represent documents: traditional bag-of-words approaches, topic modeling, embedding-based approaches, and deep learning. As the main contribution of this article, we aim at evaluating all these representation approaches with suitable machine learning algorithms for the document categorization problem in the Turkish language. The supervised architecture uses a benchmark dataset specifically prepared for this language. Within the architecture, we evaluate the representation approaches with corresponding machine learning algorithms such as Support Vector Machine (SVM), multinomial Naive Bayes (m-NB), and so forth. We conduct a variety of experiments and present successful results for Turkish document categorization. We also observe that traditional approaches still achieve results comparable to Neural Network models in terms of document classification. [en_US]
dc.identifier.endpage: 576 [en_US]
dc.identifier.issn: 1300-7688
dc.identifier.issn: 1308-6529
dc.identifier.issue: 2 [en_US]
dc.identifier.startpage: 569 [en_US]
dc.identifier.trdizinid: 323160 [en_US]
dc.identifier.uri: https://search.trdizin.gov.tr/yayin/detay/323160
dc.identifier.uri: https://hdl.handle.net/11411/5892
dc.identifier.volume: 22 [en_US]
dc.indekslendigikaynak: TR-Dizin [en_US]
dc.language.iso: en [en_US]
dc.relation.ispartof: Süleyman Demirel Üniversitesi Fen Bilimleri Enstitüsü Dergisi [en_US]
dc.relation.publicationcategory: Article - National Refereed Journal - Institutional Faculty Member [en_US]
dc.rights: info:eu-repo/semantics/openAccess [en_US]
dc.title: A Comparison of Different Approaches to Document Representation in Turkish Language [en_US]
dc.type: Article [en_US]
