Multifractal complexity analysis-based dynamic media text categorization models by natural language processing with BERT

dc.authorscopusid56585856100
dc.authorscopusid35786830100
dc.authorscopusid36544650700
dc.authorscopusid57219483972
dc.contributor.authorKaraca, Y.
dc.contributor.authorZhang, Y.-D.
dc.contributor.authorWang, S.-H.
dc.contributor.authorDursun, A.D.
dc.date.accessioned2024-07-18T20:16:49Z
dc.date.available2024-07-18T20:16:49Z
dc.date.issued2022
dc.description.abstractFractals, being essentially mathematical constructs, are forms that embody the fundamental features of dynamism, self-organization, self-similarity and complexity. The lexical items and parts of sentences are comprehended as the constituents of schemata with a particular pattern made up of interacting elements. Among the most well-known means used to detect and analyze self-repeating patterns are multifractal methods which have numerous applications in many areas including computational linguistics. The predominance of properties like self-similarity, irregularity and vagueness in texts add more to the challenge of clear and accurate meaning conveyance. The ever-increasing amount of text data in different categories also contribute to the inherent complexity due to having properties like being unstructured, noisy and nonstandard. To address this challenge and complexity, this study has aimed at ensuring regularity and self-similarity within the digital-based complex media texts, which comprise the dataset, by multifractal methods (multifractal Bayesian, multifractal regularization and multifractal wavelet shrinkage) and attaining accurate classification and categorization of the words within texts in the dataset by Bidirectional Encoder Representations from Transformers (BERT), as the Natural Language Processing (NLP) method. The related steps of our integrative proposed method are as follows: firstly, regularity enhancement was attained by applying the multifractal methods (multifractal Bayesian, multifractal regularization and multifractal wavelet shrinkage) to the text dataset. Thus, the new datasets were generated, respectively, by obtaining the significant, self-similar and regular attributes. Subsequently, BERT, as the NLP method, was employed to the text dataset as well as to the three new datasets obtained for the classification purposes. In this way, accurate word detection within the text for the category classification was ensured for the analyses. The analysis results for the text dataset and the new datasets were compared by BERT and the most optimal result could be achieved by multifractal Bayesian method. Through this integrated scheme, we have enunciated the significance of the behavioral patterns of fractal while setting forth the distinctive quality of BERT owing to its capability of classification accuracy and adaptiveness into integrated methodologies. © 2022 Elsevier Inc. All rights reserved.en_US
dc.identifier.doi10.1016/B978-0-323-90032-4.00012-2
dc.identifier.endpage115en_US
dc.identifier.isbn9780323900324
dc.identifier.isbn9780323886161
dc.identifier.scopus2-s2.0-85137904790en_US
dc.identifier.scopusqualityN/Aen_US
dc.identifier.startpage95en_US
dc.identifier.urihttps://doi.org/10.1016/B978-0-323-90032-4.00012-2
dc.identifier.urihttps://hdl.handle.net/11411/6281
dc.indekslendigikaynakScopusen_US
dc.language.isoenen_US
dc.publisherElsevieren_US
dc.relation.ispartofMulti-Chaos, Fractal and Multi-Fractional Artificial Intelligence of Different Complex Systemsen_US
dc.relation.publicationcategoryKitap Bölümü - Uluslararasıen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectAutomatic And Effective Classificationen_US
dc.subjectBerten_US
dc.subjectComplex Textual Analysis Of Lexical İtemsen_US
dc.subjectComplexityen_US
dc.subjectFractalsen_US
dc.subjectHidden Patternsen_US
dc.subjectHölder Regularityen_US
dc.subjectMulti-Fractal Complexityen_US
dc.subjectMultifractal Analysisen_US
dc.subjectSelf-Similarity And İrregularityen_US
dc.titleMultifractal complexity analysis-based dynamic media text categorization models by natural language processing with BERT
dc.typeBook Chapter

Dosyalar