Mastering Transformers: The Journey from BERT to Large Language Models and Stable Diffusion

dc.contributor.author: Yıldırım, Savaş
dc.contributor.author: Asgari-Chenaghlu, Meysam
dc.date.accessioned: 2026-04-04T18:48:45Z
dc.date.available: 2026-04-04T18:48:45Z
dc.date.issued: 2024
dc.description.abstract: Transformer-based language models such as BERT, T5, GPT, DALL-E, and ChatGPT have dominated NLP studies and become a new paradigm. Thanks to their accurate and fast fine-tuning capabilities, transformer-based language models have been able to outperform traditional machine learning-based approaches on many challenging natural language understanding (NLU) problems. Beyond NLP, multimodal learning and generative AI have recently emerged as fast-growing areas with promising results. Mastering Transformers will help you understand and implement multimodal solutions, including text-to-image generation. Computer vision solutions based on transformers are also explained in the book. You'll get started by understanding various transformer models before learning how to train different autoregressive language models such as GPT and XLNet. The book will also get you up to speed with boosting model performance, as well as tracking model training using the TensorBoard toolkit. In the later chapters, you'll focus on using vision transformers to solve computer vision problems. Finally, you'll discover how to harness the power of transformers to model time series data and make predictions. By the end of this transformers book, you'll have an understanding of transformer models and how to use them to solve challenges in NLP and CV. © 2024 Packt Publishing.
dc.identifier.endpage: 439
dc.identifier.isbn: 978-183763150-6
dc.identifier.isbn: 978-183763378-4
dc.identifier.scopus: 2-s2.0-105024291059
dc.identifier.scopusquality: N/A
dc.identifier.startpage: 1
dc.identifier.uri: https://hdl.handle.net/11411/10340
dc.indekslendigikaynak: Scopus
dc.language.iso: en
dc.publisher: Packt Publishing
dc.relation.ispartof: Mastering Transformers: The Journey from BERT to Large Language Models and Stable Diffusion
dc.relation.publicationcategory: Kitap - Uluslararası (Book - International)
dc.rights: info:eu-repo/semantics/closedAccess
dc.snmz: KA_Scopus_20260402
dc.subject: Computational Linguistics
dc.subject: Computer Vision
dc.subject: Learning Systems
dc.subject: Machine Learning
dc.subject: Natural Language Processing Systems
dc.subject: Personnel Training
dc.subject: Power Transformers
dc.subject: Auto-Regressive
dc.subject: Fine Tuning
dc.subject: Language Model
dc.subject: Learning-Based Approach
dc.subject: Machine-Learning
dc.subject: Multi-Modal
dc.subject: Multi-Modal Learning
dc.subject: Natural Language Understanding
dc.subject: Transformer Modeling
dc.subject: Tuning Capability
dc.subject: Diffusion
dc.title: Mastering Transformers: The Journey from BERT to Large Language Models and Stable Diffusion
dc.type: Book
