Mastering Transformers: The Journey from BERT to Large Language Models and Stable Diffusion

Küçük Resim Yok

Tarih

2024

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

De Gruyter

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

Transformer-based language models such as BERT, T5, GPT, DALL-E, and ChatGPT have dominated NLP studies and become a new paradigm. Thanks to their accurate and fast fine-tuning capabilities, transformer-based language models have been able to outperform traditional machine learning-based approaches for many challenging natural language understanding (NLU) problems. Aside from NLP, a fast-growing area in multimodal learning and generative AI has recently been established, showing promising results. Mastering Transformers will help you understand and implement multimodal solutions, including text-to-image. Computer vision solutions that are based on transformers are also explained in the book. You’ll get started by understanding various transformer models before learning how to train different autoregressive language models such as GPT and XLNet. The book will also get you up to speed with boosting model performance, as well as tracking model training using the TensorBoard toolkit. In the later chapters, you’ll focus on using vision transformers to solve computer vision problems. Finally, you’ll discover how to harness the power of transformers to model time series data and for predicting. By the end of this transformers book, you’ll have an understanding of transformer models and how to use them to solve challenges in NLP and CV. © 2024 Packt Publishing.

Açıklama

Anahtar Kelimeler

Computational Linguistics, Computer Vision, Learning Systems, Machine Learning, Natural Language Processing Systems, Personnel Training, Power Transformers, Auto-Regressive, Fine Tuning, Language Model, Learning-Based Approach, Machine-Learning, Multi-Modal, Multi-Modal Learning, Natural Language Understanding, Transformer Modeling, Tuning Capability, Diffusion

Kaynak

Mastering Transformers: The Journey from BERT to Large Language Models and Stable Diffusion

WoS Q Değeri

Scopus Q Değeri

N/A

Cilt

Sayı

Künye