Accelerate Large Model Training using DeepSpeed
In this post we will look at how we can leverage the Accelerate library for training large models which enables users to leverage the ZeRO features of DeeSpeed.
Read moreDeep Learning, NLP, NMT, AI, ML
In this post we will look at how we can leverage the Accelerate library for training large models which enables users to leverage the ZeRO features of DeeSpeed.
Read more⚠️ A new updated version of this article is available here 👉 https://huggingface.co/deep-rl-course/unit1/introduction This article is part of the Deep Reinforcement Learning Class. A free course from beginner to expert. Check the syllabus here. ⚠️ A new updated version of this article is available here 👉 https://huggingface.co/deep-rl-course/unit1/introduction This article is part of the Deep Reinforcement Learning
Read moreSentiment analysis is the automatic process of classifying text data according to their polarity, such as positive, negative and neutral. Companies leverage sentiment analysis of tweets to get a sense of how customers are talking about their products and services, get insights to drive business decisions, and identify product issues and potential PR crises early on. In this guide, we will cover
Read moreLarge language models (LLMs) have made a significant impact on AI research. These powerful, general models can take on a wide variety of new language tasks from a user’s instructions. However, academia, nonprofits and smaller companies’ research labs find it difficult to create, study, or even use LLMs as only a few industrial labs with the necessary resources
Read moreA short while ago I published a playlist generator that I’d built using Sentence Transformers and Gradio, and I followed that up with a reflection on how I try to use my projects as effective learning experiences. But how did I actually build the playlist generator? In this post we’ll break down that project and look at two technical details:
Read moreIn recent years, training ever larger language models has become the norm. While the issues of those models’ not being released for further study is frequently discussed, the hidden knowledge about how to train such models rarely gets any attention. This article aims to change this by shedding some light on the technology and engineering behind training such models both in terms
Read moreWhat you will learn here đź’ˇthe basic idea of dynamic adversarial data collection and why it is important. âš’ how to collect adversarial data dynamically and train your model on them – using an MNIST handwritten digit recognition task as an
Read more⚠️ A new updated version of this article is available here 👉 https://huggingface.co/deep-rl-course/unit1/introduction This article is part of the Deep Reinforcement Learning Class. A free course from beginner to expert. Check the syllabus here. ⚠️ A new updated version of this article is available here 👉 https://huggingface.co/deep-rl-course/unit1/introduction This article is part of the Deep Reinforcement Learning Class.
Read moreIn the past few months, the Hugging Face team and external contributors added a variety of vision models in TensorFlow to Transformers. This list is growing comprehensively and already includes state-of-the-art pre-trained models like Vision Transformer, Masked Autoencoders, RegNet, ConvNeXt, and many others! When it comes to deploying TensorFlow models,
Read more