Convert Transformers to ONNX with Hugging Face Optimum

There are currently three ways to convert your Hugging Face Transformers models to ONNX. In this section, you will learn how to export distilbert-base-uncased-finetuned-sst-2-english for text-classification using all three methods, going from the low-level torch API to the most user-friendly high-level API of optimum. Each method will do exactly the same. Export with torch.onnx (low-level): torch.onnx enables you to convert model checkpoints to an ONNX graph with its export method. But you have to provide […]


Announcing Evaluation on the Hub

TL;DR: Today we introduce Evaluation on the Hub, a new tool powered by AutoTrain that lets you evaluate any model on any dataset on the Hub without writing a single line of code! Evaluate all the models 🔥🔥🔥! Progress in AI has been nothing short of amazing, to the point where some people are now seriously debating whether AI models may be better than humans at certain tasks. However, that progress has not at all been even: to a machine […]


Policy Gradient with PyTorch

⚠️ An updated version of this article is available here 👉 https://huggingface.co/deep-rl-course/unit1/introduction This article is part of the Deep Reinforcement Learning Class, a free course from beginner to expert. Check the syllabus here. […]


Getting Started with Sentiment Analysis on Twitter

Sentiment analysis is the automatic process of classifying text data according to its polarity, such as positive, negative, or neutral. Companies leverage sentiment analysis of tweets to get a sense of how customers are talking about their products and services, get insights to drive business decisions, and identify product issues and potential PR crises early on. In this guide, we will cover […]


The Technology Behind BLOOM Training

In recent years, training ever larger language models has become the norm. While the issue of those models not being released for further study is frequently discussed, the hidden knowledge about how to train such models rarely gets any attention. This article aims to change this by shedding some light on the technology and engineering behind training such models both in terms […]
