A fast and easy implementation of the Transformer in PyTorch
FasySeq
FasySeq is shorthand for Fast and easy sequential modeling toolkit. It aims to provide researchers and developers with seq2seq models that can be trained efficiently and modified easily. The toolkit is currently based on the Transformer (Vaswani et al., 2017); more seq2seq models will be added in the future.
Dependencies
PyTorch >= 1.4
NLTK
Result
…
Structure
…
To Be Updated
- top-k and top-p sampling
- multi-GPU inference
- length penalty in beam search
- …
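Among the planned features above, top-k and top-p (nucleus) sampling can be sketched as follows. This is an illustrative PyTorch implementation, not the toolkit's own code; the function name `top_k_top_p_filtering` is hypothetical:

```python
import torch

def top_k_top_p_filtering(logits, top_k=0, top_p=1.0):
    """Mask logits outside the top-k / top-p (nucleus) set with -inf.

    logits: 1-D tensor of unnormalized scores over the vocabulary.
    top_k=0 and top_p=1.0 disable the respective filters.
    """
    logits = logits.clone()
    if top_k > 0:
        # Keep only the k highest-scoring tokens.
        kth_value = torch.topk(logits, top_k).values[-1]
        logits = logits.masked_fill(logits < kth_value, float("-inf"))
    if top_p < 1.0:
        sorted_logits, sorted_idx = torch.sort(logits, descending=True)
        cum_probs = torch.softmax(sorted_logits, dim=-1).cumsum(dim=-1)
        # Remove tokens whose cumulative probability exceeds top_p,
        # shifting by one so the most probable token is always kept.
        to_remove = cum_probs > top_p
        to_remove[1:] = to_remove[:-1].clone()
        to_remove[0] = False
        logits[sorted_idx[to_remove]] = float("-inf")
    return logits
```

A next token would then be drawn with `torch.multinomial(torch.softmax(filtered_logits, dim=-1), 1)`.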
Preprocess
Build Vocabulary
createVocab.py
Named Arguments | Description |
---|---|
-f/--file | The files used to build the vocabulary. Type: List |
--vocab_num | The maximum size of the vocabulary; words beyond this limit are discarded by frequency. Type: Int Default: -1 |
--min_freq | The minimum frequency of a token in the vocabulary. |