ULMFiT for Genomic Sequence Data
This is an implementation of ULMFiT for genomics classification using Pytorch and Fastai. The model architecture used is based on the AWD-LSTM model, consisting of an embedding, three LSTM layers, and a final set of linear layers. The ULMFiT approach uses three training phases to produce a classification model: Train a language model on a large, unlabeled corpus Fine tune the language model on the classification corpus Use the fine tuned language model to initialize a classification model This method […]
Read more