Skip to content

Latest commit

 

History

History
75 lines (53 loc) · 2.78 KB

README.md

File metadata and controls

75 lines (53 loc) · 2.78 KB

Sentiment Analysis with Variable Length sequences in pytorch

This repo contains the code for the this blog.
Our final aim is to build a simple GRU model with concat pooling [5]. For this post I will use Twitter Sentiment Analysis [1] dataset as this is a much easier dataset compared to the competition. Download dataset from [2]

This post focuses on how to implement sequence classification with variable lengths in pure pytorch.

UPDATE - START - May 2019

Added pytorch 1.0 compatible notebook. It uses pytorch 1.1 and ignite training functions. Also better use of pytorch Dataset and Dataloader. Code is more compact and easy to understand.

What is covered in Notebook Pytorch 1.0

  • Process Data through pytorch Dataset
    • Tokenize tweets
    • Build vocabulary
    • Vectorize tweets
  • Make batches through pytorch Dataloader
    • Pad tweets to the max length in the batch
  • Max Pooling and Average Pooling
    • RNN model (GRU) with concat pooling
  • Ignite training callbacks
    • Define ignite training loops
    • Add callback for epoch loss and accuracy
    • Add callback for ModelCheckpoint
    • Add callback for EarlyStopping

Added a new repository that contains REST API build in Flask to deploy ML models in production.

UPDATE - END

Prerequisites

  • Python 3.6
  • Basic knowledge of Pytorch 0.3.1
  • Understanding of GRU/LSTM [4]

What is covered in the notebook

  1. Load Data
  2. Simple Data Analysis
  3. Build Vocabulary and tokenize
  4. Unpadded dataset and dataloader
  5. Padded dataset and dataloader
  6. Simple GRU model
  7. GRU model with concat pooling
  8. Max Pooling and Average Pooling
  9. Train

Data Overview

Top 5 rows of dataset

Output from GRU and hidden state

GRU output

Concat Pooling model architecture [5]

GRU model with concat pooling

ToDo

  • Add function to make train/validation split
  • Use pretrained embedding
  • Use bidirectional GRU

References

[1] http://thinknook.com/twitter-sentiment-analysis-training-corpus-dataset-2012-09-22/
[2] http://thinknook.com/wp-content/uploads/2012/09/Sentiment-Analysis-Dataset.zip
[3] https://stackoverflow.com/questions/46387661/how-to-correctly-implement-a-batch-input-lstm-network-in-pytorch
[4] https://colah.github.io/posts/2015-08-Understanding-LSTMs/
[5] https://arxiv.org/abs/1801.06146