PDF summarizer with Latent Dirichlet Allocation to save time when it comes to reading
- Clean data more in research LDA model
- possibly do separate analysis on tables in research
- check distribution of tfidf to figure out what features can be tweaked/removed in research