Skip to content

Latest commit

 

History

History
14 lines (11 loc) · 408 Bytes

README.md

File metadata and controls

14 lines (11 loc) · 408 Bytes

NLP

Goal: To identify the author of the paragraph given from the book.

Natural Language Processing:
Steps for the text data:-
1.) Lowercase
2.) Remove Puntuation
3.) Remove stopwords
4.) Remove non english words if any
5.) Stemming
6.) Tf-idf for converting words to numbers

naive bayes model on the pre processed data Kaggle submission Top 20%