Questions on NLP

  1. The computer understands the numerical value
  2. Once we convert text to vector we can leverage the power of algebra
  1. BagOfWords
  1. Remove the HTML tags
  2. Stop word removal (not is a stop word but it's an important word to determine polarity. We can use n-gram to solve the issue of removal of important stop words)
  3. Remove non-alphanumeric character
  4. Change to Lower case
  5. Lemmatization (nltk.stem.wordnet.WordNetLemmatizer)
  6. Stemming (nltk.stem.PorterStemmer / nltk.stem.SnowballStemmer)
  7. Tokenization
  8. Converting text to vector
  9. Thresholding




Machine Learning Enthusiast

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

Predicting Sign Language Based On Hand Signals

Five things I learned about working on content quality at Instagram

A practical use case of Machine Learning In Amazon


7 Must have projects for Machine Learning Beginners

Understanding Word Vectorization In NLP using Word2Vec

Using TensorBoard in an Amazon SageMaker PyTorch Training job: a Step-by-Step Tutorial

Fraud detection model in digital transactions

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Chinmayi Sahu

Chinmayi Sahu

Machine Learning Enthusiast

More from Medium

Data analysis of medical Big Data based in Deep Learning

Tumor Segmentation

Tennis Clustering — Break & Serve Performance Analysis.

Covid-19 Public Opinion Sentiment Analysis: An NLP Project with Neural Network