Word2vec

In this lesson, we will discuss how word2vec embeddings work.

What is word2vec?

Word2vec is one of the most popular techniques for learning word embeddings using a shallow neural network (a network with a single hidden layer). It was developed by Tomas Mikolov and his team at Google in 2013. Some key points to know about word2vec:

  • The pretrained Google News model contains vector representations of around 3 million words and phrases, trained on a corpus of roughly 100 billion words.
  • Words that appear in similar contexts have similar vectors.
  • The similarity between two words can be measured using the cosine similarity between their vectors (see the sketch after this list).
  • It represents each word as a 300-dimensional vector.
  • To use this model, we suggest you use Google Colab because the model is around 1.5 GB, and you need to download it to move forward in this project.
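
The snippet below is a minimal sketch of these points in practice. It assumes the gensim library and the pretrained GoogleNews-vectors-negative300.bin file (the ~1.5 GB download mentioned above); gensim and the file name are assumptions, since the lesson does not name a specific library.

```python
# A minimal sketch using gensim (an assumption; the lesson does not name a
# library). Assumes GoogleNews-vectors-negative300.bin has been downloaded.
import numpy as np
from gensim.models import KeyedVectors

# Load the pretrained 300-dimensional word vectors.
model = KeyedVectors.load_word2vec_format(
    "GoogleNews-vectors-negative300.bin", binary=True
)

# Each word is represented as a 300-dimensional vector.
print(model["king"].shape)  # (300,)

# Built-in cosine similarity between two word vectors.
print(model.similarity("king", "queen"))

# The same cosine similarity computed by hand.
v1, v2 = model["king"], model["queen"]
print(np.dot(v1, v2) / (np.linalg.norm(v1) * np.linalg.norm(v2)))

# Words whose vectors are closest to "king" in the embedding space.
print(model.most_similar("king", topn=5))
```

Words used in similar contexts, such as "king" and "queen", yield a high cosine similarity, while unrelated words score close to zero.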

In addition to producing word embeddings, the word2vec approach has shown strong results in building recommendation engines and working with sequential data. Companies like Airbnb, Alibaba, and Spotify have used it to build and improve their recommendation engines.
