Sentence-BERT
Learn about Sentence-BERT, its fine-tuning architectures, and the different ways to compute sentence representations.
Sentence-BERT was introduced by the Ubiquitous Knowledge Processing Lab (UKP-TUDA). As the name suggests, Sentence-BERT is used for obtaining fixed-length sentence representations. It extends the pre-trained BERT model (or its variants) to obtain these sentence representations.
Why do we need Sentence-BERT when we can use vanilla BERT or its variants for obtaining sentence representations?
Sentence-BERT is popularly used in tasks such as sentence pair classification and computing the similarity between two sentences, as the short sketch below illustrates.
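To make that use case concrete, here is a minimal sketch of computing the similarity between two sentences with Sentence-BERT. It assumes the sentence-transformers library and the 'bert-base-nli-mean-tokens' checkpoint, neither of which is mandated by the text; any Sentence-BERT model would work the same way.

```python
from sentence_transformers import SentenceTransformer, util

# 'bert-base-nli-mean-tokens' is one of the original Sentence-BERT checkpoints;
# any other Sentence-BERT model name could be substituted here.
model = SentenceTransformer('bert-base-nli-mean-tokens')

sentence1 = 'Paris is a beautiful city'
sentence2 = 'I love visiting Paris'

# encode() returns a fixed-length vector for each sentence
embedding1 = model.encode(sentence1, convert_to_tensor=True)
embedding2 = model.encode(sentence2, convert_to_tensor=True)

# Cosine similarity between the two sentence embeddings
similarity = util.cos_sim(embedding1, embedding2)
print(similarity.item())
```

Before understanding how Sentence-BERT works in detail, let's first take a look at computing a sentence representation using the pre-trained BERT model.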
Computing sentence representation
Consider the sentence 'Paris is a beautiful city'. Suppose we need to compute the representation of this sentence. First, we tokenize the sentence and add a [CLS] token at the beginning and a [SEP] token at the end, so our tokens become the following:

tokens = [ [CLS], Paris, is, a, beautiful, city, [SEP] ]
Now, we feed these tokens to the pre-trained BERT model, and it returns a representation for each of the tokens.
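As a concrete illustration of these steps, here is a minimal sketch. It assumes the Hugging Face transformers library and the bert-base-uncased checkpoint, neither of which is specified by the text; any pre-trained BERT variant would behave the same way.

```python
import torch
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
model = BertModel.from_pretrained('bert-base-uncased')

sentence = 'Paris is a beautiful city'

# The tokenizer adds the [CLS] and [SEP] tokens for us
inputs = tokenizer(sentence, return_tensors='pt')
print(tokenizer.convert_ids_to_tokens(inputs['input_ids'][0].tolist()))
# ['[CLS]', 'paris', 'is', 'a', 'beautiful', 'city', '[SEP]']

# Feed the tokens to the pre-trained BERT model
with torch.no_grad():
    outputs = model(**inputs)

# One contextual representation per token: shape [1, number_of_tokens, 768] for bert-base
token_representations = outputs.last_hidden_state
print(token_representations.shape)
```

Note that this gives one vector per token rather than a single fixed-length vector for the whole sentence.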