Grokking the Machine Learning Interview/

...

Metrics

Let's go over the metrics to evaluate the performance of the entity linking system.

We'll cover the following...

Offline metrics
Online metrics

Press + to interact

Named entity recognition

For the first layer/component, i.e., the recognition layer, you want to extract all the entity mentions from a given sentence. We will continue with the previous sentence example, i.e., “Michael Jordan is the best professor at UC Berkeley”.

It has two entity mentions:

Michael Jordan
UC Berkeley

NER should be able to detect both entities correctly. However, it may detect:

Both correctly
One correctly
None correctly (wrongly detect non-entity as an entity)
Correct entity but with the wrong type
No entity, i.e., altogether miss the entities in the sentence

Press + to interact

📝 You will call a recognition/detection of a named entity correct, only if it is an exact match of the entity in the labeled data. If NER only recognizes “Michael” as an entity and misses the “Jordan” part, it would be considered wrong. Moreover, if NER recognizes “Michael Jordan” as an entity but with the wrong type (say Organization), again, it would be considered wrong.

Given the above context on the correctness of the system, both precision and recall are important for measuring the performance of NER. They will be defined as:

Precision = $\frac{no.\; of\;correctly\; recognized\; named\; entities}{no\; of\;total\; recognized\; named\; entities}$

Recall = $\frac{no.\; of\;correctly\; recognized\; named\; entities}{no\; of\; named\; entities\; in\; corpus}$ ...

Introduction

Practical ML Techniques/Concepts

Search Ranking

Feed Based System

Recommendation System

Self-Driving Car: Image Segmentation

Entity Linking System

Ad Prediction System

Metrics

Offline metrics

Named entity recognition