Generator and Discriminator of the ELECTRA Model
Learn how the generator and discriminator models work in detail, and find out why we should prefer ELECTRA over BERT.
The generator model
First, let's have a look at the generator. The generator performs the masked language modeling (MLM) task. We randomly mask 15% of the input tokens and train the generator to predict the masked tokens. Let's represent our input tokens as
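As a rough illustration of the masking step described above, here is a minimal Python sketch that selects about 15% of the positions at random and replaces them with a `[MASK]` placeholder. The helper name `mask_tokens`, the `seed` parameter, and the rule of masking at least one token are assumptions made for this example and do not come from the lesson. The generator network itself is not shown. The sketch only prepares the masked input and the targets the generator would be trained to predict.

```python
import random

MASK_TOKEN = "[MASK]"  # placeholder string, assumed for this sketch


def mask_tokens(tokens, mask_rate=0.15, seed=None):
    """Randomly mask tokens; return (masked_tokens, {position: original_token})."""
    if not tokens:
        return [], {}
    rng = random.Random(seed)
    # Assumption: mask at least one token so short inputs still give a target.
    num_to_mask = min(len(tokens), max(1, round(len(tokens) * mask_rate)))
    positions = rng.sample(range(len(tokens)), num_to_mask)
    masked = list(tokens)
    targets = {}
    for pos in positions:
        targets[pos] = tokens[pos]  # what the generator should predict here
        masked[pos] = MASK_TOKEN
    return masked, targets


if __name__ == "__main__":
    sentence = "the chef cooked the meal and served it to the guests".split()
    masked, targets = mask_tokens(sentence, seed=0)
    print(masked)
    print(targets)
```

The generator would receive `masked` as input, and its predictions at the positions stored in `targets` would be compared against the original tokens during training.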