Masked language modelling — predicting hidden tokens

Frequently Asked Questions

What is a Masked Language Model?

A Masked Language Model (MLM) is a model trained with a pre-training objective in which a percentage of the tokens in the input text are randomly masked (hidden), and the model must predict the original tokens from the surrounding context in both directions.
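The masking step can be illustrated with a minimal sketch in plain Python. The function name `mask_tokens`, the `[MASK]` placeholder string, and the default rate of 0.15 (the rate used in the original BERT recipe) are assumptions chosen for illustration. The sketch is also simplified: it always substitutes the mask placeholder, whereas BERT sometimes keeps the original token or swaps in a random one.

```python
import random

MASK = "[MASK]"  # placeholder string; real tokenizers define their own mask token


def mask_tokens(tokens, mask_prob=0.15, seed=None):
    """Randomly hide tokens and record what the model should predict.

    Returns (masked, labels). labels[i] holds the original token at each
    masked position and None everywhere else, so a loss would only be
    computed where a token was hidden.
    """
    rng = random.Random(seed)
    masked, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            masked.append(MASK)   # hide the token from the model
            labels.append(tok)    # the model must recover this token
        else:
            masked.append(tok)    # visible context, left unchanged
            labels.append(None)   # no prediction target here
    return masked, labels


tokens = "the cat sat on the mat".split()
masked, labels = mask_tokens(tokens, mask_prob=0.3, seed=0)
print(masked)
print(labels)
```

Every unmasked token, whether before or after a hidden position, stays visible in `masked`, which is what lets the model draw on context in both directions.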

How is a Masked Language Model used in practice?

The MLM objective is used to pre-train BERT and its variants. It forces the model to develop rich bidirectional representations, because predicting each masked token requires integrating context from both the left and the right of that position.
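One common way to write this objective, offered as a sketch: let x be the token sequence, M the set of masked positions, and x_{\setminus M} the sequence with those positions hidden. These symbols are notation introduced here for illustration. Training minimizes the negative log-likelihood of the original tokens at the masked positions:

```latex
\mathcal{L}_{\text{MLM}}(\theta)
  = -\,\mathbb{E}_{x,\,M}\left[\,\sum_{i \in M} \log p_\theta\!\left(x_i \mid x_{\setminus M}\right)\right]
```

Because x_{\setminus M} contains the visible tokens on both sides of position i, each prediction can condition on left and right context at once.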

Why is a Masked Language Model important in AI?

Masked language modelling is a foundational training technique. As a pre-training objective, it teaches a model to predict randomly masked tokens in a sentence using bidirectional context.

See Also