MLM

MLM

Masked Language Model

A pre-training objective that trains a model to predict randomly masked tokens in a sentence using bidirectional context.

A Masked Language Model (MLM) is trained by randomly masking (hiding) a percentage of tokens in input text and requiring the model to predict the original tokens using the surrounding context in both directions.

The MLM objective, used in BERT and its variants, forces the model to develop rich bidirectional representations because it must integrate context from both the left and right of each masked position.

🔍 Click image to zoom

Masked language modelling — predicting hidden tokens

Frequently Asked Questions

What is Masked Language Model?

A pre-training objective that trains a model to predict randomly masked tokens in a sentence using bidirectional context. A Masked Language Model (MLM) is trained by randomly masking (hiding) a percentage of tokens in input text and requiring the model to predict the original tokens using the surrounding context in both directions.

How is Masked Language Model used in practice?

Why is Masked Language Model important in AI?

Masked Language Model is a foundational concept in Training Technique. A pre-training objective that trains a model to predict randomly masked tokens in a sentence using bidirectional context.

Frequently Asked Questions

What is Masked Language Model?

How is Masked Language Model used in practice?

Why is Masked Language Model important in AI?

See Also