Models Capabilities Use Cases Benchmarks Papers Glossary

Models Capabilities Use Cases Benchmarks Papers Glossary

About Privacy Terms RSS

ThinkLLM

Spot an error in our data? Let us know.

Glossary/Pre-norm

Pre-norm

techniques

A Transformer design choice where layer normalization is applied before the main computation rather than after.

Pre-norm — Glossary — ThinkLLM