A smaller, faster version of BERT that retains most of its language understanding ability while using fewer parameters and less computational power.
World knowledge accuracy, recall of facts and relationships
Adhering to complex, structured, or constrained instructions