Models Capabilities Use Cases Benchmarks Papers Glossary

Models Capabilities Use Cases Benchmarks Papers Glossary

About Privacy Terms RSS

ThinkLLM

Spot an error in our data? Let us know.

Glossary/Tokenizer

Tokenizer

architecture

The component that splits text into tokens (subwords or characters) that the model can process.

Learn more on Wikipedia

Related Capabilities

Quality of non-English language understanding and generation

Tokenizer — Glossary — ThinkLLM