A mid-sized mixture-of-experts base model that activates only 3 billion of its 35 billion parameters per forward pass, keeping inference cost low relative to its total capacity. As a base model, it arrives without instruction tuning or chat formatting: it is suited to text completion and to serving as a foundation for fine-tuning, rather than to direct conversation.
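The sparse-activation idea behind that active/total parameter split can be sketched with a toy top-k router; the dimensions, expert count, and weights below are illustrative stand-ins, not the real model's:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy mixture-of-experts layer: many experts exist, but only top_k run per token.
n_experts, top_k = 16, 2
d_model, d_ff = 8, 32

# Hypothetical weights (illustrative scale only).
router_w = rng.standard_normal((d_model, n_experts))
experts_w1 = rng.standard_normal((n_experts, d_model, d_ff))
experts_w2 = rng.standard_normal((n_experts, d_ff, d_model))

def moe_forward(x):
    """Route one token vector to its top_k experts and mix their outputs."""
    logits = x @ router_w
    top = np.argsort(logits)[-top_k:]                        # chosen expert indices
    gates = np.exp(logits[top]) / np.exp(logits[top]).sum()  # softmax over chosen
    out = np.zeros_like(x)
    for g, e in zip(gates, top):
        h = np.maximum(x @ experts_w1[e], 0.0)               # expert FFN with ReLU
        out += g * (h @ experts_w2[e])
    return out, top

x = rng.standard_normal(d_model)
y, chosen = moe_forward(x)

# Only top_k of n_experts expert-weight blocks were touched for this token,
# which is the same active-vs-total relationship as 3B active of 35B total.
per_expert = d_model * d_ff + d_ff * d_model
active, total = top_k * per_expert, n_experts * per_expert
print(f"active/total expert params: {active}/{total} = {active / total:.1%}")
```

Each token pays only for the experts the router selects, which is why compute per forward pass tracks the active parameter count rather than the total.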