Zero To CAD Qwen3 VL 2B

Qwen3

Open Weight: model weights are publicly available and can be downloaded and self-hosted

Released: April 2026 | Context: N/A | Parameters: 2B

A compact vision-language model fine-tuned by ADSKAILab on Qwen3's 2B architecture. Its input modalities are listed as text-only despite the VL (vision-language) designation, which is a potential discrepancy in the listing. At 2B parameters it sits on the smaller end of current models, trading raw capacity for efficiency and deployability.
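To make the efficiency point concrete, the following is a rough back-of-the-envelope sketch of how much memory the weights alone would occupy at common numeric precisions. It assumes a nominal count of exactly 2 billion parameters (the card only says "2B", so the true figure may differ) and ignores activations, KV cache, and runtime overhead. The function name and precision table are illustrative, not part of any published tooling for this model.

```python
# Rough weight-memory estimate for a nominally 2B-parameter model.
# Assumption: exactly 2e9 parameters; the real count may differ.
NOMINAL_PARAMS = 2_000_000_000

# Bytes used to store one parameter at each precision (illustrative table).
BYTES_PER_PARAM = {
    "fp32": 4.0,
    "fp16/bf16": 2.0,
    "int8": 1.0,
    "int4": 0.5,
}


def weight_memory_gib(params: int, bytes_per_param: float) -> float:
    """Return the size of the weights in GiB (1 GiB = 2**30 bytes)."""
    return params * bytes_per_param / 2**30


if __name__ == "__main__":
    for precision, nbytes in BYTES_PER_PARAM.items():
        gib = weight_memory_gib(NOMINAL_PARAMS, nbytes)
        print(f"{precision:>10}: {gib:.2f} GiB")
```

Under these assumptions the weights come to roughly 3.7 GiB in 16-bit precision, which is why models of this size are often considered practical for a single consumer GPU or CPU-only hosting.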

Capabilities

Capability scores are AI-generated based on model documentation, benchmarks, and technical specifications.

Instruction Following

Moderate

Factual Knowledge

Basic

Reasoning & Logic

Use Case Fit

Fit scores are AI-generated based on model capabilities, intended use, and technical specifications.

Glossary

Architecture: The underlying structural design of a neural network that defines how data flows through layers and components.

Fine-Tuned: A pre-trained model further trained on a smaller, task-specific dataset to improve performance on that task.

Language Model: An AI model trained to predict and generate text by learning patterns from large amounts of written data.

Parameters: The learned numerical values in a model; more parameters generally means more capacity but higher compute cost.

Vision-Language: A model designed to understand and reason about both visual content (images) and natural language text together.

Vision-Language Model: An AI model that understands both images and text, allowing it to answer questions about images or describe what it sees.
