A quantized variant of DeepSeek V4 Flash, optimized with NVFP4 and FP8 precision formats for efficient inference. It trades some numerical precision for reduced memory footprint and faster throughput, making it practical for deployment on constrained hardware. The open-weight MIT license gives developers full flexibility to inspect, modify, and redistribute.