Similar Models

deepseek/r1-distill-qwen-7b

A 2026-native reasoning model distilled from R1. Specialized for agentic "Chain of Thought" logic on local hardware.

Jackrong/Qwen3.5-2B-Claude-4.6-Opus-Reasoning-Distilled-GGUF

A 2B dense architecture model fine-tuned with structured step-by-step reasoning trajectories distilled from Claude 4.6 Opus.

reasoningchain-of-thoughtqwen

← Back to Models

Distil-Qwen3-4B-Text2SQL

distil-labs/distil-qwen3-4b-text2sql-gguf

Task-specialized 4B model for natural-language-to-SQL conversion. Distilled from DeepSeek-V3. Quantized GGUF for local database agents.

How to Use

To get started, install the `transformers` library:

pip install transformers

Then, use the following snippet to load the model:

from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "distil-labs/distil-qwen3-4b-text2sql-gguf"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Your inference code here...

Available Versions

Tag / Variant	Size	Format	Download
distil-labs/distil-qwen3-4b-text2sql-gguf:Q4_K_M	3.1GB	GGUF	Link
distil-labs/distil-qwen3-4b-text2sql-gguf:Q5_K_M	3.6GB	GGUF	Link

Model Details

Teacher Model

DeepSeek-V3

Distillation Method

Knowledge Distillation (Logits)

Training Dataset

Flickr30k (Conceptual)

Primary Task

Multimodal Generation

Performance Metrics (Example)

Metric	Student Model	Teacher Model
Model Size	3.1GB	8.5GB
BLEU Score	28.5	30.1