Similar Models

Distilled Pruned Gemma-3

Sweelol-ai/kd-gemma3-pruned-dolly

A compact model that was first pruned to reduce its size and then knowledge-distilled from a larger teacher model on the Dolly-15k dataset.

How to Use

To get started, install the `transformers` library together with PyTorch, which `AutoModelForCausalLM` needs as its backend:

pip install transformers torch

Then, use the following snippet to load the model:

from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "Sweelol-ai/kd-gemma3-pruned-dolly"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Your inference code here...
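The placeholder above could be filled in along the following lines. This is a minimal sketch, not the authors' documented usage: the `build_prompt` helper and its instruction/context/response template are hypothetical (the card does not say which prompt format was used in training), and `max_new_tokens=128` is an arbitrary choice. Only the model ID and the two `from_pretrained` calls come from the snippet above.

```python
def build_prompt(instruction, context=""):
    """Format an instruction and optional context as a plain-text prompt.

    Hypothetical template: the model card does not specify a prompt format.
    """
    parts = ["### Instruction:", instruction.strip()]
    if context.strip():
        parts += ["", "### Context:", context.strip()]
    parts += ["", "### Response:", ""]
    return "\n".join(parts)


def generate(instruction, context="", max_new_tokens=128):
    """Load the model and return the text generated for one instruction."""
    # Imported here so that build_prompt can be used without transformers installed.
    from transformers import AutoTokenizer, AutoModelForCausalLM

    model_id = "Sweelol-ai/kd-gemma3-pruned-dolly"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    inputs = tokenizer(build_prompt(instruction, context), return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Keep only the tokens produced after the prompt.
    prompt_length = inputs["input_ids"].shape[1]
    new_tokens = output_ids[0][prompt_length:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Example call (downloads the model weights on first use):
# print(generate("Explain knowledge distillation in one sentence."))
```

Loading the model inside `generate` keeps the sketch short; for repeated calls it would be better to load the tokenizer and model once and pass them in.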

Available Versions

Tag / Variant | Size | Format | Download
No specific variants listed for this model.

Model Details

Teacher Model

N/A

Distillation Method

Knowledge Distillation (Logits)

Training Dataset

Flickr30k (Conceptual)

Primary Task

Multimodal Generation
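The "Knowledge Distillation (Logits)" entry above names the method but not its exact loss. One common formulation, shown here as a dependency-free sketch, softens the teacher and student logits with a temperature and penalizes the KL divergence between the two distributions, scaled by the squared temperature. The temperature value of 2.0 and the function names are assumptions for illustration, not details taken from this model card.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of logits divided by a temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    log_norm = peak + math.log(sum(math.exp(x - peak) for x in scaled))
    return [x - log_norm for x in scaled]


def kd_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions, times T^2.

    The temperature is an assumed hyperparameter, not one stated in the card.
    """
    teacher_log_probs = log_softmax(teacher_logits, temperature)
    student_log_probs = log_softmax(student_logits, temperature)
    kl = sum(
        math.exp(t) * (t - s)
        for t, s in zip(teacher_log_probs, student_log_probs)
    )
    # Scaling by T^2 keeps gradient magnitudes comparable across temperatures.
    return kl * temperature ** 2


# Identical logits give zero loss; diverging logits give a positive loss.
same = kd_loss([1.0, 2.0, 3.0], [1.0, 2.0, 3.0])
different = kd_loss([3.0, 2.0, 1.0], [1.0, 2.0, 3.0])
```

In practice this term is usually computed per token with a tensor library and often combined with the ordinary cross-entropy loss on the ground-truth labels.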

Performance Metrics (Example)

Metric | Student Model | Teacher Model
Model Size | ~270 MB | 8.5 GB
BLEU Score | 28.5 | 30.1