Similar Models

Sweelol-ai/lora-gemma3-270m-dolly

A Gemma-3 270M model fine-tuned on the Dolly-15k dataset using Low-Rank Adaptation (LoRA) for maximum efficiency.

Sweelol-ai/kd-gemma3-pruned-dolly

A highly optimized model, first pruned for size and then knowledge-distilled from a larger teacher on the Dolly-15k dataset.

gemma3knowledge-distillationpruned

Sweelol-ai/gemma3-270m-dolly-teacher

New

A Gemma-3 270M model fully fine-tuned on the Dolly-15k dataset, intended to be used as a "teacher" for knowledge distillation.

gemma3fine-tunedteacher

← Back to Models

Mini LLaMA Chat

sweelol/mini-llama-chat

A compact chat-tuned model, distilled from LLaMA-2 for quick interactions.

How to Use

To get started, install the `transformers` library:

pip install transformers

Then, use the following snippet to load the model:

from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "sweelol/mini-llama-chat"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Your inference code here...

Available Versions

Tag / Variant	Size	Format	Download
sweelol/mini-llama-chat:Q4_0	2.1GB	GGUF	Link
sweelol/mini-llama-chat:Q8_0	3.8GB	GGUF	Link

Model Details

Teacher Model

meta-llama/Llama-2-13b-chat-hf

Distillation Method

Knowledge Distillation (Logits)

Training Dataset

Flickr30k (Conceptual)

Primary Task

Multimodal Generation

Performance Metrics (Example)

Metric	Student Model	Teacher Model
Model Size	2.1GB	8.5GB
BLEU Score	28.5	30.1