Similar Models

Mini LLaMA Chat

sweelol/mini-llama-chat

A compact chat-tuned model distilled from LLaMA-2 for quick, lightweight interactions.

How to Use

To get started, install the `transformers` library:

pip install transformers

Then, use the following snippet to load the model and run a quick generation:

from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "sweelol/mini-llama-chat"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Run a quick test generation
prompt = "Hello! What can you do?"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
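
Because the model is chat-tuned, wrapping prompts in the tokenizer's chat template usually yields better responses. A minimal sketch using the transformers chat-template helper, assuming the repository ships a chat template (not verified here):

messages = [{"role": "user", "content": "Explain knowledge distillation in one sentence."}]

# Build the prompt in the model's expected chat format
input_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")
outputs = model.generate(input_ids, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))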

Available Versions

Tag / Variant                 Size    Format  Download
sweelol/mini-llama-chat:Q4_0  2.1 GB  GGUF    Link
sweelol/mini-llama-chat:Q8_0  3.8 GB  GGUF    Link
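
The GGUF variants are intended for llama.cpp-compatible runtimes rather than transformers. A minimal sketch using llama-cpp-python, assuming the Q4_0 file has already been downloaded locally (the filename below is illustrative, not the actual artifact name):

from llama_cpp import Llama

# Load the locally downloaded Q4_0 GGUF file (path is an assumption)
llm = Llama(model_path="./mini-llama-chat.Q4_0.gguf")

# Run a short completion against the quantized model
output = llm("User: Hello!\nAssistant:", max_tokens=64)
print(output["choices"][0]["text"])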

Model Details

Teacher Model

meta-llama/Llama-2-13b-chat-hf

Distillation Method

Knowledge Distillation (Logits)
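
Logit distillation trains the student to match the teacher's full output distribution over the vocabulary, typically by minimizing a temperature-scaled KL divergence. A minimal PyTorch sketch of that loss; the temperature value is an illustrative assumption, not the recipe used for this model:

import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    # Soften both distributions with the temperature
    student_log_probs = F.log_softmax(student_logits / temperature, dim=-1)
    teacher_probs = F.softmax(teacher_logits / temperature, dim=-1)
    # KL divergence, scaled by T^2 so gradient magnitudes stay comparable across temperatures
    return F.kl_div(student_log_probs, teacher_probs, reduction="batchmean") * temperature ** 2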

Training Dataset

Flickr30k (Conceptual)

Primary Task

Multimodal Generation

Performance Metrics (Example)

Metric      Student Model  Teacher Model
Model Size  2.1 GB         8.5 GB
BLEU Score  28.5           30.1
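
The BLEU numbers above are example figures. For reference, corpus BLEU is commonly computed with sacrebleu along these lines (the data here is illustrative, and this is not necessarily the evaluation setup behind the table):

import sacrebleu

# One hypothesis per example, plus a list of reference sets (toy data)
hypotheses = ["the cat sat on the mat"]
references = [["a cat was sitting on the mat"]]

bleu = sacrebleu.corpus_bleu(hypotheses, references)
print(bleu.score)  # corpus-level BLEU on a 0-100 scale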