Qwen/Qwen3.5-2B-GGUF
Alibaba Qwen3.5 2B edge-optimized model. Hybrid Gated DeltaNet+Attention architecture, 256K context, Apache 2.0. Built for tool-calling agents and multimodal workflows.
Mistral AI edge-optimized 3.4B+0.4B vision model. Native function calling, JSON outputs, 256K context. Built for tool-using agentic pipelines.
Alibaba Qwen3.5 sub-1B via Unsloth Dynamic 2.0. 256K context, Apache 2.0. Optimized for lightweight function-calling agents and document parsing workflows.
unsloth/LFM2-700M-GGUF
Liquid AI hybrid architecture via Unsloth. 700M params, 32K context, CPU-optimized. Built for narrow-scope agentic tasks: data extraction, RAG, multi-turn workflows.
To get started, install the `transformers` library:
pip install transformers

Then, use the following snippet to load the model:
from transformers import AutoTokenizer, AutoModelForCausalLM
model_id = "unsloth/LFM2-700M-GGUF"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
# Example inference (the prompt and generation settings are illustrative):
inputs = tokenizer("What is the capital of France?", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

LFM2-1.2B
Knowledge Distillation (Logits)
Flickr30k (Conceptual)
Multimodal Generation
| Metric | Student Model | Teacher Model |
|---|---|---|
| Model Size | 469MB | 8.5GB |
| BLEU Score | 28.5 | 30.1 |
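The table above compares a student distilled from its teacher via logit-based knowledge distillation. A minimal sketch of the soft-label objective (pure Python; the temperature and logit values are illustrative, not taken from this model's training recipe):

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL divergence between softened teacher and student distributions.

    Scaled by T^2 so gradient magnitudes stay comparable across
    temperatures, as in Hinton et al.'s formulation.
    """
    p = softmax(teacher_logits, temperature)  # teacher soft targets
    q = softmax(student_logits, temperature)  # student predictions
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl

# Identical logits give zero loss; diverging logits give a positive loss.
print(distillation_loss([2.0, 1.0, 0.1], [2.0, 1.0, 0.1]))       # 0.0
print(distillation_loss([0.5, 1.5, 0.2], [2.0, 1.0, 0.1]) > 0)   # True
```

In practice this soft-label term is combined with the usual cross-entropy loss on hard labels, with the temperature softening the teacher's distribution so the student also learns the relative probabilities of wrong classes.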