Open-weight model

SecurityLLM

Name: SecurityLLM
Brand: Orionfold
Availability: InStock

An open AI model for cyber security work, like spotting threats and writing up what they mean. It runs fully offline on a small desktop, so sensitive details never leave your network.

Sponsor this work Open on HuggingFace

Field: Cyber security
Runs: Fully offline
Built on: ZySec SecurityLLM
License: Apache-2.0, free

SecurityLLM

Sponsor

SecurityLLM is an open AI model tuned for cyber security. Security work means finding weak spots, reading attack reports, and explaining what a threat does and how to stop it. A lot of that text is sensitive, so this model does its thinking fully offline, and nothing leaves your network.

What it can do

It answers security questions in plain words, sums up threat reports, and walks through how an attack works and how to defend against it. It is built on ZySec-AI’s SecurityLLM, an open model already trained on security material, and packed into ready-to-run files so it starts fast on a single desktop.

How well it works

We scored five builds on CyberMetric, a 50-question security quiz, on a small Spark desktop. The surprise: the smallest, fastest build (Q4_K_M) scored the best at 40 percent and ran at about 48 tokens a second. The full-size build was both slower and a touch behind. So the cheap build is the one to run. The table above has every build.

These scores come from a short quiz, not a full audit. Treat the model as a fast helper for security questions, and always check its answers against trusted sources before you act.

How to run it

Download the GGUF files (the ready-to-run format) and run them with llama.cpp on a Spark-class desktop, a small AI machine with 128 GB of memory. Start with the Q4_K_M build: it is the fastest here and scored highest on our test.

Install

huggingface-cli download Orionfold/SecurityLLM-GGUF

llama-cli -hf Orionfold/SecurityLLM-GGUF:Q4_K_M

Use it

llama-cli -hf Orionfold/SecurityLLM-GGUF:Q4_K_M -p "Explain how a SQL injection attack works and how to stop it."

from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="Orionfold/SecurityLLM-GGUF",
    filename="*Q4_K_M.gguf",
)
out = llm("Explain how a SQL injection attack works and how to stop it.")
print(out["choices"][0]["text"])

Specs

Base model: ZySec-AI/SecurityLLM
Format: GGUF (ready to run)
Builds: Q4_K_M · Q5_K_M · Q6_K · Q8_0 · F16
Best build: Q4_K_M (about 48 tokens a second on a Spark desktop)
License: Apache-2.0 (free to use)

Benchmarks

Build	CyberMetric score	Speed on a Spark
Q4_K_M (best pick)	40%	48 tokens a second
Q5_K_M	38%	40 tokens a second
Q6_K	36%	35 tokens a second
Q8_0	36%	30 tokens a second
F16 (full size)	34%	17 tokens a second

Used in the open

Live counts from HuggingFace, refreshed when the site builds. Built and maintained in the open by Orionfold.

124
Downloads · last 30 days

Get the Proof playbook

Think a small local model can beat the frontier ones?

We proved it. Rerun it yourself, do not take our word for it.

By subscribing you agree to receive the AI For Everyone digest, one email a week, no more. You can unsubscribe any time. See our privacy policy.

See which AI wins, on your own desk

Run, compare, and score on your own Spark.

Orionfold Arena is the cockpit that runs, compares, scores, and trains local AI models on one DGX Spark. The model, the tests, and the results in one place you control.

$349 founding, first 25 then $499 one time

or see the full details

Orionfold Arena poster: the eval cockpit you run on your own DGX Spark.

Keep exploring

Model

Patent Strategist

Offline patent reasoning in ready-to-run files, built with the NeMo toolkit. Nothing leaves your desktop.

Book

AI Research on NVIDIA DGX Spark

Real notes from doing AI research on one desktop. The NVIDIA DGX Spark is a small machine with huge power (petascale means it runs about a quadrillion math steps a second), so you can push local AI further with no cloud needed. Every lesson is backed by code that runs.

Resources

WeightsGGUF files (ready to run)

SecurityLLM

What it can do

How well it works

How to run it

Install

Use it

Specs

Benchmarks

Used in the open

Sponsor SecurityLLM

Bronze

Silver

Gold

Platinum

Think a small local model can beat the frontier ones?

Run, compare, and score on your own Spark.

Keep exploring

Patent Strategist

AI Research on NVIDIA DGX Spark

Resources

Further reading