Open-weight model

Saul 7B Instruct

Name: Saul 7B Instruct
Brand: Orionfold
Availability: InStock

Open legal AI model. Reads contracts, filings, and case files fully offline on one desktop, so client matters never leave the room. Free, MIT-licensed.

Sponsor this work Open on HuggingFace

Field: Law
Runs: Fully offline
Built on: Saul 7B
License: MIT, free

Saul 7B Instruct

Sponsor

Saul 7B Instruct is an open AI model tuned for legal text. Legal work is full of careful reading, like contracts, filings, and case files, and much of it is private. This model reads and answers in private, fully offline, so client matters never leave the room.

What it can do

It follows plain instructions on legal tasks: sum up a contract, pull out the key duties and dates, explain a clause in simple words, or draft a first pass at a routine document. It is built on Equall’s Saul-7B-Instruct, an open model trained on legal material, and packed into ready-to-run files for a single desktop.

How well it works

We scored five builds on LegalBench, a 50-question legal test, on a small Spark desktop. The Q5_K_M build scored the best at 72 percent while running at about 20 tokens a second. If you want more speed, the Q4_K_M build is faster, at 29 tokens a second, and still scores 62 percent. The table above lists every build.

This is a short test, and it is not legal advice. Use the model as a fast first reader, and always have a person check its work before it matters.

How to run it

Download the GGUF files (the ready-to-run format) and run them with llama.cpp on a Spark-class desktop, a small AI machine with 128 GB of memory. Pick Q5_K_M for the best answers, or Q4_K_M when you want it faster.

Install

huggingface-cli download Orionfold/Saul-7B-Instruct-v1-GGUF

llama-cli -hf Orionfold/Saul-7B-Instruct-v1-GGUF:Q5_K_M

Use it

llama-cli -hf Orionfold/Saul-7B-Instruct-v1-GGUF:Q5_K_M -p "Summarize the key duties in a standard non-disclosure agreement."

from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="Orionfold/Saul-7B-Instruct-v1-GGUF",
    filename="*Q5_K_M.gguf",
)
out = llm("Summarize the key duties in a standard non-disclosure agreement.")
print(out["choices"][0]["text"])

Specs

Base model: Equall/Saul-7B-Instruct-v1
Format: GGUF (ready to run)
Builds: Q4_K_M · Q5_K_M · Q6_K · Q8_0 · F16
Best build: Q5_K_M (best test score, about 20 tokens a second on a Spark desktop)
License: MIT (free to use)

Benchmarks

Build	LegalBench score	Speed on a Spark
Q4_K_M (fastest)	62%	29 tokens a second
Q5_K_M (best score)	72%	20 tokens a second
Q6_K	68%	22 tokens a second
Q8_0	66%	7 tokens a second
F16 (full size)	68%	11 tokens a second

Used in the open

Live counts from HuggingFace, refreshed when the site builds. Built and maintained in the open by Orionfold.

74
Downloads · last 30 days

Get the Proof playbook

Think a small local model can beat the frontier ones?

We proved it. Rerun it yourself, do not take our word for it.

By subscribing you agree to receive the AI For Everyone digest, one email a week, no more. You can unsubscribe any time. See our privacy policy.

See which AI wins, on your own desk

Run, compare, and score on your own Spark.

Orionfold Arena is the cockpit that runs, compares, scores, and trains local AI models on one DGX Spark. The model, the tests, and the results in one place you control.

$349 founding, first 25 then $499 one time

or see the full details

Orionfold Arena poster: the eval cockpit you run on your own DGX Spark.

Keep exploring

Model

Patent Strategist

Offline patent reasoning in ready-to-run files, built with the NeMo toolkit. Nothing leaves your desktop.

Book

AI Research on NVIDIA DGX Spark

Real notes from doing AI research on one desktop. The NVIDIA DGX Spark is a small machine with huge power (petascale means it runs about a quadrillion math steps a second), so you can push local AI further with no cloud needed. Every lesson is backed by code that runs.

Resources

WeightsGGUF files (ready to run)

Saul 7B Instruct

What it can do

How well it works

How to run it

Install

Use it

Specs

Benchmarks

Used in the open

Sponsor Saul 7B Instruct

Bronze

Silver

Gold

Platinum

Think a small local model can beat the frontier ones?

Run, compare, and score on your own Spark.

Keep exploring

Patent Strategist

AI Research on NVIDIA DGX Spark

Resources

Further reading