Open-weight model

Saul 7B Instruct

An open AI model tuned for legal text and built to follow instructions. It runs fully offline on a small desktop, so client matters and case files never leave the room.

Saul 7B Instruct
Field
Law
Runs
Fully offline
Built on
Saul 7B
License
MIT, free

Saul 7B Instruct

Saul 7B Instruct is an open AI model tuned for legal text. Legal work is full of careful reading, like contracts, filings, and case files, and much of it is private. This model reads and answers in private, fully offline, so client matters never leave the room.

What it can do

It follows plain instructions on legal tasks: sum up a contract, pull out the key duties and dates, explain a clause in simple words, or draft a first pass at a routine document. It is built on Equall’s Saul-7B-Instruct, an open model trained on legal material, and packed into ready-to-run files for a single desktop.

How well it works

We scored five builds on LegalBench, a 50-question legal test, on a small Spark desktop. The Q5_K_M build scored the best at 72 percent while running at about 20 tokens a second. If you want more speed, the Q4_K_M build is faster, at 29 tokens a second, and still scores 62 percent. The table above lists every build.

This is a short test, and it is not legal advice. Use the model as a fast first reader, and always have a person check its work before it matters.

How to run it

Download the GGUF files (the ready-to-run format) and run them with llama.cpp on a Spark-class desktop, a small AI machine with 128 GB of memory. Pick Q5_K_M for the best answers, or Q4_K_M when you want it faster.

Install

huggingface-cli download Orionfold/Saul-7B-Instruct-v1-GGUF

Use it

llama-cli -hf Orionfold/Saul-7B-Instruct-v1-GGUF:Q5_K_M -p "Summarize the key duties in a standard non-disclosure agreement."

Specs

Base model
Equall/Saul-7B-Instruct-v1
Format
GGUF (ready to run)
Builds
Q4_K_M · Q5_K_M · Q6_K · Q8_0 · F16
Best build
Q5_K_M (best test score, about 20 tokens a second on a Spark desktop)
License
MIT (free to use)

Benchmarks

BuildLegalBench scoreSpeed on a Spark
Q4_K_M (fastest)62%29 tokens a second
Q5_K_M (best score)72%20 tokens a second
Q6_K68%22 tokens a second
Q8_066%7 tokens a second
F16 (full size)68%11 tokens a second

Used in the open

Live counts from HuggingFace, refreshed when the site builds. Built and maintained in the open by Orionfold.

575
Downloads · last 30 days