Bronze
Back the work
$10 / month
- Your name on our supporters list
- A vote on what we build next
- A thank you in the build log
Open software
Run, compare, and score AI models on your own desktop. Live speed and memory, a private board, two side by side. Free, and nothing leaves your machine.
Orionfold Arena
Orionfold Arena is a single screen for running, comparing, and scoring the AI models on your own desktop. Open it on an NVIDIA DGX Spark and you see the machine’s live readouts, every model you have built, the tests those models were measured on, and a private board that ranks them from your own results. All of it runs on the machine under your desk, and none of it leaves.
If you build models on a Spark, you end up with a shelf of them and nowhere to drive them from. Picking one meant remembering a long command. Comparing two meant a terminal and a notebook. Knowing which small build was the good one meant digging up notes you wrote weeks ago. Arena turns that shelf into a control room. Chat with the model that is already warm and loaded, set two of them against each other, score an answer against a known-good answer, and read one chart to decide which build is worth shipping.
Jargon, in plain words: a GGUF is just a packaged model file you can run on your own machine. To quantize a model is to shrink it so it runs faster, and the chart shows what you trade away when you do. Throughput is how many words a second the model can produce.
Nothing you type is uploaded, and nothing you compare phones home, unless you deliberately pick a hosted model. The whole thing is a tool you could run on a plane.
Arena is a thin cockpit over the fieldkit toolbox and a year of real research. fieldkit serves the models, runs the tests, and scores the answers; Arena gives that work a screen. Because the parts already existed, the whole cockpit, fourteen screens and 125 tests, came together in about fifteen hours of work, with the AI agent doing the typing. The honest version of that story is the better one: the cockpit is the sum of a lot of compounding work, not a fresh start.
pip install fieldkit
# Start the Arena on your Spark and open it in a browser.
# It reads your own models, your benches, and your past results.
fieldkit arena serve
Back this work with a monthly tier. Your support moves your requests up the list, and Gold or Platinum earns a badge on the roadmap item you back.
Back the work
$10 / month
Get a say
$25 / month
Move it up the list
$50 / month
Shape the roadmap
$100 / month
Need something specific? Send an enquiry from the roadmap.

Offline patent reasoning in ready-to-run files, built with the NeMo toolkit. Nothing leaves your desktop.

Tuned for cyber security questions, threat write-ups, and security know-how.

Tuned for legal text and built to follow instructions on legal tasks.

Tuned for finance and money questions in plain chat.

Tuned for medical questions and clinical text.

Real notes from doing AI research on one desktop. The NVIDIA DGX Spark is a small machine with huge power (petascale means it runs about a quadrillion math steps a second), so you can push local AI further with no cloud needed. Every lesson is backed by code that runs.

I had a shelf of models on one desktop and no way to drive them. In fifteen hours I built a cockpit to run, compare, and score them, all on that desk.

I ran my first model on a small computer on my desk. 52 milliseconds to the first word, no cloud, no per-use bill. It felt like a local function, not a service.