A small gold desktop computer glows on a plain desk at night, sending bright beams of light up into a wide sky full of stars and constellations.

Built on the NVIDIA stack

AI research, run on one desk.

The NVIDIA DGX Spark is Orionfold's reference machine. The book, open models, and tools below were all proven on it first.

The NVIDIA DGX Spark is a small AI computer that sits on your own desk. This page gathers what was built and proven on it in one place: the book, the open models, the tools, and the Field Edition that turns a Spark into a turnkey AI lab.

Movie-poster key art for Orionfold Arena, the eval cockpit you run on your own desktop.

Arena Field Edition for DGX Spark

Get an AI team without hiring one, on the Spark you own.

One installer turns your NVIDIA DGX Spark into a private AI lab with an AI research teammate, and proves itself on first boot. The proof of what it unlocks: one builder, solo on this exact stack, shipped 14 tools, 6 models, 3 books, and 2 production sites. What you buy is the assembled, proven team, delivered turnkey.

$349 Founding price for the first 25 buyers, then $499. One-time payment.

Get Arena Field Edition · Watch the demo

Program membership

Orionfold is a member of the NVIDIA Inception program. The membership supports the same practical goal as this page: make private AI work easier to build, test, and run on customer-controlled hardware.

Offer fit on the NVIDIA stack

Orionfold Advisor

Shipping

A governed local AI advisor over an enterprise corpus for small teams adopting local AI. The first Advisor shipped in June 2026: a fine-tuned NVIDIA-native 4B model lane with exact source-id citations, adversarially tested refusals, recall-gated retrieval, and deterministic frontier routing under a spend cap.

NVIDIA used: NVIDIA DGX Spark reference hardware, CUDA-capable local inference stacks

Private Agent Starter Kit

Alpha/Beta

A fixed-scope starter kit that gives a small team one private agent workflow with trigger, runner, evaluation, approval loop, artifact output, and audit trail. It turns AI experimentation into an owned operating capability.

NVIDIA used: NVIDIA DGX Spark reference hardware, CUDA-capable inference runtimes

Orionfold Domain Experts

Shipping

Offline model packs for sensitive domain work. Each pack combines a domain model, benchmark or scorecard, local run path, and use-case playbook. Patent Strategist is the live flagship; security, legal, finance, and medical packs follow the same pattern.

NVIDIA used: NVIDIA DGX Spark, NVIDIA NeMo, CUDA-capable inference

Orionfold Local AI Cockpit

Shipping

A local cockpit for comparing, remembering, evaluating, and shipping private AI work. Arena provides the cockpit, Cortex adds local memory and provenance, and fieldkit carries reusable patterns for model runs, evals, RAG, training, and publishing.

NVIDIA used: NVIDIA DGX Spark, NVIDIA NIM, CUDA-capable inference, NVIDIA NeMo patterns

Most powerful AI lives far away, in a rented cloud. Orionfold builds the other way. Our reference machine is the NVIDIA DGX Spark, a small AI supercomputer that sits on one desk.

It is tiny but mighty. Inside is NVIDIA's GB10 Grace Blackwell chip and 128 GB of shared memory. That is enough to fine-tune (retrain) models up to about 70 billion parameters, and to run models up to about 200 billion, all on your own desk with no cloud and no meter running.
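To see roughly where the "run" figure comes from, here is a back-of-envelope sketch. It assumes the weights dominate memory and are stored at 4 bits each. It ignores the KV cache, activations, and runtime overhead, so treat each number as a lower bound, not a guarantee. The helper name `weight_memory_gb` is made up for this example.

```python
# Back-of-envelope check: how much memory do the model weights alone need?
# Assumption: weights dominate; KV cache, activations, and runtime
# overhead are ignored, so real usage will be higher.

SHARED_MEMORY_GB = 128  # the Spark's shared memory, in decimal gigabytes


def weight_memory_gb(params_billions: float, bits_per_param: int) -> float:
    """Memory needed just to hold the weights, in decimal GB."""
    bytes_per_param = bits_per_param / 8
    return params_billions * 1e9 * bytes_per_param / 1e9


for label, params_b, bits in [
    ("200B at 4-bit", 200, 4),
    ("200B at 16-bit", 200, 16),
    ("70B at 4-bit", 70, 4),
]:
    need = weight_memory_gb(params_b, bits)
    verdict = "fits" if need <= SHARED_MEMORY_GB else "does not fit"
    print(f"{label}: {need:.0f} GB of weights, {verdict} in {SHARED_MEMORY_GB} GB")
```

At 4 bits a 200-billion-parameter model needs about 100 GB for its weights, which is why it can sit inside 128 GB, while the same model at 16 bits would not. The fine-tuning ceiling is lower because training also needs room for gradients, optimizer state, and activations on top of the weights.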

Everything below was proven on this machine first. The book is the field log of the work. The models are tuned and tested on it, with the flagship built using NVIDIA's NeMo toolkit (NVIDIA's open kit for training models). The fieldkit toolbox is the set of patterns that held up on it.

The Spark is our reference, not our only home. The same models and tools also run on Apple Silicon and other small devices, a lighter path for when you do not need the full machine.

GB10 Grace Blackwell · 128 GB shared memory · About 1 PFLOP at FP4 · Fine-tune up to ~70B · Run up to ~200B

The field log

AI Research on NVIDIA DGX Spark

Real notes from doing AI research on one desktop. The NVIDIA DGX Spark is a small machine with huge power (petascale means it runs about a quadrillion math steps a second, a figure quoted at FP4 precision), so you can push local AI further with no cloud needed. Every lesson is backed by code that runs.

Local AI · NVIDIA DGX Spark · Petascale desktop · Backed by code

Explore the book

The models

Open-weight models tuned for one field each, all benchmarked on the Spark. The flagship is built with NVIDIA NeMo.

Advisory

Advisor

Answers from your own notes and files, with the exact source named. When the answer is not there, it says so instead of making one up. A small model you run on your own hardware.

Explore the model

Patent

Patent Strategist

The NeMo-built patent reasoning, packaged as a small add-on patch for the base model, so you can use it in your own custom builds.

Explore the model

Security

SecurityLLM

Tuned for cybersecurity questions, threat write-ups, and security know-how.

Explore the model

Legal

Saul 7B Instruct

Tuned for legal text and built to follow instructions on legal tasks.

Explore the model

Finance

Finance Chat

Tuned for finance and money questions in plain chat.

Explore the model

Medical

II-Medical 8B

Tuned for medical questions and clinical text.

Explore the model

Space

Kepler

Tuned for space math, like the paths satellites fly. It shows short work and ends in one number you can check.

Explore the model

The software

Open tools whose patterns were proven on the Spark before they shipped.

Governed advisor

Orionfold Advisor

A local AI advisor over your own body of documents. Every answer names the exact source it came from. If your documents cannot support an answer, it refuses instead of guessing. Every check it passed is a saved receipt you can re-run.
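The cite-or-refuse behavior can be pictured with a toy sketch. This is not Orionfold Advisor's code: the `Passage` type, the `answer` function, the word-overlap score, and the `min_overlap` threshold are all assumptions made for illustration. A real system would use a proper retriever and a model, but the shape is the same: either return an answer with its source id, or refuse.

```python
import re
from dataclasses import dataclass


@dataclass
class Passage:
    source_id: str  # hypothetical id format, e.g. "file#section"
    text: str


def tokens(text: str) -> set[str]:
    """Lowercase word set; a crude stand-in for real retrieval."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def answer(question: str, corpus: list[Passage], min_overlap: float = 0.5) -> dict:
    """Return the best-supported passage with its source id, or refuse."""
    q = tokens(question)
    best, best_score = None, 0.0
    for passage in corpus:
        # Share of question words that also appear in the passage.
        score = len(q & tokens(passage.text)) / len(q) if q else 0.0
        if score > best_score:
            best, best_score = passage, score
    if best is None or best_score < min_overlap:
        # Nothing in the corpus supports an answer: refuse, do not guess.
        return {"status": "refused", "source_id": None}
    return {"status": "answered", "text": best.text, "source_id": best.source_id}


corpus = [
    Passage("handbook.md#leave", "Employees accrue 1.5 days of leave per month."),
    Passage("policy.md#backup", "Backups run nightly at 02:00 and are kept for 30 days."),
]

supported = answer("How long are backups kept?", corpus)
unsupported = answer("What is the CEO's salary?", corpus)
print(supported["status"], supported["source_id"])
print(unsupported["status"], unsupported["source_id"])
```

The first question shares enough words with the backup policy to be answered with `policy.md#backup` as its source. The second matches nothing in the corpus, so the sketch refuses.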

Open the project

Eval cockpit

Orionfold Arena

One screen to run, compare, score, and now train the AI models on your own desktop. Watch live speed and memory, rank models on a private leaderboard, queue tests and training runs, and wake up to a morning report. Nothing you type leaves your machine.

Open the project

Memory layer

Orionfold Cortex

A second brain that lives on your own desktop. It indexes your notes, stamps where every fact came from, and grades its own memory: a rebuild that would make recall worse is caught, not shipped. Your documents never leave your machine.
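One way to picture "a rebuild that would make recall worse is caught" is a recall gate. The sketch below is an assumption about the general technique, not Cortex's actual code: the function names, the golden-query format, and the toy indexes are hypothetical. Each index is modeled as a function from a query to a ranked list of document ids.

```python
from typing import Callable

Index = Callable[[str], list[str]]  # query -> ranked document ids


def recall_at_k(index: Index, golden: list[tuple[str, str]], k: int = 3) -> float:
    """Share of golden queries whose expected document is in the top k."""
    hits = sum(1 for query, expected in golden if expected in index(query)[:k])
    return hits / len(golden)


def gate_rebuild(old: Index, new: Index, golden: list[tuple[str, str]], k: int = 3) -> dict:
    """Ship the rebuilt index only if recall did not get worse."""
    old_recall = recall_at_k(old, golden, k)
    new_recall = recall_at_k(new, golden, k)
    return {"old_recall": old_recall, "new_recall": new_recall,
            "ship": new_recall >= old_recall}


# Toy data: golden queries paired with the document each should retrieve.
golden = [("q1", "a"), ("q2", "c"), ("q3", "d")]
old_results = {"q1": ["a", "b"], "q2": ["c"], "q3": ["x"]}
new_results = {"q1": ["a"], "q3": ["x"]}  # the rebuild lost q2


def old_index(query: str) -> list[str]:
    return old_results.get(query, [])


def new_index(query: str) -> list[str]:
    return new_results.get(query, [])


report = gate_rebuild(old_index, new_index, golden)
print(report["ship"])
```

Here the old index finds two of three golden documents and the rebuild finds only one, so the gate reports `ship` as false and the rebuild would be held back.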

Open the project

Open toolbox

fieldkit

A Python toolbox of patterns we proved on a small AI desktop. It covers the whole job: faster replies, search over your own files, scoring, training, and shipping models. Use just the parts you need.

Open the project

The field notes

Short build-log posts from the real work on the machine.

Field note

My first model on a desktop

I ran my first model on a small computer on my desk. 52 milliseconds to the first word, no cloud, no per-use bill. It felt like a local function, not a service.

Read the note

Field note

Access first, models second

On day one with my desktop AI machine I did not pick a model. I set up how I reach it. Models change every six months. Good access lasts for years.

Read the note

Field note

The cockpit for my models

I had a shelf of models on one desktop and no way to drive them. In fifteen hours I built a cockpit to run, compare, and score them, all on that desk.

Read the note

Used in the open

Live counts from Hugging Face, refreshed when the site builds. Built and maintained in the open by Orionfold.

783
Model downloads · last 30 days: 135
Patent Strategist · last 30 days

Start with the book. It is free to read, and every lesson runs.

Read the field log online, keep a copy if you want one, then run the open models and tools on your own machine. Want to move the work forward faster? You can sponsor it.

Read the book free · Get the book and bundle · Sponsor the work

© 2025 NVIDIA, the NVIDIA logo, NVIDIA DGX Spark, NVIDIA Inception, NVIDIA NeMo, NVIDIA NIM, and TensorRT-LLM are trademarks and/or registered trademarks of NVIDIA Corporation in the U.S. and other countries.