Build the organisation around the genius.

Every quantum software stack today treats the QPU as an accelerator — a coprocessor slotted into a classical architecture. The classical system remains the skeleton. The QPU is attached where convenient.

QUASI inverts this. The QPU is the centre. The entire software architecture is designed from one question: what does the QPU need to be maximally productive?

CPUs, compilers, GPUs — they handle stabilising, statefulness, execution. They serve the QPU. Not the other way around.

The QPU is not faster. It is qualitatively different.

It can do something no classical system can: superposition, entanglement, interference. But it is fragile. It needs context. It needs translation. Like a wildly productive mind that needs fifteen people in the studio to make sure every utterance translates into consequential action.

This is not how the industry builds quantum software. The industry starts with what exists — Python SDKs, classical schedulers, runtime error mitigation — and bolts QPU access on top. It is pragmatic. It ships.

QUASI starts from the goal and works backward. What does a system look like that was designed for quantum-first computing? Not adapted from classical. Not incrementally extended. Designed. That means: physics-native input, compile-time noise rejection, enforced hardware abstraction, zero classical dependencies in the critical path.

Architecture comparisons

How QUASI's architecture compares to existing quantum computing stacks, layer by layer. Each comparison follows the same structure: compilation, hardware abstraction, task orchestration, workflow representation, error handling.

IBM QCSC NVIDIA CUDA-Q Pasqal QOS / Qernel Google Cirq Amazon Braket

Comparison

QUASI vs IBM Quantum-Centric Supercomputer

Based on Seelam et al., “Reference Architecture of a Quantum-Centric Supercomputer,” arXiv:2603.10970, March 2026. Authored by IBM Quantum leadership (Gambetta, Chow, Sheldon, et al.).

IBM proposes a three-phase roadmap for integrating QPUs into HPC clusters alongside GPUs and CPUs. Key abstractions: Tensor Compute Graph (TCG), Quantum Systems API (QSA), QRMI (Slurm plugin), and three coupling modes (batch, near-time, real-time).

Compilation

IBM QCSCQUASI
LanguagePython (Qiskit SDK)Rust (Afana)
InputQiskit circuit objectsEhrenfest CBOR binary
OutputVendor-specific ISAOpenQASM 2.0 / 3.0
NoiseRuntime mitigation (TEM, Pauli propagation)Compile-time rejection (type-level noise constraints)
DependenciesQiskit, NumPy, SciPy, vendor pluginsciborium, quizx, serde, clap — no Python
DeploymentPython environment + pipSingle static Rust binary

Key difference: IBM mitigates noise after execution. Afana rejects programs that violate their noise budget before compilation completes. Compile-time rejection is cheaper and eliminates wasted QPU time on doomed circuits.

Key difference: IBM’s compilation depends on Qiskit — a Python framework with heavy transitive dependencies. Afana is a zero-dependency Rust binary. In HPC environments, dependency-free deployment matters.

Hardware Abstraction

IBM QCSCQUASI
NameQuantum Systems API (QSA)HAL Contract
InterfaceUnspecified (“potentially vendor-portable”)POST /hal/jobs (REST, vendor-neutral)
NeutralityAspirationalEnforced — compiler cannot address hardware directly
EnforcementArchitectural intentCI boundary check (compiler-boundary job)
IBMQiskit → (vendor transpiler) → QPU
QUASIEhrenfest → Afana → OpenQASM → HAL Contract → driver → QPU

Key difference: QSA is described as “potentially vendor-portable.” HAL Contract is vendor-portable by construction — there is no code path from Afana to any hardware API. Adding a new vendor requires only a new HAL driver, not compiler changes.

Task Orchestration

IBM QCSCQUASI
SchedulerSlurm + QRMI pluginActivityPub (federated)
ModelCentralised job queueDecentralised activity stream
LifecycleSlurm job statesquasi:Proposequasi:Claimquasi:Complete
Multi-siteSlurm federation (complex)ActivityPub federation (native)
AuditSlurm accounting DBImmutable ActivityPub ledger

Key difference: QCSC bolts QPUs onto Slurm — a centralised scheduler designed for homogeneous compute nodes. QUASI uses ActivityPub, a W3C-standard federation protocol that scales naturally across institutional boundaries. For the multi-institution quantum HPC that QCSC Phase 3 envisions, federated orchestration is architecturally simpler than Slurm federation.

Workflow Representation

IBM QCSCQUASI
AbstractionTensor Compute Graph (TCG)Ehrenfest program (CBOR)
ScopeFull hybrid workflow (quantum + classical)Physics-level problem specification
GranularityOperation-level DAGHamiltonians, observables, noise constraints

Key difference: TCG describes how to compute (operation-level DAG). Ehrenfest describes what to compute (physics-level). Afana derives the circuit from the physics specification. The same Ehrenfest program compiles to different circuits for different backends without user intervention.

Error Handling

IBM QCSCQUASI
StrategyRuntime error mitigationCompile-time noise rejection + optional runtime mitigation
TechniquesTEM, Pauli propagation, hierarchical QECType-level noise budget in Ehrenfest spec
QECInner codes (FPGA) + outer codes (GPU)Delegated to HAL drivers

IBM’s hierarchical QEC design (inner FPGA decoding + outer GPU decoding) is well-engineered. QUASI currently delegates QEC entirely to HAL drivers, which is architecturally correct but means the HAL Contract API may need to surface QEC metadata as backends mature.

Where IBM Is Ahead

Coupling modes. IBM defines three coupling modes with clear latency requirements: batch (loose), near-time (iterative VQE/SQD), and real-time (microsecond QEC feedback). QUASI’s HAL Contract currently supports only batch mode. Near-time and real-time coupling are not yet addressed.

GPU co-processing. QCSC treats GPUs as first-class compute alongside QPUs. Tensor network error mitigation and outer QEC decoding run on co-located GPUs. QUASI has no explicit GPU compute layer.

Calibration integration. IBM stresses that real-time calibration data must flow into the compiler for optimal qubit selection. QUASI correctly places calibration in HAL drivers, but does not yet expose calibration metadata to inform compilation decisions.

Where QUASI Is Ahead

Vendor neutrality (enforced, not aspirational). IBM’s QSA is “potentially vendor-portable.” QUASI’s HAL Contract is the only path to hardware, enforced by CI. There is no way to bypass it.

Compile-time noise rejection. If a program’s noise budget is infeasible for the target backend, Afana refuses to emit QASM. This eliminates wasted QPU time on circuits that would produce meaningless results.

Dependency-free deployment. Afana is a single Rust binary. Deploying it to an HPC node requires copying one file. IBM’s stack requires Python, Qiskit, NumPy, SciPy, and vendor-specific transpiler plugins.

Federated task management. ActivityPub is a W3C standard with mature implementations. Slurm federation is complex, fragile, and designed for tightly-coupled clusters.

Physics-level abstraction. Ehrenfest programs describe physics, not circuits. The compiler derives the optimal circuit for each backend. IBM’s TCG operates at the circuit level.

Assessment

This paper validates the problem space QUASI operates in. IBM’s conclusion — that quantum computing needs a real systems architecture, not ad-hoc scripting — aligns exactly with QUASI’s thesis.

However, IBM’s solution is shaped by IBM’s constraints: Qiskit lock-in, centralised scheduling that reflects their HPC business, runtime mitigation that compensates for the noisy hardware they ship today, and vendor neutrality deferred to future phases.

QUASI’s architecture already enforces what QCSC only proposes.

The structural decisions — Rust compiler, HAL Contract boundary, ActivityPub orchestration, compile-time noise rejection — are not incremental improvements. They are fundamentally different design choices that become more valuable as quantum computing scales beyond single-vendor, single-site deployments.

Source: Seelam et al., “Reference Architecture of a Quantum-Centric Supercomputer,” arXiv:2603.10970, March 2026.


Comparison

QUASI vs NVIDIA CUDA-Q

CUDA-Q (formerly CUDA Quantum) is NVIDIA’s open-source platform for hybrid quantum-classical computing. It provides a compiler toolchain (nvq++), GPU-accelerated simulation via cuQuantum, and a vendor-neutral backend abstraction. Apache 2.0 licensed.

The architecture is monolithic-compiler-centric: an MLIR-based pipeline (Quake dialect → QIR → target code) is the central element. There is no separate orchestration service or workflow engine — task scheduling is implicit through async execution primitives.

Compilation

CUDA-QQUASI
LanguageC++ (__qpu__) / Python (@cudaq.kernel)Rust (Afana)
IRQuake (MLIR) → QIR (LLVM)Ehrenfest CBOR binary
OutputTarget-specific native codeOpenQASM 2.0 / 3.0
NoiseCompile-time type safety (non-copyable qubits); noise as simulation onlyCompile-time rejection (type-level noise constraints)
DependenciesLLVM/MLIR, cuQuantum, CUDA toolkit, Pythonciborium, quizx, serde, clap — no Python, no LLVM
Deploymentpip / conda / Docker / C++ installerSingle static Rust binary

Key difference: CUDA-Q compiles at the circuit level — kernels describe gate sequences. Ehrenfest compiles at the physics level — programs describe Hamiltonians and observables. CUDA-Q users must understand hardware-specific circuit construction; Ehrenfest users specify physics and let Afana derive the circuit.

Key difference: CUDA-Q’s Quake dialect enforces quantum type safety (non-copyable qubits, value semantics) — a genuine compile-time guarantee. But it does not reject circuits based on noise feasibility. Afana goes further: if the noise budget is infeasible, compilation fails.

Hardware Abstraction

CUDA-QQUASI
Abstractionquantum_platform classHAL Contract
InterfaceC++ API with integer QPU IDsPOST /hal/jobs (REST)
NeutralityVendor-neutral (~75% of public QPUs)Vendor-neutral, enforced by CI
EnforcementRuntime target selectionCompile-time boundary — no hardware API in compiler

Key difference: Both achieve vendor neutrality, but through different mechanisms. CUDA-Q uses a monolithic compiler that emits target-specific code internally. QUASI enforces a hard architectural boundary — the compiler produces vendor-neutral OpenQASM; HAL drivers translate to hardware. Adding a backend to CUDA-Q means modifying the compiler pipeline. Adding a backend to QUASI means writing a HAL driver.

Task Orchestration

CUDA-QQUASI
ModelAsync execution primitives (sample_async, observe_async)ActivityPub federation
Multi-QPUInteger QPU IDs, mqpu modeFederated activity stream
Multi-siteMPI for multi-node GPU; no native multi-site QPUActivityPub federation (native)
HPCDeep — CUDA, OpenMP, OpenACC, MPIMinimal — classical compute outside scope

Key difference: CUDA-Q has no orchestration layer — task dispatch is implicit through function calls and async primitives. This works for single-site GPU clusters but has no answer for multi-institution quantum computing. QUASI’s ActivityPub layer is designed for exactly this scenario.

GPU Integration

CUDA-QQUASI
SimulationcuQuantum: state vector, tensor network, MPS — multi-GPU, multi-nodeNo GPU simulation layer
Classical co-processingNative CUDA/OpenMP/OpenACC for host codeOutside scope
PerformanceUp to 180x over CPU simulators; 300x+ multi-GPUN/A

GPU integration is CUDA-Q’s strongest differentiator and QUASI’s largest gap. QUASI has no GPU compute layer. For workloads that require large-scale simulation (algorithm development, error mitigation research), CUDA-Q is currently without competition. QUASI’s HAL drivers could in principle delegate to cuQuantum-based simulators, but this integration does not exist today.

Assessment

CUDA-Q is the most engineering-mature platform in this comparison. The MLIR-based compiler, cuQuantum simulation stack, and broad hardware support represent years of investment by the world’s largest GPU company.

The architectural difference is structural: CUDA-Q is a compiler platform that treats quantum kernels as functions within classical GPU programs. QUASI is an operating system that treats the QPU as the central resource and organises everything else around it.

CUDA-Q asks: how do we add QPUs to our GPU platform?
QUASI asks: how do we build a system that serves the QPU?

These are complementary, not competing questions. CUDA-Q’s GPU simulation and classical co-processing are capabilities QUASI will need. QUASI’s physics-level abstraction, compile-time noise rejection, and federated orchestration are capabilities CUDA-Q does not attempt.


Comparison

QUASI vs Pasqal

Pasqal builds neutral-atom quantum computers and an open-source software stack centred on Pulser (pulse-level control) and Qadence (digital-analog quantum programming). The stack is Python-based and deliberately hardware-near.

Pasqal’s architecture is unique in this comparison: it is the only stack that treats analog quantum simulation — continuous Hamiltonian evolution — as a first-class computational paradigm alongside digital gates.

Programming Model

PasqalQUASI
ParadigmsAnalog, digital, digital-analog (DAQC)Physics-level specification (compiled to gates)
AbstractionPulse-level (Pulser) / block-level (Qadence)Hamiltonian-level (Ehrenfest)
LanguagePythonCBOR binary (Ehrenfest), compiled by Rust (Afana)
Hardware exposureDeliberate — Device objects encode exact QPU constraintsHidden — HAL Contract abstracts hardware away

Key difference: Pasqal and QUASI share a physics-first intuition — both believe quantum programs should describe physics, not gate sequences. But they draw opposite conclusions. Pasqal exposes the hardware physics (atom positions, laser parameters, Rydberg interactions) so users can exploit them. QUASI hides the hardware physics behind HAL Contract so programs remain portable.

Compilation

PasqalQUASI
PipelinePulser: minimal — programs are already hardware-level. Qadence 2: IR-based (in development)Ehrenfest → Afana → OpenQASM → HAL Contract
OutputPulse sequences (JSON-serialised)OpenQASM 2.0 / 3.0
NoiseNo compile-time noise handlingCompile-time noise budget rejection
DependenciesPython, NumPy, Pulser, PyTorch (Qadence)Zero Python dependencies

Key difference: Pulser programs are already at hardware level — there is almost no compilation in the traditional sense. This is maximally efficient for Pasqal hardware but means programs are not portable. QUASI’s multi-stage compilation pipeline adds overhead but produces vendor-neutral output.

Hardware Abstraction

PasqalQUASI
ApproachHardware-near — programs are written for a specific deviceHardware-agnostic — programs compiled to any device
Vendor scopePasqal hardware onlyAny HAL Contract-compatible backend
AbstractionDevice objects with exact physical constraintsHAL Contract REST API

Key difference: Pasqal deliberately does not abstract away the hardware. This is a valid design choice for a hardware company — their competitive advantage is in the physics of neutral atoms, and exposing it lets users exploit features like native multi-qubit gates and arbitrary 2D/3D atom topologies. QUASI’s choice is the opposite: portability over hardware-specific optimisation.

Orchestration

PasqalQUASI
SchedulerCloud SDK (job submission) / SLURM (HPC)ActivityPub (federated)
Cloud accessPasqal Cloud, Azure Quantum, Google Cloud, OVHcloudSelf-hosted, federated
HPCSLURM-integrated (GENCI/CEA, Jülich)ActivityPub federation

Assessment

Pasqal occupies a unique position: a hardware company with a genuinely open software stack and a physics-first programming model. The analog/DAQC paradigm is architecturally interesting because it sidesteps the gate-model abstraction entirely — the Hamiltonian is the program.

Pasqal and QUASI share the deepest intuition: quantum programs should describe physics.

They diverge on the consequence. Pasqal says: expose the specific physics of our hardware. QUASI says: describe the physics abstractly and let the compiler map it to any hardware. If neutral-atom QPUs win, Pasqal’s approach optimises harder. If the ecosystem fragments across modalities, QUASI’s abstraction pays off.

A HAL Contract driver for Pasqal hardware — translating Ehrenfest physics specifications to Pulser sequences — would be a natural integration point. The physics-to-physics mapping (Ehrenfest Hamiltonians → Rydberg Hamiltonians) is closer than the physics-to-gates mapping most backends require.


Comparison

QUASI vs QOS / Qernel

QOS is a quantum operating system built at TU Munich (TUM-DSE group), published at USENIX OSDI 2025. Its central abstraction is the “Qernel” — a DAG-based intermediate representation for quantum circuits with static and dynamic properties.

QOS is the most directly comparable system to QUASI: both position themselves as OS-level abstractions above circuit SDKs. But the approaches are fundamentally different.

Architecture

QOS / QernelQUASI
LevelOS layer on top of QiskitFull-stack OS with own language and compiler
FoundationBuilt on Qiskit / PythonBuilt from scratch in Rust
IRQernel IR (DAG of gates)Ehrenfest (CBOR binary, physics-level)
ScopeMulti-job resource managementFull compilation + orchestration + execution

Key difference: QOS optimises within the existing Qiskit ecosystem. QUASI builds a new ecosystem. QOS accepts Qiskit circuits as input and makes them run better on IBM hardware. QUASI accepts physics specifications and compiles them to any hardware.

Hardware Scope

QOS / QernelQUASI
BackendsIBM only (27-qubit Falcon QPUs)Any HAL Contract-compatible backend (9 today)
Validation7,000+ real quantum runs, 70,000+ benchmark instancesHAL Contract tested across 9 backends
Vendor lock-inComplete (Qiskit + IBM hardware)None (enforced by CI)

Key Capabilities

QOS / QernelQUASI
Multi-programmingNative — co-locates circuits on same QPUNot addressed
SchedulingFidelity vs. latency optimisationActivityPub task claiming
Circuit cuttingGate cutting, wire cutting, qubit reuseNot addressed
Fidelity estimationML-based, up to 99% prediction accuracyCompile-time noise budget
Noise handlingRuntime mitigation (cutting, freezing, scheduling)Compile-time rejection

Key difference: QOS excels at NISQ-era optimisation — making noisy circuits run better on noisy hardware through cutting, multi-programming, and fidelity-aware scheduling. QUASI takes the opposite approach: reject infeasible programs at compile time rather than optimising them at runtime. As hardware improves, compile-time rejection becomes more valuable; as hardware remains noisy, QOS’s runtime optimisation is more practical.

Where QOS Is Ahead

Real-hardware validation. QOS has been validated with 7,000+ real quantum runs on IBM hardware, demonstrating 2.6–456.5x higher fidelity and up to 9.6x better resource utilisation. QUASI’s architecture has not been validated at this scale on real QPUs.

Multi-programming. Co-locating compatible circuits on the same QPU is a practical capability for cloud-scale quantum computing. QUASI does not address this.

Peer review. Published at OSDI 2025, one of the top systems conferences. This is a level of academic validation QUASI does not yet have.

Where QUASI Is Ahead

Hardware independence. QOS is locked to IBM/Qiskit. QUASI’s HAL Contract supports 9 backends today with no vendor dependency in the compiler.

Own language and compiler. QOS accepts Qiskit circuits — it cannot change the input abstraction. QUASI controls the full stack from physics specification to hardware execution.

Production readiness. QOS is a research prototype (11 commits on GitHub). QUASI’s HAL Contract implementation (Arvak) is at v1.8.1 with 9 production backends.

Federated orchestration. QOS has no multi-site capability. QUASI’s ActivityPub-based orchestration is designed for cross-institutional quantum computing.

Assessment

QOS and QUASI answer different questions. QOS asks: given the hardware and tools we have today, how do we extract maximum value? QUASI asks: what should the system look like if we design it from scratch?

QOS optimises the present. QUASI designs for the future.

QOS’s NISQ-era optimisations (circuit cutting, multi-programming, fidelity-aware scheduling) are genuinely valuable today. But they are workarounds for noisy hardware — they become less relevant as error-corrected machines arrive. QUASI’s architectural decisions (physics-level abstraction, vendor neutrality, compile-time guarantees) become more relevant as the ecosystem matures.

Source: Giortamis et al., “QOS: A Quantum Operating System,” USENIX OSDI 2025. arXiv:2406.19120.