CUDA Graph
CUDA Graph support reduces kernel launch overhead by capturing stable execution paths once and replaying them. Diffulex exposes the controls for the current full-static runner and the standard eager debug switch.
Main Controls
| Key | How to set it | What it does |
|---|---|---|
| | Set | Disables CUDA Graph-style execution paths. |
| | Leave | Enables the full-static runner for supported forward passes. |
Debugging Workflow
Use eager mode while validating a new model, sampler, strategy, or kernel:
python -m diffulex.server ... --enforce-eager
After correctness is stable, remove eager mode and compare one optimization toggle at a time.
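The one-toggle-at-a-time comparison can be planned with a small helper. This is a minimal sketch, not part of Diffulex: `one_toggle_at_a_time` and the toggle names `toggle_a` and `toggle_b` are hypothetical placeholders, so substitute the options your deployment actually uses.

```python
from typing import Dict, Iterator, Tuple


def one_toggle_at_a_time(
    baseline: Dict[str, bool],
) -> Iterator[Tuple[str, Dict[str, bool]]]:
    """Yield (changed_key, config) pairs that differ from the baseline in exactly one flag."""
    for key in baseline:
        variant = dict(baseline)           # copy so the baseline is never mutated
        variant[key] = not baseline[key]   # flip only this one flag
        yield key, variant


# Hypothetical toggle names for illustration only; these are not Diffulex options.
baseline = {"toggle_a": False, "toggle_b": False}

for changed, config in one_toggle_at_a_time(baseline):
    # Run the same workload with `config`, then compare against the baseline run.
    print(changed, config)
```

Because each variant differs from the baseline in a single flag, any change in output or latency can be attributed to that flag alone.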