Getting Started¶

Start here for a practical path from environment setup to a working Diffulex run.

Quickstart: install, run a limited benchmark, call the Python API, and start the HTTP server.
Installation: prepare Python, CUDA, vLLM, checkpoint paths, and documentation build dependencies.

Before running the examples, prepare:

Use DATASET_LIMIT or --dataset-limit for the first run. Increase limits only after model loading and a small generation are successful.