vllm.v1.worker.kv_connector_model_runner_mixin ¶
Define KV connector functionality mixin for model runners.
KVConnectorModelRunnerMixin ¶
Source code in vllm/v1/worker/kv_connector_model_runner_mixin.py
finalize_kv_connector staticmethod ¶
Finalize the KV connector: wait_for_save and clear metadata.
Call after draft model forward when defer_finalize=True was used.