CompiledModule¶

class qlip.compiler.module.CompiledModule(model, *, builder_config, **kwargs)¶

Compiled module class for compilation.

Parameters

model (torch.nn.Module) – Model to compile.
builder_config (BuilderConfig) – Builder configuration.
**kwargs (dict) – Additional keyword arguments, see CompiledModule for more details.

Variables

compile(device='cuda', original_device='meta', **kwargs)¶

Compile model.

Parameters

device (str) – Device to compile model, by default “cuda”
original_device (str) – Original device of the compiled model, by default “meta”
**kwargs (dict) – Additional keyword arguments, backend specific.

collect_shapes(value=True)¶

Toggle shape collection mode.

Parameters

collect_inputs(value=True)¶

Toggle input collection mode.

Parameters

set_skip_n(n)¶

Set number of forward passes to skip during collection (shapes or inputs).

Parameters

flush_shapes(type='static', opt='mode')¶

Flush collected shapes to static or dynamic axes profiles.

Parameters

type (str) – Type of shapes to flush. Can be “static” or “dynamic”.
opt (str) – Optimal shape for dynamic axes profile. Can be “mode”, “min” or “max”.

forward(*args, **kw)¶

Forward pass.

Run compiled model if available, otherwise collect shapes.

export_mode()¶

Toggle export mode for model.

Used by ~qlip.compiler.base.BaseExporter.export methods.

extra_repr()¶: extra_repr is used by torch.nn.Module to print the model structure.

load(engine_path, device='cuda', original_device='meta', **kwargs)¶

Load session into compiled module.

Parameters

set_inference_config(config)¶

Set inference configuration for lazy initialization before loading engine.

Parameters