QML tools

ML Tools

This module implements gradient-free and gradient-based training loops for torch Modules and QuantumModel.

TrainConfig dataclass

Default config for the train function. The default value of each field can be customized via the constructor:

from qadence.ml_tools import TrainConfig
c = TrainConfig(folder="/tmp/train")
TrainConfig(max_iter=10000, print_every=1000, write_every=50, checkpoint_every=5000, folder=PosixPath('/tmp/train'), create_subfolder_per_run=False, checkpoint_best_only=False, validation_criterion=<function TrainConfig.__post_init__.<locals>.<lambda> at 0x28afb7700>, trainstop_criterion=<function TrainConfig.__post_init__.<locals>.<lambda> at 0x28afb7430>, batch_size=1)

batch_size: int = 1 class-attribute instance-attribute

The batch_size to use when passing a list/tuple of torch.Tensors.

checkpoint_best_only: bool = False class-attribute instance-attribute

Write model/optimizer checkpoints only if a metric has improved.

checkpoint_every: int = 5000 class-attribute instance-attribute

Write a model/optimizer checkpoint every checkpoint_every iterations.

create_subfolder_per_run: bool = False class-attribute instance-attribute

Store checkpoint/tensorboard logs in a subfolder named <timestamp>_<PID>. This prevents resuming from a previous checkpoint; useful for fast prototyping.

folder: Optional[Path] = None class-attribute instance-attribute

Folder where checkpoints and tensorboard logs are stored.

max_iter: int = 10000 class-attribute instance-attribute

Number of training iterations.

print_every: int = 1000 class-attribute instance-attribute

Print loss/metrics every print_every iterations.

trainstop_criterion: Optional[Callable] = None class-attribute instance-attribute

A boolean function which evaluates whether a given training-stop criterion is satisfied.

validation_criterion: Optional[Callable] = None class-attribute instance-attribute

A boolean function which evaluates whether a given validation metric is satisfied.

write_every: int = 50 class-attribute instance-attribute

Write tensorboard logs every write_every iterations.
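
For example, a run that checkpoints only on improvement and keeps each run in its own subfolder can be configured by overriding the fields above (the concrete values below are illustrative, not defaults):

from qadence.ml_tools import TrainConfig

config = TrainConfig(
    folder="/tmp/train",
    max_iter=2000,                  # number of training iterations
    print_every=200,                # print loss/metrics every 200 iterations
    write_every=100,                # write tensorboard logs every 100 iterations
    checkpoint_every=500,           # write model/optimizer checkpoints every 500 iterations
    checkpoint_best_only=True,      # checkpoint only when the tracked metric improves
    create_subfolder_per_run=True,  # store logs in a <timestamp>_<PID> subfolder
    batch_size=16,
)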

get_parameters(model)

Retrieve all trainable model parameters in a single vector

PARAMETER DESCRIPTION
model

the input PyTorch model

TYPE: Module

RETURNS DESCRIPTION
Tensor

a 1-dimensional tensor with the parameters

TYPE: Tensor

Source code in qadence/ml_tools/parameters.py
def get_parameters(model: Module) -> Tensor:
    """Retrieve all trainable model parameters in a single vector

    Args:
        model (Module): the input PyTorch model

    Returns:
        Tensor: a 1-dimensional tensor with the parameters
    """
    ps = [p.reshape(-1) for p in model.parameters() if p.requires_grad]
    return torch.concat(ps)

num_parameters(model)

Return the total number of parameters of the given model

Source code in qadence/ml_tools/parameters.py
def num_parameters(model: Module) -> int:
    """Return the total number of parameters of the given model"""
    return len(get_parameters(model))
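
As a quick illustration on a plain torch module (importing from the module path shown in the source listings above):

import torch
from qadence.ml_tools.parameters import get_parameters, num_parameters

model = torch.nn.Linear(3, 2)   # 6 weights + 2 biases
theta = get_parameters(model)   # flat 1-dimensional tensor of trainable parameters
print(theta.shape)              # torch.Size([8])
print(num_parameters(model))    # 8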

set_parameters(model, theta)

Set all trainable parameters of a model from a single vector

Note that this function assumes prior knowledge of the correct number of trainable parameters in the model.

PARAMETER DESCRIPTION
model

the input PyTorch model

TYPE: Module

theta

the parameters to assign

TYPE: Tensor

Source code in qadence/ml_tools/parameters.py
def set_parameters(model: Module, theta: Tensor) -> None:
    """Set all trainable parameters of a model from a single vector

    Notice that this function assumes prior knowledge of right number
    of parameters in the model

    Args:
        model (Module): the input PyTorch model
        theta (Tensor): the parameters to assign
    """

    with torch.no_grad():
        idx = 0
        for ps in model.parameters():
            if ps.requires_grad:
                n = torch.numel(ps)
                if ps.ndim == 0:
                    ps[()] = theta[idx : idx + n]
                else:
                    ps[:] = theta[idx : idx + n].reshape(ps.size())
                idx += n
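
This is how the gradient-free training loop below pushes candidate parameter vectors into a model. A minimal sketch, again on a plain torch module:

import torch
from qadence.ml_tools.parameters import get_parameters, set_parameters

model = torch.nn.Linear(3, 2)
n = get_parameters(model).numel()      # number of trainable parameters the model expects
set_parameters(model, torch.zeros(n))  # overwrite all trainable parameters from a flat vector
print(get_parameters(model))           # tensor of 8 zeros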

data_to_model(xs, device='cpu')

Default behavior of the single-dispatched function.

Simply return the given data unchanged, regardless of its type.

PARAMETER DESCRIPTION
xs

the input data

TYPE: Any

device

The torch device. Not used in this implementation.

TYPE: str DEFAULT: 'cpu'

RETURNS DESCRIPTION
Any

the xs argument untouched

TYPE: Any

Source code in qadence/ml_tools/optimize_step.py
@singledispatch
def data_to_model(xs: Any, device: str = "cpu") -> Any:
    """Default behavior for single-dispatched function

    Just return the given data independently on the type

    Args:
        xs (Any): the input data
        device (str, optional): The torch device. Not used in this implementation.

    Returns:
        Any: the `xs` argument untouched
    """
    return xs
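
Being a single-dispatched function, this default can be extended with type-specific overloads (for example, moving a dict of tensors to a device). The snippet below is a self-contained sketch of that pattern only; it does not reproduce qadence's actual registrations:

from functools import singledispatch

import torch

@singledispatch
def to_device(xs, device: str = "cpu"):
    # default branch: return the data untouched, whatever its type
    return xs

@to_device.register
def _(xs: dict, device: str = "cpu") -> dict:
    # dict branch: move every tensor in the dict to the target device
    return {key: value.to(device) for key, value in xs.items()}

print(to_device(None))                    # hits the default branch
print(to_device({"phi": torch.rand(2)}))  # hits the dict branch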

optimize_step(model, optimizer, loss_fn, xs, device='cpu')

Default Torch optimization step with closure.

This is the default optimization step, which should work for most standard use cases of optimizing Torch models.

PARAMETER DESCRIPTION
model

The input model

TYPE: Module

optimizer

The chosen Torch optimizer

TYPE: Optimizer

loss_fn

A custom loss function

TYPE: Callable

xs

The input data. If None, the given model does not require any input data.

TYPE: dict | list | Tensor | None

device

The device where computations are executed. Defaults to "cpu".

TYPE: str DEFAULT: 'cpu'

RETURNS DESCRIPTION
tuple

Tuple containing the computed loss value and a dictionary with the collected metrics.

TYPE: tuple[Tensor | float, dict | None]

Source code in qadence/ml_tools/optimize_step.py
def optimize_step(
    model: Module,
    optimizer: Optimizer,
    loss_fn: Callable,
    xs: dict | list | torch.Tensor | None,
    device: str = "cpu",
) -> tuple[torch.Tensor | float, dict | None]:
    """Default Torch optimize step with closure

    This is the default optimization step which should work for most
    of the standard use cases of optimization of Torch models

    Args:
        model (Module): The input model
        optimizer (Optimizer): The chosen Torch optimizer
        loss_fn (Callable): A custom loss function
        xs (dict | list | torch.Tensor | None): the input data. If None it means
            that the given model does not require any input data
        device (str, optional): The device where computations are executed.
            Defaults to "cpu".

    Returns:
        tuple: tuple containing the computed loss value and a dictionary with
            the collected metrics
    """

    loss, metrics = None, {}

    def closure() -> Any:
        # NOTE: We need the nonlocal as we can't return a metric dict and
        # because e.g. LBFGS calls this closure multiple times but for some
        # reason the returned loss is always the first one...
        nonlocal metrics, loss
        optimizer.zero_grad()
        loss, metrics = loss_fn(model, xs)
        loss.backward(retain_graph=True)
        return loss.item()

    optimizer.step(closure)
    # return the loss/metrics that are being mutated inside the closure...
    return loss, metrics
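
A minimal sketch of calling optimize_step directly, assuming it is importable from the module path shown in the source listing above (a plain torch model stands in for a quantum model to keep the example self-contained):

import torch

from qadence.ml_tools.optimize_step import optimize_step

model = torch.nn.Linear(1, 1)
optimizer = torch.optim.Adam(model.parameters(), lr=0.1)

def loss_fn(model: torch.nn.Module, xs: tuple) -> tuple[torch.Tensor, dict]:
    x, y = xs
    loss = torch.nn.functional.mse_loss(model(x), y)
    return loss, {"mse": loss.item()}

x = torch.linspace(0, 1, 16).reshape(-1, 1)
y = torch.sin(x)
loss, metrics = optimize_step(model, optimizer, loss_fn, (x, y))
print(loss, metrics)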

train(model, dataloader, optimizer, config, loss_fn, device='cpu', optimize_step=optimize_step, write_tensorboard=write_tensorboard)

Runs the training loop with a gradient-based optimizer

Assumes that loss_fn returns a tuple of (loss, metrics: dict), where metrics is a dict of scalars. Loss and metrics are written to tensorboard. Checkpoints are written every config.checkpoint_every steps (and after the last training step). If a checkpoint is found at config.folder we resume training from there. The tensorboard logs can be viewed via tensorboard --logdir /path/to/folder.

PARAMETER DESCRIPTION
model

The model to train.

TYPE: Module

dataloader

dataloader of different types. If None, no data is required by the model

TYPE: DictDataLoader | DataLoader | list[Tensor] | tuple[Tensor, Tensor] | None

optimizer

The optimizer to use.

TYPE: Optimizer

config

TrainConfig with additional training options.

TYPE: TrainConfig

loss_fn

Loss function returning (loss: float, metrics: dict[str, float])

TYPE: Callable

device

String defining device to train on, pass 'cuda' for GPU.

TYPE: str DEFAULT: 'cpu'

optimize_step

Customizable optimization callback which is called at every iteration. The function must have the signature optimize_step(model, optimizer, loss_fn, xs, device="cpu") (see the example below). Apart from the default, we already supply three other optimization functions: optimize_step_evo, optimize_step_grad_norm, and optimize_step_inv_dirichlet. Learn more about how to use these in the Advanced features tutorial of the documentation.

TYPE: Callable DEFAULT: optimize_step

write_tensorboard

Customizable tensorboard logging callback which is called every config.write_every iterations. The function must have the signature write_tensorboard(writer, loss, metrics, iteration) (see the example below).

TYPE: Callable DEFAULT: write_tensorboard

Example:

from pathlib import Path
import torch
from itertools import count
from qadence.constructors import hamiltonian_factory, hea, feature_map
from qadence import chain, Parameter, QuantumCircuit, Z
from qadence.models import QNN
from qadence.ml_tools import train_with_grad, TrainConfig

n_qubits = 2
fm = feature_map(n_qubits)
ansatz = hea(n_qubits=n_qubits, depth=3)
observable = hamiltonian_factory(n_qubits, detuning = Z)
circuit = QuantumCircuit(n_qubits, fm, ansatz)

model = QNN(circuit, observable, backend="pyqtorch", diff_mode="ad")
batch_size = 1
input_values = {"phi": torch.rand(batch_size, requires_grad=True)}
pred = model(input_values)

## lets prepare the train routine

cnt = count()
criterion = torch.nn.MSELoss()
optimizer = torch.optim.Adam(model.parameters(), lr=0.1)

def loss_fn(model: torch.nn.Module, data: torch.Tensor) -> tuple[torch.Tensor, dict]:
    next(cnt)
    x, y = data[0], data[1]
    out = model(x)
    loss = criterion(out, y)
    return loss, {}
tmp_path = Path("/tmp")
n_epochs = 5
config = TrainConfig(
    folder=tmp_path,
    max_iter=n_epochs,
    checkpoint_every=100,
    write_every=100,
    batch_size=batch_size,
)
batch_size = 25
x = torch.linspace(0, 1, batch_size).reshape(-1, 1)
y = torch.sin(x)
train_with_grad(model, (x, y), optimizer, config, loss_fn=loss_fn)

Source code in qadence/ml_tools/train_grad.py
def train(
    model: Module,
    dataloader: DictDataLoader | DataLoader | list[Tensor] | tuple[Tensor, Tensor] | None,
    optimizer: Optimizer,
    config: TrainConfig,
    loss_fn: Callable,
    device: str = "cpu",
    optimize_step: Callable = optimize_step,
    write_tensorboard: Callable = write_tensorboard,
) -> tuple[Module, Optimizer]:
    """Runs the training loop with gradient-based optimizer

    Assumes that `loss_fn` returns a tuple of (loss,
    metrics: dict), where `metrics` is a dict of scalars. Loss and metrics are
    written to tensorboard. Checkpoints are written every
    `config.checkpoint_every` steps (and after the last training step).  If a
    checkpoint is found at `config.folder` we resume training from there.  The
    tensorboard logs can be viewed via `tensorboard --logdir /path/to/folder`.

    Args:
        model: The model to train.
        dataloader: dataloader of different types. If None, no data is required by
            the model
        optimizer: The optimizer to use.
        config: `TrainConfig` with additional training options.
        loss_fn: Loss function returning (loss: float, metrics: dict[str, float])
        device: String defining device to train on, pass 'cuda' for GPU.
        optimize_step: Customizable optimization callback which is called at every iteration.
            The function must have the signature `optimize_step(model,
            optimizer, loss_fn, xs, device="cpu")` (see the example below).
            Apart from the default we already supply three other optimization
            functions `optimize_step_evo`, `optimize_step_grad_norm`, and
            `optimize_step_inv_dirichlet`. Learn more about how to use this in
            the [Advanced features](../../tutorials/advanced) tutorial of the
            documentation.
        write_tensorboard: Customizable tensorboard logging callback which is
            called every `config.write_every` iterations. The function must have
            the signature `write_tensorboard(writer, loss, metrics, iteration)`
            (see the example below).

    Example:
    ```python exec="on" source="material-block"
    from pathlib import Path
    import torch
    from itertools import count
    from qadence.constructors import hamiltonian_factory, hea, feature_map
    from qadence import chain, Parameter, QuantumCircuit, Z
    from qadence.models import QNN
    from qadence.ml_tools import train_with_grad, TrainConfig

    n_qubits = 2
    fm = feature_map(n_qubits)
    ansatz = hea(n_qubits=n_qubits, depth=3)
    observable = hamiltonian_factory(n_qubits, detuning = Z)
    circuit = QuantumCircuit(n_qubits, fm, ansatz)

    model = QNN(circuit, observable, backend="pyqtorch", diff_mode="ad")
    batch_size = 1
    input_values = {"phi": torch.rand(batch_size, requires_grad=True)}
    pred = model(input_values)

    ## lets prepare the train routine

    cnt = count()
    criterion = torch.nn.MSELoss()
    optimizer = torch.optim.Adam(model.parameters(), lr=0.1)

    def loss_fn(model: torch.nn.Module, data: torch.Tensor) -> tuple[torch.Tensor, dict]:
        next(cnt)
        x, y = data[0], data[1]
        out = model(x)
        loss = criterion(out, y)
        return loss, {}
    tmp_path = Path("/tmp")
    n_epochs = 5
    config = TrainConfig(
        folder=tmp_path,
        max_iter=n_epochs,
        checkpoint_every=100,
        write_every=100,
        batch_size=batch_size,
    )
    batch_size = 25
    x = torch.linspace(0, 1, batch_size).reshape(-1, 1)
    y = torch.sin(x)
    train_with_grad(model, (x, y), optimizer, config, loss_fn=loss_fn)
    ```
    """

    assert loss_fn is not None, "Provide a valid loss function"

    # Move model to device before optimizer is loaded
    model = model.to(device)

    # load available checkpoint
    init_iter = 0
    if config.folder:
        model, optimizer, init_iter = load_checkpoint(config.folder, model, optimizer)
        logger.debug(f"Loaded model and optimizer from {config.folder}")
    # initialize tensorboard
    writer = SummaryWriter(config.folder, purge_step=init_iter)

    ## Training
    progress = Progress(
        TextColumn("[progress.description]{task.description}"),
        BarColumn(),
        TaskProgressColumn(),
        TimeRemainingColumn(elapsed_when_finished=True),
    )
    if isinstance(dataloader, (list, tuple)):
        from qadence.ml_tools.data import to_dataloader

        assert len(dataloader) == 2, "Please provide exactly two torch tensors."
        x, y = dataloader
        dataloader = to_dataloader(x=x, y=y, batch_size=config.batch_size)
    with progress:
        dl_iter = iter(dataloader) if isinstance(dataloader, DictDataLoader) else None

        # outer epoch loop
        for iteration in progress.track(range(init_iter, init_iter + config.max_iter)):
            try:
                # in case there is not data needed by the model
                # this is the case, for example, of quantum models
                # which do not have classical input data (e.g. chemistry)
                if dataloader is None:
                    loss, metrics = optimize_step(
                        model, optimizer, loss_fn, dataloader, device=device
                    )
                    loss = loss.item()

                # single epoch with DictDataloader using a single iteration method
                # DictDataloader returns a single sample of the data
                # with a given batch size decided when the dataloader is defined
                elif isinstance(dataloader, DictDataLoader):
                    # resample all the time from the dataloader
                    # by creating a fresh iterator if the dataloader
                    # does not support automatically iterating datasets
                    if not dataloader.has_automatic_iter:
                        dl_iter = iter(dataloader)
                    data = next(dl_iter)  # type: ignore[arg-type]
                    loss, metrics = optimize_step(model, optimizer, loss_fn, data, device=device)

                elif isinstance(dataloader, DataLoader):
                    # single-epoch with standard DataLoader
                    # otherwise a standard PyTorch DataLoader behavior
                    # is assumed with optional mini-batches
                    running_loss = 0.0
                    for i, data in enumerate(dataloader):
                        # TODO: make sure to average metrics as well
                        loss, metrics = optimize_step(
                            model, optimizer, loss_fn, data, device=device
                        )
                        running_loss += loss.item()
                    loss = running_loss / (i + 1)

                else:
                    raise NotImplementedError("Unsupported dataloader type!")

                if iteration % config.print_every == 0:
                    print_metrics(loss, metrics, iteration)

                if iteration % config.write_every == 0:
                    write_tensorboard(writer, loss, metrics, iteration)

                if config.folder:
                    if iteration % config.checkpoint_every == 0:
                        write_checkpoint(config.folder, model, optimizer, iteration)

            except KeyboardInterrupt:
                print("Terminating training gracefully after the current iteration.")
                break

    # Final writing and checkpointing
    if config.folder:
        write_checkpoint(config.folder, model, optimizer, iteration)
    write_tensorboard(writer, loss, metrics, iteration)
    writer.close()

    return model, optimizer
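
Both the optimize_step and write_tensorboard callbacks can be swapped out. Below is a minimal sketch of a custom tensorboard callback with the required write_tensorboard(writer, loss, metrics, iteration) signature; the scalar tag names are illustrative, not qadence defaults:

from torch.utils.tensorboard import SummaryWriter

def my_write_tensorboard(writer: SummaryWriter, loss: float, metrics: dict, iteration: int) -> None:
    # log the loss and every collected metric as scalars
    writer.add_scalar("loss", loss, iteration)
    for name, value in metrics.items():
        writer.add_scalar(name, value, iteration)

# then pass it to the training loop, e.g.
# train_with_grad(model, (x, y), optimizer, config, loss_fn=loss_fn,
#                 write_tensorboard=my_write_tensorboard)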

train(model, dataloader, optimizer, config, loss_fn)

Runs the training loop with a gradient-free optimizer

Assumes that loss_fn returns a tuple of (loss, metrics: dict), where metrics is a dict of scalars. Loss and metrics are written to tensorboard. Checkpoints are written every config.checkpoint_every steps (and after the last training step). If a checkpoint is found at config.folder we resume training from there. The tensorboard logs can be viewed via tensorboard --logdir /path/to/folder.

PARAMETER DESCRIPTION
model

The model to train

TYPE: Module

dataloader

Dataloader constructed via dictdataloader

TYPE: DictDataLoader | DataLoader | None

optimizer

The optimizer to use, taken from the Nevergrad library. If this is not the case, the function will raise an AssertionError.

TYPE: Optimizer

loss_fn

Loss function returning (loss: float, metrics: dict[str, float])

TYPE: Callable

Source code in qadence/ml_tools/train_no_grad.py
def train(
    model: Module,
    dataloader: DictDataLoader | DataLoader | None,
    optimizer: NGOptimizer,
    config: TrainConfig,
    loss_fn: Callable,
) -> tuple[Module, NGOptimizer]:
    """Runs the training loop with a gradient-free optimizer

    Assumes that `loss_fn` returns a tuple of (loss, metrics: dict), where
    `metrics` is a dict of scalars. Loss and metrics are written to
    tensorboard. Checkpoints are written every `config.checkpoint_every` steps
    (and after the last training step).  If a checkpoint is found at `config.folder`
    we resume training from there.  The tensorboard logs can be viewed via
    `tensorboard --logdir /path/to/folder`.

    Args:
        model: The model to train
        dataloader: Dataloader constructed via `dictdataloader`
        optimizer: The optimizer to use taken from the Nevergrad library. If this is not
            the case the function will raise an AssertionError
        loss_fn: Loss function returning (loss: float, metrics: dict[str, float])
    """
    init_iter = 0
    if config.folder:
        model, optimizer, init_iter = load_checkpoint(config.folder, model, optimizer)
        logger.debug(f"Loaded model and optimizer from {config.folder}")

    def _update_parameters(
        data: Tensor | None, ng_params: ng.p.Array
    ) -> tuple[float, dict, ng.p.Array]:
        loss, metrics = loss_fn(model, data)  # type: ignore[misc]
        optimizer.tell(ng_params, float(loss))
        ng_params = optimizer.ask()  # type: ignore [assignment]
        params = promote_to_tensor(ng_params.value, requires_grad=False)
        set_parameters(model, params)
        return loss, metrics, ng_params

    assert loss_fn is not None, "Provide a valid loss function"
    # TODO: support also Scipy optimizers
    assert isinstance(optimizer, NGOptimizer), "Use only optimizers from the Nevergrad library"

    # initialize tensorboard
    writer = SummaryWriter(config.folder, purge_step=init_iter)

    # set optimizer configuration and initial parameters
    optimizer.budget = config.max_iter
    optimizer.enable_pickling()

    # TODO: Make it GPU compatible if possible
    params = get_parameters(model).detach().numpy()
    ng_params = ng.p.Array(init=params)

    # serial training
    # TODO: Add a parallelization using the num_workers argument in Nevergrad
    progress = Progress(
        TextColumn("[progress.description]{task.description}"),
        BarColumn(),
        TaskProgressColumn(),
        TimeRemainingColumn(elapsed_when_finished=True),
    )
    with progress:
        dl_iter = iter(dataloader) if isinstance(dataloader, DictDataLoader) else None

        for iteration in progress.track(range(init_iter, init_iter + config.max_iter)):
            if dataloader is None:
                loss, metrics, ng_params = _update_parameters(None, ng_params)

            elif isinstance(dataloader, DictDataLoader):
                # resample all the time from the dataloader
                # by creating a fresh iterator if the dataloader
                # does not support automatically iterating datasets
                if not dataloader.has_automatic_iter:
                    dl_iter = iter(dataloader)

                data = next(dl_iter)  # type: ignore[arg-type]
                loss, metrics, ng_params = _update_parameters(data, ng_params)

            elif isinstance(dataloader, DataLoader):
                # single-epoch with standard DataLoader
                # otherwise a standard PyTorch DataLoader behavior
                # is assumed with optional mini-batches
                running_loss = 0.0
                for i, data in enumerate(dataloader):
                    loss, metrics, ng_params = _update_parameters(data, ng_params)
                    running_loss += loss
                loss = running_loss / (i + 1)

            else:
                raise NotImplementedError("Unsupported dataloader type!")

            if iteration % config.print_every == 0:
                print_metrics(loss, metrics, iteration)

            if iteration % config.write_every == 0:
                write_tensorboard(writer, loss, metrics, iteration)

            if config.folder:
                if iteration % config.checkpoint_every == 0:
                    write_checkpoint(config.folder, model, optimizer, iteration)

            if iteration >= init_iter + config.max_iter:
                break

    ## Final writing and checkpointing
    if config.folder:
        write_checkpoint(config.folder, model, optimizer, iteration)
    write_tensorboard(writer, loss, metrics, iteration)
    writer.close()

    return model, optimizer
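
A minimal end-to-end sketch of the gradient-free loop, assuming a Nevergrad optimizer and importing train from the module path shown in the source listing above (a plain torch model keeps the example short; any Module, including a QuantumModel, works the same way):

import nevergrad as ng
import torch
from torch.utils.data import DataLoader, TensorDataset

from qadence.ml_tools import TrainConfig
from qadence.ml_tools.parameters import num_parameters
from qadence.ml_tools.train_no_grad import train as train_no_grad

model = torch.nn.Linear(1, 1)

def loss_fn(model: torch.nn.Module, data: list) -> tuple[torch.Tensor, dict]:
    x, y = data[0], data[1]
    return torch.nn.functional.mse_loss(model(x), y), {}

x = torch.linspace(0, 1, 25).reshape(-1, 1)
y = torch.sin(x)
dataloader = DataLoader(TensorDataset(x, y), batch_size=25)

config = TrainConfig(max_iter=100)
optimizer = ng.optimizers.NGOpt(parametrization=num_parameters(model), budget=config.max_iter)
train_no_grad(model, dataloader, optimizer, config, loss_fn)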