Using external methods for candidate generation in Ax¶

Out of the box, Ax offers many options for candidate generation, most of which utilize Bayesian optimization algorithms built using BoTorch. For users that want to leverage Ax for experiment orchestration (via AxClient or Scheduler) and other features (e.g., early stopping), while relying on other methods for candidate generation, we introduced ExternalGenerationNode.

A GenerationNode is a building block of a GenerationStrategy. They can be combined together utilize different methods for generating candidates at different stages of an experiment. ExternalGenerationNode exposes a lightweight interface to allow the users to easily integrate their methods into Ax, and use them as standalone or with other GenerationNodes in a GenerationStrategy.

In this tutorial, we will implement a simple generation node using RandomForestRegressor from sklearn, and combine it with Sobol (for initialization) to optimize the Hartmann6 problem.

NOTE: This is for illustration purposes only. We do not recommend using this strategy as it typically does not perform well compared to Ax's default algorithms due to it's overly greedy behavior.

In [1]:

import time
from typing import Any, Dict, List, Optional, Tuple

import numpy as np
from ax.core.base_trial import TrialStatus
from ax.core.data import Data
from ax.core.experiment import Experiment
from ax.core.parameter import RangeParameter
from ax.core.types import TParameterization
from ax.modelbridge.external_generation_node import ExternalGenerationNode
from ax.modelbridge.generation_node import GenerationNode
from ax.modelbridge.generation_strategy import GenerationStrategy
from ax.modelbridge.model_spec import ModelSpec
from ax.modelbridge.registry import Models
from ax.modelbridge.transition_criterion import MaxTrials
from ax.plot.trace import plot_objective_value_vs_trial_index
from ax.service.ax_client import AxClient, ObjectiveProperties
from ax.service.utils.report_utils import exp_to_df
from ax.utils.common.typeutils import checked_cast
from ax.utils.measurement.synthetic_functions import hartmann6
from sklearn.ensemble import RandomForestRegressor


class RandomForestGenerationNode(ExternalGenerationNode):
    """A generation node that uses the RandomForestRegressor
    from sklearn to predict candidate performance and picks the
    next point as the random sample that has the best prediction.

    To leverage external methods for candidate generation, the user must
    create a subclass that implements ``update_generator_state`` and
    ``get_next_candidate`` methods. This can then be provided
    as a node into a ``GenerationStrategy``, either as standalone or as
    part of a larger generation strategy with other generation nodes,
    e.g., with a Sobol node for initialization.
    """

    def __init__(self, num_samples: int, regressor_options: Dict[str, Any]) -> None:
        """Initialize the generation node.

        Args:
            regressor_options: Options to pass to the random forest regressor.
            num_samples: Number of random samples from the search space
                used during candidate generation. The sample with the best
                prediction is recommended as the next candidate.
        """
        t_init_start = time.monotonic()
        super().__init__(node_name="RandomForest")
        self.num_samples: int = num_samples
        self.regressor: RandomForestRegressor = RandomForestRegressor(
            **regressor_options
        )
        # We will set these later when updating the state.
        # Alternatively, we could have required experiment as an input
        # and extracted them here.
        self.parameters: Optional[List[RangeParameter]] = None
        self.minimize: Optional[bool] = None
        # Recording time spent in initializing the generator. This is
        # used to compute the time spent in candidate generation.
        self.fit_time_since_gen: float = time.monotonic() - t_init_start

    def update_generator_state(self, experiment: Experiment, data: Data) -> None:
        """A method used to update the state of the generator. This includes any
        models, predictors or any other custom state used by the generation node.
        This method will be called with the up-to-date experiment and data before
        ``get_next_candidate`` is called to generate the next trial(s). Note
        that ``get_next_candidate`` may be called multiple times (to generate
        multiple candidates) after a call to  ``update_generator_state``.

        For this example, we will train the regressor using the latest data from
        the experiment.

        Args:
            experiment: The ``Experiment`` object representing the current state of the
                experiment. The key properties includes ``trials``, ``search_space``,
                and ``optimization_config``. The data is provided as a separate arg.
            data: The data / metrics collected on the experiment so far.
        """
        search_space = experiment.search_space
        parameter_names = list(search_space.parameters.keys())
        metric_names = list(experiment.optimization_config.metrics.keys())
        if any(
            not isinstance(p, RangeParameter) for p in search_space.parameters.values()
        ):
            raise NotImplementedError(
                "This example only supports RangeParameters in the search space."
            )
        if search_space.parameter_constraints:
            raise NotImplementedError(
                "This example does not support parameter constraints."
            )
        if len(metric_names) != 1:
            raise NotImplementedError(
                "This example only supports single-objective optimization."
            )
        # Get the data for the completed trials.
        num_completed_trials = len(experiment.trials_by_status[TrialStatus.COMPLETED])
        x = np.zeros([num_completed_trials, len(parameter_names)])
        y = np.zeros([num_completed_trials, 1])
        for t_idx, trial in experiment.trials.items():
            if trial.status == "COMPLETED":
                trial_parameters = trial.arm.parameters
                x[t_idx, :] = np.array([trial_parameters[p] for p in parameter_names])
                trial_df = data.df[data.df["trial_index"] == t_idx]
                y[t_idx, 0] = trial_df[trial_df["metric_name"] == metric_names[0]][
                    "mean"
                ].item()

        # Train the regressor.
        self.regressor.fit(x, y)
        # Update the attributes not set in __init__.
        self.parameters = search_space.parameters
        self.minimize = experiment.optimization_config.objective.minimize

    def get_next_candidate(
        self, pending_parameters: List[TParameterization]
    ) -> TParameterization:
        """Get the parameters for the next candidate configuration to evaluate.

        We will draw ``self.num_samples`` random samples from the search space
        and predict the objective value for each sample. We will then return
        the sample with the best predicted value.

        Args:
            pending_parameters: A list of parameters of the candidates pending
                evaluation. This is often used to avoid generating duplicate candidates.
                We ignore this here for simplicity.

        Returns:
            A dictionary mapping parameter names to parameter values for the next
            candidate suggested by the method.
        """
        bounds = np.array([[p.lower, p.upper] for p in self.parameters.values()])
        unit_samples = np.random.random_sample([self.num_samples, len(bounds)])
        samples = bounds[:, 0] + (bounds[:, 1] - bounds[:, 0]) * unit_samples
        # Predict the objective value for each sample.
        y_pred = self.regressor.predict(samples)
        # Find the best sample.
        best_idx = np.argmin(y_pred) if self.minimize else np.argmax(y_pred)
        best_sample = samples[best_idx, :]
        # Convert the sample to a parameterization.
        candidate = {
            p_name: best_sample[i].item()
            for i, p_name in enumerate(self.parameters.keys())
        }
        return candidate

Construct the GenerationStrategy¶

We will use Sobol for the first 5 trials and defer to random forest for the rest.

In [2]:

generation_strategy = GenerationStrategy(
    name="Sobol+RandomForest",
    nodes=[
        GenerationNode(
            node_name="Sobol",
            model_specs=[ModelSpec(Models.SOBOL)],
            transition_criteria=[
                MaxTrials(
                    # This specifies the maximum number of trials to generate from this node, 
                    # and the next node in the strategy.
                    threshold=5,
                    block_transition_if_unmet=True,
                    transition_to="RandomForest"
                )
            ],
        ),
        RandomForestGenerationNode(num_samples=128, regressor_options={}),
    ],
)

Run a simple experiment using AxClient¶

More details on how to use AxClient can be found in the tutorial.

In [3]:

ax_client = AxClient(generation_strategy=generation_strategy)

ax_client.create_experiment(
    name="hartmann_test_experiment",
    parameters=[
        {
            "name": f"x{i}",
            "type": "range",
            "bounds": [0.0, 1.0],
            "value_type": "float",  # Optional, defaults to inference from type of "bounds".
        }
        for i in range(1, 7)
    ],
    objectives={"hartmann6": ObjectiveProperties(minimize=True)},
)


def evaluate(parameterization: TParameterization) -> Dict[str, Tuple[float, float]]:
    x = np.array([parameterization.get(f"x{i+1}") for i in range(6)])
    return {"hartmann6": (checked_cast(float, hartmann6(x)), 0.0)}

[INFO 05-03 00:24:01] ax.service.ax_client: Starting optimization with verbose logging. To disable logging, set the `verbose_logging` argument to `False`. Note that float values in the logs are rounded to 6 decimal points.

[INFO 05-03 00:24:01] ax.service.utils.instantiation: Created search space: SearchSpace(parameters=[RangeParameter(name='x1', parameter_type=FLOAT, range=[0.0, 1.0]), RangeParameter(name='x2', parameter_type=FLOAT, range=[0.0, 1.0]), RangeParameter(name='x3', parameter_type=FLOAT, range=[0.0, 1.0]), RangeParameter(name='x4', parameter_type=FLOAT, range=[0.0, 1.0]), RangeParameter(name='x5', parameter_type=FLOAT, range=[0.0, 1.0]), RangeParameter(name='x6', parameter_type=FLOAT, range=[0.0, 1.0])], parameter_constraints=[]).

Run the optimization loop¶

In [4]:

for i in range(15):
    parameterization, trial_index = ax_client.get_next_trial()
    ax_client.complete_trial(
        trial_index=trial_index, raw_data=evaluate(parameterization)
    )

[INFO 05-03 00:24:01] ax.service.ax_client: Generated new trial 0 with parameters {'x1': 0.332591, 'x2': 0.665138, 'x3': 0.953525, 'x4': 0.833778, 'x5': 0.667459, 'x6': 0.353341} using model Sobol.

[INFO 05-03 00:24:01] ax.service.ax_client: Completed trial 0 with data: {'hartmann6': (-0.249433, 0.0)}.

[INFO 05-03 00:24:01] ax.service.ax_client: Generated new trial 1 with parameters {'x1': 0.948901, 'x2': 0.112927, 'x3': 0.541063, 'x4': 0.095382, 'x5': 0.251553, 'x6': 0.352071} using model Sobol.

[INFO 05-03 00:24:01] ax.service.ax_client: Completed trial 1 with data: {'hartmann6': (-0.183565, 0.0)}.

[INFO 05-03 00:24:01] ax.service.ax_client: Generated new trial 2 with parameters {'x1': 0.44924, 'x2': 0.695093, 'x3': 0.478247, 'x4': 0.450264, 'x5': 0.156456, 'x6': 0.456935} using model Sobol.

[INFO 05-03 00:24:01] ax.service.ax_client: Completed trial 2 with data: {'hartmann6': (-0.531315, 0.0)}.

[INFO 05-03 00:24:01] ax.service.ax_client: Generated new trial 3 with parameters {'x1': 0.764962, 'x2': 0.368513, 'x3': 0.027139, 'x4': 0.702801, 'x5': 0.557126, 'x6': 0.402943} using model Sobol.

[INFO 05-03 00:24:01] ax.service.ax_client: Completed trial 3 with data: {'hartmann6': (-0.037054, 0.0)}.

[INFO 05-03 00:24:01] ax.service.ax_client: Generated new trial 4 with parameters {'x1': 0.999638, 'x2': 0.11696, 'x3': 0.281316, 'x4': 0.458621, 'x5': 0.794376, 'x6': 0.446874} using model Sobol.

[INFO 05-03 00:24:01] ax.service.ax_client: Completed trial 4 with data: {'hartmann6': (-0.004519, 0.0)}.

/opt/hostedtoolcache/Python/3.10.14/x64/lib/python3.10/site-packages/sklearn/base.py:1474: DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples,), for example using ravel().
  return fit_method(estimator, *args, **kwargs)
[INFO 05-03 00:24:01] ax.service.ax_client: Generated new trial 5 with parameters {'x1': 0.110829, 'x2': 0.410234, 'x3': 0.178481, 'x4': 0.787746, 'x5': 0.882319, 'x6': 0.97933} using model RandomForest.

[INFO 05-03 00:24:01] ax.service.ax_client: Completed trial 5 with data: {'hartmann6': (-0.002906, 0.0)}.

/opt/hostedtoolcache/Python/3.10.14/x64/lib/python3.10/site-packages/sklearn/base.py:1474: DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples,), for example using ravel().
  return fit_method(estimator, *args, **kwargs)

[INFO 05-03 00:24:01] ax.service.ax_client: Generated new trial 6 with parameters {'x1': 0.829822, 'x2': 0.542119, 'x3': 0.743559, 'x4': 0.826393, 'x5': 0.934228, 'x6': 0.722751} using model RandomForest.

[INFO 05-03 00:24:01] ax.service.ax_client: Completed trial 6 with data: {'hartmann6': (-0.001436, 0.0)}.

/opt/hostedtoolcache/Python/3.10.14/x64/lib/python3.10/site-packages/sklearn/base.py:1474: DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples,), for example using ravel().
  return fit_method(estimator, *args, **kwargs)

[INFO 05-03 00:24:01] ax.service.ax_client: Generated new trial 7 with parameters {'x1': 0.599644, 'x2': 0.070349, 'x3': 0.700847, 'x4': 0.378415, 'x5': 0.999115, 'x6': 0.929254} using model RandomForest.

[INFO 05-03 00:24:01] ax.service.ax_client: Completed trial 7 with data: {'hartmann6': (-0.018534, 0.0)}.

/opt/hostedtoolcache/Python/3.10.14/x64/lib/python3.10/site-packages/sklearn/base.py:1474: DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples,), for example using ravel().
  return fit_method(estimator, *args, **kwargs)

[INFO 05-03 00:24:02] ax.service.ax_client: Generated new trial 8 with parameters {'x1': 0.738488, 'x2': 0.33096, 'x3': 0.259166, 'x4': 0.935859, 'x5': 0.999532, 'x6': 0.565484} using model RandomForest.

[INFO 05-03 00:24:02] ax.service.ax_client: Completed trial 8 with data: {'hartmann6': (-0.000461, 0.0)}.

/opt/hostedtoolcache/Python/3.10.14/x64/lib/python3.10/site-packages/sklearn/base.py:1474: DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples,), for example using ravel().
  return fit_method(estimator, *args, **kwargs)

[INFO 05-03 00:24:02] ax.service.ax_client: Generated new trial 9 with parameters {'x1': 0.902593, 'x2': 0.649943, 'x3': 0.696071, 'x4': 0.562282, 'x5': 0.914872, 'x6': 0.636617} using model RandomForest.

[INFO 05-03 00:24:02] ax.service.ax_client: Completed trial 9 with data: {'hartmann6': (-0.001113, 0.0)}.

/opt/hostedtoolcache/Python/3.10.14/x64/lib/python3.10/site-packages/sklearn/base.py:1474: DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples,), for example using ravel().
  return fit_method(estimator, *args, **kwargs)

[INFO 05-03 00:24:02] ax.service.ax_client: Generated new trial 10 with parameters {'x1': 0.381555, 'x2': 0.151912, 'x3': 0.286819, 'x4': 0.893982, 'x5': 0.260035, 'x6': 0.873976} using model RandomForest.

[INFO 05-03 00:24:02] ax.service.ax_client: Completed trial 10 with data: {'hartmann6': (-0.054135, 0.0)}.

/opt/hostedtoolcache/Python/3.10.14/x64/lib/python3.10/site-packages/sklearn/base.py:1474: DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples,), for example using ravel().
  return fit_method(estimator, *args, **kwargs)

[INFO 05-03 00:24:02] ax.service.ax_client: Generated new trial 11 with parameters {'x1': 0.236558, 'x2': 0.644154, 'x3': 0.304187, 'x4': 0.940791, 'x5': 0.809804, 'x6': 0.402144} using model RandomForest.

[INFO 05-03 00:24:02] ax.service.ax_client: Completed trial 11 with data: {'hartmann6': (-0.053814, 0.0)}.

/opt/hostedtoolcache/Python/3.10.14/x64/lib/python3.10/site-packages/sklearn/base.py:1474: DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples,), for example using ravel().
  return fit_method(estimator, *args, **kwargs)

[INFO 05-03 00:24:02] ax.service.ax_client: Generated new trial 12 with parameters {'x1': 0.469254, 'x2': 0.930042, 'x3': 0.892594, 'x4': 0.85488, 'x5': 0.875946, 'x6': 0.892123} using model RandomForest.

[INFO 05-03 00:24:02] ax.service.ax_client: Completed trial 12 with data: {'hartmann6': (-0.000921, 0.0)}.

/opt/hostedtoolcache/Python/3.10.14/x64/lib/python3.10/site-packages/sklearn/base.py:1474: DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples,), for example using ravel().
  return fit_method(estimator, *args, **kwargs)

[INFO 05-03 00:24:02] ax.service.ax_client: Generated new trial 13 with parameters {'x1': 0.668136, 'x2': 0.3203, 'x3': 0.561483, 'x4': 0.599216, 'x5': 0.483797, 'x6': 0.614828} using model RandomForest.

[INFO 05-03 00:24:02] ax.service.ax_client: Completed trial 13 with data: {'hartmann6': (-0.333465, 0.0)}.

/opt/hostedtoolcache/Python/3.10.14/x64/lib/python3.10/site-packages/sklearn/base.py:1474: DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples,), for example using ravel().
  return fit_method(estimator, *args, **kwargs)

[INFO 05-03 00:24:02] ax.service.ax_client: Generated new trial 14 with parameters {'x1': 0.6826, 'x2': 0.098756, 'x3': 0.096295, 'x4': 0.251534, 'x5': 0.64098, 'x6': 0.275748} using model RandomForest.

[INFO 05-03 00:24:02] ax.service.ax_client: Completed trial 14 with data: {'hartmann6': (-0.06407, 0.0)}.

View the trials generated during optimization¶

In [5]:

exp_df = exp_to_df(ax_client.experiment)
exp_df

[WARNING 05-03 00:24:02] ax.service.utils.report_utils: Column reason missing for all trials. Not appending column.

Out[5]:

	trial_index	arm_name	trial_status	generation_method	hartmann6	x1	x2	x3	x4	x5	x6
0	0	0_0	COMPLETED	Sobol	-0.249433	0.332591	0.665138	0.953525	0.833778	0.667459	0.353341
1	1	1_0	COMPLETED	Sobol	-0.183565	0.948901	0.112927	0.541063	0.095382	0.251553	0.352071
2	2	2_0	COMPLETED	Sobol	-0.531315	0.449240	0.695093	0.478247	0.450264	0.156456	0.456935
3	3	3_0	COMPLETED	Sobol	-0.037054	0.764962	0.368513	0.027139	0.702801	0.557126	0.402943
4	4	4_0	COMPLETED	Sobol	-0.004519	0.999638	0.116960	0.281316	0.458621	0.794376	0.446874
5	5	5_0	COMPLETED	RandomForest	-0.002906	0.110829	0.410234	0.178481	0.787746	0.882319	0.979330
6	6	6_0	COMPLETED	RandomForest	-0.001436	0.829822	0.542119	0.743559	0.826393	0.934228	0.722751
7	7	7_0	COMPLETED	RandomForest	-0.018534	0.599644	0.070349	0.700847	0.378415	0.999115	0.929254
8	8	8_0	COMPLETED	RandomForest	-0.000461	0.738488	0.330960	0.259166	0.935859	0.999532	0.565484
9	9	9_0	COMPLETED	RandomForest	-0.001113	0.902593	0.649943	0.696071	0.562282	0.914872	0.636617
10	10	10_0	COMPLETED	RandomForest	-0.054135	0.381555	0.151912	0.286819	0.893982	0.260035	0.873976
11	11	11_0	COMPLETED	RandomForest	-0.053814	0.236558	0.644154	0.304187	0.940791	0.809804	0.402144
12	12	12_0	COMPLETED	RandomForest	-0.000921	0.469254	0.930042	0.892594	0.854880	0.875946	0.892123
13	13	13_0	COMPLETED	RandomForest	-0.333465	0.668136	0.320300	0.561483	0.599216	0.483797	0.614828
14	14	14_0	COMPLETED	RandomForest	-0.064070	0.682600	0.098756	0.096295	0.251534	0.640980	0.275748

In [6]:

plot_objective_value_vs_trial_index(
    exp_df=exp_df,
    metric_colname="hartmann6",
    minimize=True,
    title="Hartmann6 Objective Value vs. Trial Index",
)

/tmp/tmp.J4Ip05By14/Ax-main/ax/plot/trace.py:852: FutureWarning:

DataFrame.fillna with 'method' is deprecated and will raise in a future version. Use obj.ffill() or obj.bfill() instead.

Download Tutorial Jupyter Notebook

Download Tutorial Source Code

Total runtime of script: 8.13 seconds.