fusionlab.nn.transformers.TemporalFusionTransformer
- class fusionlab.nn.transformers.TemporalFusionTransformer
Bases: Model, NNLearner
Temporal Fusion Transformer (TFT) model implementation for multi-horizon forecasting, with dynamic (past) inputs and optional static and future inputs.
This class extends Keras Model and integrates with the gofast NNLearner interface. It supports dynamic (past) inputs, optional static inputs, and optional future inputs (future_input_dim). By including future covariates, the model can account for features that are known in advance (e.g., events or planned discount rates) in its predictions.
- Parameters:
dynamic_input_dim (int) – Dimensionality of the dynamic (past) inputs. This is mandatory for the TFT model.
static_input_dim (int, optional) – Dimensionality of static inputs. If not None, the call method will expect static inputs.
future_input_dim (int, optional) – Dimensionality of future (known) inputs. If not None, the call method will expect future inputs to handle exogenous covariates known in the future (e.g., events, planned promotions, etc.).
hidden_units (int, default 32) – Number of hidden units for the layers that do not have a distinct specification (e.g., GRNs, variable selection networks).
num_heads (int, default 4) – Number of attention heads in the multi-head attention layer.
dropout_rate (float, default 0.1) – Dropout rate for various layers (GRNs, attention, etc.).
forecast_horizon (int, default 1) – Number of timesteps to forecast into the future.
quantiles (list of float, optional) – List of quantiles for probabilistic forecasting. If None, a single deterministic output is produced.
activation (str, default 'elu') – Activation function. Must be one of {'elu', 'relu', 'tanh', 'sigmoid', 'linear', 'gelu'}.
use_batch_norm (bool, default False) – Whether to apply batch normalization in various sub-layers.
num_lstm_layers (int, default 1) – Number of LSTM layers in the encoder.
lstm_units (list of int or None, default None) – If provided, each index corresponds to the number of LSTM units for that layer. If None, uses hidden_units for each layer.
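The lstm_units fallback described above can be illustrated with a small helper. This is a minimal sketch under stated assumptions, not fusionlab's implementation: resolve_lstm_units is a hypothetical name, and raising an error on a length mismatch is an assumption about how such a case might reasonably be handled.

```python
from typing import List, Optional


def resolve_lstm_units(
    num_lstm_layers: int,
    lstm_units: Optional[List[int]] = None,
    hidden_units: int = 32,
) -> List[int]:
    """Return one unit count per LSTM layer (hypothetical helper)."""
    if lstm_units is None:
        # Documented fallback: every layer uses hidden_units.
        return [hidden_units] * num_lstm_layers
    if len(lstm_units) != num_lstm_layers:
        # Assumption: a mismatch between the list length and the
        # number of layers is treated as an error in this sketch.
        raise ValueError(
            f"expected {num_lstm_layers} entries in lstm_units, "
            f"got {len(lstm_units)}"
        )
    return list(lstm_units)


# One entry per layer when lstm_units is given explicitly.
explicit = resolve_lstm_units(2, [64, 32], hidden_units=32)
# hidden_units repeated per layer when lstm_units is None.
fallback = resolve_lstm_units(2, None, hidden_units=32)
```

With num_lstm_layers=2, the explicit list is returned unchanged, while the fallback repeats hidden_units once per layer.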
Examples
>>> from fusionlab.nn._tensor_validation import validate_tft_inputs
>>> from fusionlab.nn.tft import TemporalFusionTransformer
>>> model = TemporalFusionTransformer(
...     dynamic_input_dim=10,
...     static_input_dim=5,
...     future_input_dim=8,
...     hidden_units=32,
...     num_heads=4,
...     dropout_rate=0.1,
...     forecast_horizon=7,
...     quantiles=[0.1, 0.5, 0.9],
...     activation='elu',
...     use_batch_norm=True,
...     num_lstm_layers=2,
...     lstm_units=[64, 32]
... )
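The example requests three quantiles. Quantile forecasts are commonly trained and scored with the pinball (quantile) loss, so a plain-Python sketch of that loss may help in reading the quantile outputs. This page does not state which loss fusionlab uses, so treat the functions below (pinball_loss and mean_pinball_loss, both hypothetical names) as an illustration of the general technique only.

```python
from typing import Sequence


def pinball_loss(y_true: float, y_pred: float, q: float) -> float:
    """Pinball (quantile) loss for one observation at quantile level q."""
    error = y_true - y_pred
    # Under-prediction (error > 0) costs q per unit of error;
    # over-prediction (error < 0) costs (1 - q) per unit of error.
    return max(q * error, (q - 1.0) * error)


def mean_pinball_loss(
    y_true: Sequence[float],
    y_pred: Sequence[Sequence[float]],
    quantiles: Sequence[float],
) -> float:
    """Average pinball loss over all time steps and quantiles.

    y_pred[t][k] is the forecast for step t at quantile quantiles[k].
    """
    total = 0.0
    for target, step_preds in zip(y_true, y_pred):
        for q, pred in zip(quantiles, step_preds):
            total += pinball_loss(target, pred, q)
    return total / (len(y_true) * len(quantiles))


# Illustrative values only: two forecast steps, three quantiles each.
quantiles = [0.1, 0.5, 0.9]
targets = [10.0, 12.0]
forecasts = [[8.0, 10.0, 12.0], [9.0, 12.0, 15.0]]
loss = mean_pinball_loss(targets, forecasts, quantiles)
```

The asymmetric weighting is what pushes the 0.1 forecast below the target and the 0.9 forecast above it, while the 0.5 forecast behaves like a median estimate.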
Notes
The future_input_dim parameter allows the model to incorporate future covariates known at forecast time. In the call method, if future_input_dim is not None, the model expects three inputs: (static_inputs, dynamic_inputs, future_inputs). Otherwise, it expects only (static_inputs, dynamic_inputs).
See also
VariableSelectionNetwork – For feature selection and embedding.
GatedResidualNetwork – A GRN used in various sub-layers.
LSTM – Keras LSTM layers for sequence processing.
- __init__(dynamic_input_dim, static_input_dim=None, future_input_dim=None, hidden_units=32, num_heads=4, dropout_rate=0.1, forecast_horizon=1, quantiles=None, activation='elu', use_batch_norm=False, num_lstm_layers=1, lstm_units=None, output_dim=1, **kw)
Methods
__init__(dynamic_input_dim[, ...])
add_loss(loss) – Can be called inside of the call() method to add a scalar loss.
add_metric(*args, **kwargs)
add_variable(shape, initializer[, dtype, ...]) – Add a weight variable to the layer.
add_weight([shape, initializer, dtype, ...]) – Add a weight variable to the layer.
build(input_shape)
build_from_config(config) – Builds the layer's states with the supplied config dict.
call(inputs[, training]) – The main forward pass for NTemporalFusionTransformer.
compile([optimizer, loss, loss_weights, ...]) – Configures the model for training.
compile_from_config(config) – Compiles the model with the information given in config.
compiled_loss(y, y_pred[, sample_weight, ...])
compute_loss([x, y, y_pred, sample_weight, ...]) – Compute the total loss, validate it, and return it.
compute_mask(inputs, previous_mask)
compute_metrics(x, y, y_pred[, sample_weight]) – Update metric states and collect all metrics to be returned.
compute_output_shape(*args, **kwargs)
compute_output_spec(*args, **kwargs)
count_params() – Count the total number of scalars composing the weights.
evaluate([x, y, batch_size, verbose, ...]) – Returns the loss value & metrics values for the model in test mode.
export(filepath[, format, verbose, ...]) – Export the model as an artifact for inference.
fit([x, y, batch_size, epochs, verbose, ...]) – Trains the model for a fixed number of epochs (dataset iterations).
from_config(config) – Recreate NTemporalFusionTransformer instance from config.
get_build_config() – Returns a dictionary with the layer's input shape.
get_compile_config() – Returns a serialized config with information for compiling the model.
Return the model configuration for serialization.
get_layer([name, index]) – Retrieves a layer based on either its name (unique) or index.
get_metrics_result() – Returns the model's metrics values as a dict.
get_params([deep]) – Get the parameters for this learner.
get_state_tree([value_format]) – Retrieves tree-like structure of model variables.
get_weights() – Return the values of layer.weights as a list of NumPy arrays.
help(**kwargs)
load(file_path[, format]) – Load the learner's state from a specified file in the desired format.
load_own_variables(store) – Loads the state of the layer.
load_weights(filepath[, skip_mismatch]) – Load the weights from a single file or sharded files.
loss(y, y_pred[, sample_weight])
make_predict_function([force])
make_test_function([force])
make_train_function([force])
predict(x[, batch_size, verbose, steps, ...]) – Generates output predictions for the input samples.
predict_on_batch(x) – Returns predictions for a single batch of samples.
predict_step(data)
quantize(mode[, config]) – Quantize the weights of the model.
quantized_build(input_shape, mode)
quantized_call(*args, **kwargs)
rematerialized_call(layer_call, *args, **kwargs) – Enable rematerialization dynamically for layer's call method.
reset_metrics()
save(filepath[, overwrite, zipped]) – Saves a model as a .keras file.
save_own_variables(store) – Saves the state of the layer.
save_weights(filepath[, overwrite, ...]) – Saves all weights to a single file or sharded files.
set_params(**params) – Set the parameters of this learner.
set_state_tree(state_tree) – Assigns values to variables of the model.
set_weights(weights) – Sets the values of layer.weights from a list of NumPy arrays.
stateless_call(trainable_variables, ...[, ...]) – Call the layer without any side effects.
stateless_compute_loss(trainable_variables, ...)
summary([line_length, positions, print_fn, ...]) – Prints a string summary of the network.
symbolic_call(*args, **kwargs)
test_on_batch(x[, y, sample_weight, return_dict]) – Test the model on a single batch of samples.
test_step(data)
to_json(**kwargs) – Returns a JSON string containing the network configuration.
train_on_batch(x[, y, sample_weight, ...]) – Runs a single gradient update on a single batch of data.
train_step(data)
Attributes
compiled_metrics
compute_dtype – The dtype of the computations performed by the layer.
distribute_reduction_method
distribute_strategy
dtype – Alias of layer.variable_dtype.
dtype_policy
input – Retrieves the input tensor(s) of a symbolic operation.
input_dtype – The dtype layer inputs should be converted to.
input_spec
jit_compile
layers
losses – List of scalar losses from add_loss, regularizers and sublayers.
metrics – List of all metrics.
metrics_names
metrics_variables – List of all metric variables.
non_trainable_variables – List of all non-trainable layer state.
non_trainable_weights – List of all non-trainable weight variables of the layer.
output – Retrieves the output tensor(s) of a layer.
path – The path of the layer.
quantization_mode – The quantization mode of this layer, None if not quantized.
run_eagerly
supports_masking – Whether this layer supports computing a mask using compute_mask.
trainable – Settable boolean, whether this layer should be trainable or not.
trainable_variables – List of all trainable layer state.
trainable_weights – List of all trainable weight variables of the layer.
variable_dtype – The dtype of the state (weights) of the layer.
variables – List of all layer state, including random seeds.
weights – List of all weight variables of the layer.
- help(**kwargs)
- my_params = TemporalFusionTransformer(dynamic_input_dim, static_input_dim=None, future_input_dim=None, hidden_units=32, num_heads=4, dropout_rate=0.1, forecast_horizon=1, quantiles=None, activation='elu', use_batch_norm=False, num_lstm_layers=1, lstm_units=None, output_dim=1)
- call(inputs, training=False, **kw)
The main forward pass for NTemporalFusionTransformer.
Validate and unpack inputs using validate_tft_inputs.
Apply variable selection to static, dynamic, and future inputs.
Perform positional encoding on dynamic+future sequences.
Compute static context vectors if static is present.
Pass through LSTM encoders.
Optionally enrich dynamic with static context.
Temporal attention for interpretable weighting of time steps.
Position-wise feedforward (GRN).
Final slicing (forecast horizon) and output (quantiles or single).
- Parameters:
inputs (tuple) – Should contain up to three elements: (dynamic_inputs, future_inputs, static_inputs), or fewer if not all are provided.
training (bool, default False) – Whether the model is in training mode (affects dropout, batch normalization, etc.).
- Returns:
Final predicted sequences of shape (batch_size, forecast_horizon, num_quantiles or 1).
- Return type:
tf.Tensor
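The documented return shape can be checked ahead of time with a small helper. This is a sketch based only on the shape stated above; expected_output_shape is a hypothetical name and is not part of fusionlab. It also ignores the output_dim argument of __init__, whose effect on the output shape is not described on this page.

```python
from typing import Optional, Sequence, Tuple


def expected_output_shape(
    batch_size: int,
    forecast_horizon: int = 1,
    quantiles: Optional[Sequence[float]] = None,
) -> Tuple[int, int, int]:
    """Return (batch_size, forecast_horizon, num_quantiles or 1)."""
    # A single deterministic output is produced when quantiles is None.
    last_dim = 1 if quantiles is None else len(quantiles)
    return (batch_size, forecast_horizon, last_dim)


# Settings from the class example: 7-step horizon, three quantiles.
probabilistic = expected_output_shape(16, 7, [0.1, 0.5, 0.9])
# Defaults: one-step horizon, deterministic output.
deterministic = expected_output_shape(16)
```

Comparing this tuple against the shape of the tensor returned by call is a quick sanity check that forecast_horizon and quantiles were configured as intended.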