Proper Orthogonal Decomposition + Interpolation (PODI)¶
PODI is an application used to predict vectorial outputs. It combines Proper Orthogonal Decomposition (POD) and Kriging-based surrogate models to perform the estimations.
Context¶
We seek an approximation of a vector \(u(\mathbf{x}) \in \mathbb{R}^p\), with \(p \gg 1\) and \(\mathbf{x} \in \mathcal{X} \subset \mathbb{R}^d\) an input vector. Since PODI is a supervised learning approach, we assume that a Design of Experiments (DoE) of size \(N\) is available, i.e. the outputs \(u(\mathbf{x_k})\) are known for \(k \in [\![1,N]\!]\). In model order reduction, such a vector \(u(\mathbf{x_k})\) is called a snapshot. The PODI application aims at building an approximation \(\hat{u}(\mathbf{x})\) of \(u(\mathbf{x})\) for any \(\mathbf{x}\in\mathcal{X}\).
To construct this approximation, the \(N\) snapshots are first gathered in a database called the snapshot matrix:

\[S = \left[ u(\mathbf{x_1}), \cdots, u(\mathbf{x_N}) \right] \in \mathbb{R}^{p \times N}\]
Each column of the matrix corresponds to a snapshot output \(u(\mathbf{x_k})\).
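As an illustration (this is not part of the PODI API), the snapshot matrix can be assembled column by column from a hypothetical solver that returns one snapshot per input point; all names below are placeholders:

import numpy as np

# Hypothetical illustration: `solver` stands in for the code that produces one
# snapshot u(x_k) of dimension p for a single input point x_k.
p, N = 100, 40
rng = np.random.default_rng(0)
x_doe = rng.uniform(0.0, 4.0, size=(N, 1))  # Design of Experiments with N input points

def solver(xk):
    # placeholder output of dimension p
    return np.sin(np.linspace(0.0, 1.0, p) * xk[0])

# column k of the snapshot matrix S is the snapshot u(x_k)
S = np.column_stack([solver(xk) for xk in x_doe])  # shape (p, N)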
Proper Orthogonal Decomposition (POD)¶
Global POD¶
The Proper Orthogonal Decomposition of \(u\) reads

\[u(\mathbf{x}) \approx \hat{u}(\mathbf{x}) = u_0 + \sum_{i=1}^{M} \alpha_i(\mathbf{x})\,\phi_i\]

- \(u\) is decomposed as a sum of \(M\) modes, and \(u_0\) corresponds to the mean value of \(u\).
- Each mode \(i\) is defined by a scalar coefficient \(\alpha_i\), called generalized coordinate, and a vector \(\phi_{i}\) of dimension \(p\).
- The \(\phi_i\) vectors are orthogonal and form the POD basis \(\Phi\). Note that they are independent of \(\mathbf{x}\), hence the name “global” POD basis.
In practice, the mean value \(u_0\) of \(u\) is not available and is estimated by the mean value of the \(N\) snapshots. It can be shown that the basis \(\Phi\) leading to the best approximation in the mean-square-error sense is given by the singular vectors of the centered snapshot matrix \(S-u_0\). The generalized coordinates \(\alpha_i(\mathbf{x}), i = 1,\cdots,M\) are then interpolated by Gaussian processes (GP).
Local POD¶
Local POD starts by assuming that the input vector \(\mathbf{x}\) can be split into two subsets of variables, i.e. \(\mathbf{x}=\left\lbrace \mathbf{x_1},\mathbf{x_2}\right\rbrace\). The splitting is not automatic and is left to the user. Then, the local POD approximation reads

\[\hat{u}(\mathbf{x}) = u_0 + \sum_{i=1}^{M} \alpha_i(\mathbf{x_1},\mathbf{x_2})\,\phi_i(\mathbf{x_1})\]
where \(\phi_i(\mathbf{x_1})\) is the local POD basis at input point \(\mathbf{x_1}\). In practice this POD basis must be approximated by interpolating, on the Grassmann manifold, a database of local POD bases. Hence this approach further assumes that a DoE of local POD bases is provided. More information on this interpolation can be found in [1]. The generalized coordinates \(\alpha_i(\mathbf{x_1},\mathbf{x_2})\) are interpolated by GPs, as in the global POD case.
[1] Porrello, C., Dubreuil, S., and Farhat, C. Bayesian framework with projection-based model order reduction for efficient global optimization. In AIAA AVIATION FORUM AND ASCEND 2024 (2024)
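As a minimal sketch of this basis interpolation (mirroring the PODI.interp_subspaces call made in Example 2 below), random orthonormal matrices are used here as placeholders for local POD bases computed at three values of \(\mathbf{x_1}\); all other names are illustrative:

import numpy as np
from smt.applications import PODI

p, n_modes = 100, 4
x1_train = np.array([[0.0], [0.5], [1.0]])  # x1 values where local bases are available
rng = np.random.default_rng(0)
# placeholder local bases Phi(x1_k): random orthonormal (p x n_modes) matrices
local_pod_bases = [np.linalg.qr(rng.normal(size=(p, n_modes)))[0] for _ in x1_train]

x1_new = np.array([[0.3]])  # new x1 point where a basis is needed
bases = PODI.interp_subspaces(
    xt1=x1_train,
    input_matrices=local_pod_bases,
    xn1=x1_new,
    frechet=True,  # use a Frechet-mean reference point, as in Example 2
    print_global=False,
)
local_basis_new = bases[0]  # interpolated POD basis at x1_new, shape (p, n_modes)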
Singular Values Decomposition (SVD)¶
To perform the POD, the SVD of the centered snapshot matrix \(S - u_0\) is used:

\[S - u_0 = U\,\Sigma\,{V}^{T}\]
The \((p \times p)\) matrix \(U\) and the \((N \times N)\) matrix \({V}^{T}\) are orthogonal and contain the singular vectors. These vectors are the directions of maximum variance in the data and are ranked by decreasing order of importance. Each vector corresponds to a mode of \(u\). The total number of available modes is limited by the number of snapshots:

\[n_{\text{modes}} \leq N\]
The importance of each mode is represented by the diagonal entries of the \((p \times N)\) matrix \(\Sigma\). They are known as the singular values \(\sigma_i\); they are positive and ranked by decreasing value. The modes are then filtered to keep only those that represent most of the data structure. To do this, we use the explained variance: it represents the fraction of the data variance that is retained when filtering the modes.
If \(m\) modes are kept, their explained variance \(EV_m\) is:

\[EV_m = \frac{\sum_{i=1}^{m} \sigma_i^2}{\sum_{i=1}^{N} \sigma_i^2}\]
The number of kept modes \(M\) is defined by a tolerance \(\eta \in ]0,1]\) that represents the minimum ratio of variance we want to explain during the SVD:

\[M = \min \left\lbrace m \ : \ EV_m \geq \eta \right\rbrace\]
Then, the first \(M\) singular vectors of the \(U\) matrix correspond to the \(\phi_i\) vectors of the POD, and the \(\alpha_i\) coefficients at the snapshots are gathered in the \((M \times N)\) matrix \(A\):

\[A = \Phi^{T}\left(S - u_0\right) = \Sigma_M\,V_M^{T}\]

where \(\Sigma_M\) and \(V_M^{T}\) retain the first \(M\) singular values and right singular vectors.
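The following numpy sketch summarizes this procedure (it reuses the snapshot matrix S from the sketch in the Context section and is only indicative of what PODI.compute_pod does internally):

import numpy as np

u0 = S.mean(axis=1, keepdims=True)  # estimate of the mean value of u
U, sigma, Vt = np.linalg.svd(S - u0, full_matrices=False)

EV = np.cumsum(sigma**2) / np.sum(sigma**2)  # explained variance EV_m for m = 1..N
eta = 0.9999
M = int(np.searchsorted(EV, eta) + 1)  # smallest m such that EV_m >= eta

Phi = U[:, :M]  # POD basis: the phi_i vectors
A = np.diag(sigma[:M]) @ Vt[:M, :]  # generalized coordinates alpha_i(x_k), shape (M, N)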
Use of Surrogate models¶
To compute \(u\) at a new value \(\mathbf{x}_*\), the values of \(\alpha_i(\mathbf{x}_*)\) at each mode \(i\) are needed.
To estimate them, Kriging-based surrogate models are used:
For each kept mode \(i\), we use a surrogate model that is trained with the inputs \(\mathbf{x}_k\) and outputs \(\alpha_i(\mathbf{x}_k)\).
These models are able to compute an estimation denoted \(\hat\alpha_i(\mathbf{x}_*)\). It is normally distributed:

\[\hat\alpha_i(\mathbf{x}_*) \sim \mathcal{N}\left(\mu_{\alpha_i}(\mathbf{x}_*),\ \sigma_{\alpha_i}^2(\mathbf{x}_*)\right)\]
The mean, variance and derivative of \(\hat{u}(\mathbf{x}_*)\) can then be deduced:

\[\mathbb{E}\left[\hat{u}(\mathbf{x}_*)\right] = u_0 + \sum_{i=1}^{M} \mu_{\alpha_i}(\mathbf{x}_*)\,\phi_i\]
\[\mathbb{V}\left[\hat{u}(\mathbf{x}_*)\right] = \sum_{i=1}^{M} \sigma_{\alpha_i}^2(\mathbf{x}_*)\,\phi_i^2\]
\[\frac{\partial \hat{u}}{\partial x_j}(\mathbf{x}_*) = \sum_{i=1}^{M} \frac{\partial \hat\alpha_i}{\partial x_j}(\mathbf{x}_*)\,\phi_i\]

where \(\phi_i^2\) is understood component-wise.
NB: the variance equation relies on the fact that the surrogate models are pairwise independent, and so are the coefficients \(\hat\alpha_i(\mathbf{x}_*)\) (see the sketch below).
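The sketch below illustrates this interpolation step with one Kriging model per kept mode, using the KRG surrogate from smt.surrogate_models; it reuses Phi, A, u0 and x_doe from the previous sketches and only approximates what PODI does with its default options:

import numpy as np
from smt.surrogate_models import KRG

# one Kriging model per kept mode, trained on the generalized coordinates
models = []
for i in range(Phi.shape[1]):
    sm = KRG(print_global=False)
    sm.set_training_values(x_doe, A[i, :].reshape(-1, 1))  # alpha_i at the DoE points
    sm.train()
    models.append(sm)

x_new = np.array([[2.0]])  # new input point x_*
mu = np.array([sm.predict_values(x_new)[0, 0] for sm in models])  # mean of each alpha_i
var = np.array([sm.predict_variances(x_new)[0, 0] for sm in models])  # variance of each alpha_i

u_mean = u0[:, 0] + Phi @ mu  # mean of u(x_*)
u_var = (Phi**2) @ var  # variance of u(x_*), assuming independent modes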
Usage¶
Example 1: global POD case for 1D function¶
import matplotlib.pyplot as plt
import numpy as np
from smt.applications import PODI
from smt.sampling_methods import LHS
light_pink = np.array((250, 233, 232)) / 255
p = 100
t = np.linspace(-1, 1, p)
n_modes_test = 10
def function_test_1d(x, t, n_modes_test, p):
    import numpy as np  # Note: only required by SMT doc testing toolchain

    def cos_coeff(i: int, x: np.ndarray):
        a = 2 * i % 2 - 1
        return a * x[:, 0] * np.cos(i * x[:, 0])

    def Legendre(i: int, t: np.ndarray):
        from scipy import special

        return special.legendre(i)(t)

    def gram_schmidt(input_array: np.ndarray) -> np.ndarray:
        """To perform the Gram-Schmidt's algorithm."""
        basis = np.zeros_like(input_array)
        for i in range(len(input_array)):
            basis[i] = input_array[i]
            for j in range(i):
                basis[i] -= (
                    np.dot(input_array[i], basis[j])
                    / np.dot(basis[j], basis[j])
                    * basis[j]
                )
            basis[i] /= np.linalg.norm(basis[i])
        return basis

    u0 = np.zeros((p, 1))

    alpha = np.zeros((x.shape[0], n_modes_test))
    for i in range(n_modes_test):
        alpha[:, i] = cos_coeff(i, x)

    V_init = np.zeros((p, n_modes_test))
    for i in range(n_modes_test):
        V_init[:, i] = Legendre(i, t)

    V = gram_schmidt(V_init.T).T
    database = u0 + np.dot(V, alpha.T)

    return database
seed_sampling = 42
xlimits = np.array([[0, 4]])
sampling = LHS(xlimits=xlimits, random_state=seed_sampling)
nt = 40
xt = sampling(nt)
nv = 50
xv = sampling(nv)
x = np.concatenate((xt, xv))
dbfull = function_test_1d(x, t, n_modes_test, p)
# Training data
dbt = dbfull[:, :nt]
# Validation data
dbv = dbfull[:, nt:]
podi = PODI()
seed_pod = 42
podi.compute_pod(dbt, tol=0.9999, seed=seed_pod)
podi.set_training_values(xt)
podi.train()
values = podi.predict_values(xv)
variances = podi.predict_variances(xv)
# computing the POD errors:
# [max_interp_error, max_proj_error, max_total_error] = PODI.compute_pod_errors(xt = xt, database = dbt)
# print("interpolation error: ", max_interp_error)
# print("projection error: ", max_proj_error)
# print("total error: ", max_total_error)
# Choosing a value from the validation inputs
i = nv // 2
diff = dbv[:, i] - values[:, i]
rms_error = np.sqrt(np.mean(diff**2))
plt.figure(figsize=(8, 5))
light_pink = np.array((250, 233, 232)) / 255
plt.fill_between(
np.ravel(t),
np.ravel(values[:, i] - 3 * np.sqrt(variances[:, i])),
np.ravel(values[:, i] + 3 * np.sqrt(variances[:, i])),
color=light_pink,
label="confiance interval (99%)",
)
plt.scatter(
t,
values[:, i],
color="r",
marker="x",
s=15,
alpha=1.0,
label="prediction (mean)",
)
plt.scatter(
t,
dbv[:, i],
color="b",
marker="*",
s=5,
alpha=1.0,
label="reference",
)
plt.plot([], [], color="w", label="rms = " + str(round(rms_error, 9)))
ax = plt.gca()
ax.axes.xaxis.set_visible(False)
plt.ylabel("u(x = " + str(xv[i, 0])[:4] + ")")
plt.title("Estimation of u at x = " + str(xv[i, 0])[:4])
plt.legend()
plt.show()

Example 2: local POD case for 2D function¶
import matplotlib.pyplot as plt
import numpy as np
from smt.applications import PODI
from smt.sampling_methods import LHS
p = 100
y = np.linspace(-1, 1, p)
n_modes_test = 10
def function_test_2d_local(x, y, n_modes_test, p):
    import numpy as np  # Note: only required by SMT doc testing toolchain

    def cos_coeff_nd(i: int, x: np.ndarray):
        a = 2 * i % 2 - 1
        return a * sum(x.T) * np.cos(i * sum(x.T))

    def Legendre(i: int, y: np.ndarray):
        from scipy import special

        return special.legendre(i)(y)

    def gram_schmidt(input_array: np.ndarray) -> np.ndarray:
        """To perform the Gram-Schmidt's algorithm."""
        basis = np.zeros_like(input_array)
        for i in range(len(input_array)):
            basis[i] = input_array[i]
            for j in range(i):
                basis[i] -= (
                    np.dot(input_array[i], basis[j])
                    / np.dot(basis[j], basis[j])
                    * basis[j]
                )
            basis[i] /= np.linalg.norm(basis[i])
        return basis

    u0 = np.zeros((p, 1))

    alpha = np.zeros((x.shape[0], n_modes_test))
    for i in range(n_modes_test):
        alpha[:, i] = cos_coeff_nd(i, x)

    V_init = np.zeros((p, n_modes_test))
    for i in range(n_modes_test):
        V_init[:, i] = Legendre(i, y)

    V = gram_schmidt(V_init.T).T
    database = u0 + np.dot(V, alpha.T)

    return database
seed = 42
xlimits = [[0, 1], [0, 4]]
sampling_x1 = LHS(xlimits=np.array([xlimits[0]]), random_state=seed)
sampling_x2 = LHS(xlimits=np.array([xlimits[1]]), random_state=seed + 1)
nt1 = 25
nt2 = 10
nt = nt1 * nt2
xt1 = sampling_x1(nt1)
xt2 = sampling_x2(nt)
xt = np.zeros((nt, 2))
xt[:, 1] = xt2[:, 0]
for i, elt in enumerate(xt1):
    xt[i * nt2 : (i + 1) * nt2, 0] = elt
sampling_new = LHS(xlimits=np.array(xlimits), random_state=seed)
nv = 15
xv = sampling_new(nv)
xv1 = np.atleast_2d(xv[:, 0]).T
x = np.concatenate((xt, xv))
dbfull = function_test_2d_local(x, y, n_modes_test, p)
# Training data
dbt = dbfull[:, :nt]
# Validation data
dbv = dbfull[:, nt:]
plt.figure(figsize=(8, 5))
axes = plt.gca()
axes.tick_params(axis="x", labelsize=14)
axes.tick_params(axis="y", labelsize=14)
plt.scatter(xt[:, 1], xt[:, 0], marker="x", label="Training points", color="g")
plt.scatter(
xv[:, 1], xv[:, 0], marker="*", label="Validation points", color="r"
)
plt.xlabel(r"$x^{(2)}$", fontsize=18)
plt.ylabel(r"$x^{(1)}$", fontsize=18)
plt.legend(loc="lower left", fontsize=14)
tol = 0.9999 # SVD tolerance for each line's POD basis
local_pod_bases = [] # list of each line's POD bases
n_modes_list = []
podi = PODI()
for i in range(nt1):
    db_loc = dbt[:, i * nt2 : (i + 1) * nt2]
    podi.compute_pod(
        db_loc, pod_type="global", n_modes=min(db_loc.shape), seed=i
    )
    ev_list = podi.get_ev_list()
    n_modes = PODI.choice_n_modes_tol(ev_list, tol)
    n_modes_list.append(n_modes)
    local_basis = podi.get_basis()
    local_pod_bases.append(local_basis)
# Function that chooses the 'n_bases' closest bases (closest values of x^(1))
# and uses 'interp_subspaces' to estimate a new basis
n_bases = 10
def choose_local_bases(local_pod_bases, n_bases, modes_list, xt1, xv1):
    import numpy as np
    from smt.applications import PODI

    interpolated_bases = []
    keep_index_list = []
    max_modes_list = []
    for value in xv1:
        sorted_ind = sorted(
            range(xt1.shape[0]), key=lambda k: abs(xt1[:, 0] - value)[k]
        )
        keep_index = sorted_ind[:n_bases]
        keep_index_list.append(keep_index)

        input_matrices = []
        keep_xt1 = []
        max_modes = max(modes_list[keep_index])
        max_modes_list.append(max_modes)
        for i in keep_index:
            input_matrices.append(local_pod_bases[i][:, :max_modes])
            keep_xt1.append(xt1[i, 0])

        basis = PODI.interp_subspaces(
            xt1=np.atleast_2d(keep_xt1).T,
            input_matrices=input_matrices,
            xn1=np.atleast_2d(value),
            frechet=True,
            print_global=False,
        )
        interpolated_bases.append(basis[0])
    return interpolated_bases, keep_index_list
interpolated_bases, keep_index_list = choose_local_bases(
local_pod_bases,
n_bases=n_bases,
modes_list=np.array(n_modes_list),
xt1=xt1,
xv1=xv1,
)
# Choosing a value from the validation inputs
i = 0
podi = PODI()
j = []
for ind in keep_index_list[i]:
    j += list(range(ind * nt2, (ind + 1) * nt2))
podi.compute_pod(
database=dbt[:, j], pod_type="local", local_basis=interpolated_bases[i]
)
n_modes = podi.get_n_modes()
print(f"{n_modes} modes were kept.")
# Choosing the default interp options
# Setting the training values
podi.set_training_values(xt=np.atleast_2d(xt[j]))
# Training the models
podi.train()
# predicting the desired values with inputs
values = podi.predict_values(np.atleast_2d(xv[i]))
diff = dbv[:, i] - values[:, 0]
rms_error = np.sqrt(np.mean(diff**2))
plt.figure(figsize=(8, 5))
plt.scatter(
y,
values,
color="r",
marker="x",
s=15,
alpha=1.0,
label="prediction (mean)",
)
plt.scatter(
y,
dbv[:, i],
color="b",
marker="*",
s=5,
alpha=1.0,
label="reference",
)
plt.plot([], [], color="w", label="rmse = " + str(round(rms_error, 5)))
ax = plt.gca()
ax.axes.xaxis.set_visible(False)
plt.ylabel("u(x = " + str(xv[i, 0])[:4] + ")")
plt.title(f"Estimation of u at x = ({str(xv[i, 0])[:4]}, {str(xv[i, 1])[:4]})")
plt.legend()
plt.show()
8 modes were kept.

PODI class API¶
- class smt.applications.podi.PODI(**kwargs)[source]¶
Class for Proper Orthogonal Decomposition and Interpolation (PODI) based surrogate models.
- Attributes:
- pod_type : str
Indicates which type of POD should be performed (‘global’ or ‘local’).
- nx : int
Dimension of the inputs in the DoE.
- n_snapshot : int
Number of snapshots in the database.
- ny : int
Dimension of the vector associated to a snapshot.
- database : np.ndarray[ny, n_snapshot]
Database containing the vectorial snapshots.
- n_modes : int
Number of kept modes during the POD.
- basis : np.ndarray[ny, n_modes]
POD basis.
- EV_ratio : float
Ratio of explained variance according to the kept modes during the POD (only for global POD).
- singular_vectors : np.ndarray
Singular vectors of the POD (only for global POD).
- singular_values : np.ndarray
Singular values of the POD (only for global POD).
- interp_coeff : list[SurrogateModel]
List containing the surrogate models used during the interpolation.
Methods
choice_n_modes_tol(EV_list, tol)
Static method calculating the required number of kept modes to explain at least the intended ratio of variance.
compute_global_pod([tol, n_modes, seed])
Performs the global POD.
compute_pod(database[, pod_type, tol, ...])
Performs the POD (global or local).
compute_pod_errors(xt, database[, ...])
Calculates different errors for the POD.
get_basis()
Getter for the basis used for the POD.
get_ev_list()
Getter for the explained variance list.
get_ev_ratio()
Getter for the explained variance ratio with the kept modes.
get_interp_coeff()
Getter for the list of the interpolation surrogate models used.
get_n_modes()
Getter for the number of modes kept during the POD.
get_singular_values()
Getter for the singular values from the Sigma matrix of the POD.
get_singular_vectors()
Getter for the singular vectors of the global POD.
interp_subspaces(xt1, input_matrices, xn1[, ...])
Static method computing the interpolation of subspaces.
predict_derivatives(xn, kx)
Predict the dy_dx derivatives at a set of points.
predict_values(xn)
Predict the output values at a set of points.
predict_variance_derivatives(xn, kx)
Predict the derivatives of the variances at a set of points.
predict_variances(xn)
Predict the variances at a set of points.
set_interp_options([interp_type, interp_options])
Set the options for the interpolation surrogate models used.
set_training_values(xt)
Set training data (values).
train()
Performs the training of the model.
Examples
>>> from smt.applications import PODI
>>> sm = PODI()
- get_singular_vectors() ndarray [source]¶
Getter for the singular vectors of the global POD. It represents the directions of maximum variance in the data.
- Returns:
- singular_vectors : np.ndarray
Singular vectors of the global POD.
- get_basis() ndarray [source]¶
Getter for the basis used for the POD.
- Returns:
- basis : np.ndarray
Basis of the POD.
- get_singular_values() ndarray [source]¶
Getter for the singular values from the Sigma matrix of the POD.
- Returns:
- singular_values : np.ndarray
Singular values of the POD.
- get_ev_list() float [source]¶
Getter for the explained variance list.
- Returns:
- EV_list
List with the explained variance of each mode.
- get_ev_ratio() float [source]¶
Getter for the explained variance ratio with the kept modes.
- Returns:
- EV_ratio : float
Explained variance ratio with the current kept modes.
- get_n_modes() int [source]¶
Getter for the number of modes kept during the POD.
- Returns:
- n_modes : int
Number of modes kept during the POD.
- set_interp_options(interp_type: str = 'KRG', interp_options: list = [{}]) None [source]¶
Set the options for the interpolation surrogate models used. Only required if a model different than KRG is used or if non-default options are desired for the models.
- Parameters:
- interp_type : str
Name of the type of surrogate model that will be used for the whole set. By default, the Kriging model is used (KRG).
- interp_options : list[dict]
List containing dictionaries for the options. The k-th dictionary corresponds to the options of the k-th interpolation model. If the options are common to all surrogate models, only a single dictionary is required in the list. The available options can be found in the documentation of the corresponding surrogate models. By default, the print_global options are set to ‘False’.
Examples
>>> interp_type = "KRG" >>> dict1 = {'corr' : 'matern52', 'theta0' : [1e-2]} >>> dict2 = {'poly' : 'quadratic'} >>> interp_options = [dict1, dict2] >>> sm.set_interp_options(interp_type, interp_options)
- set_training_values(xt: ndarray) None [source]¶
Set training data (values). If the models’ options have not been set yet, default values are used for the initialization.
- Parameters:
- xt : np.ndarray[n_snapshot, nx]
The input values for the n_snapshot training points.
- get_interp_coeff() ndarray [source]¶
Getter for the list of the interpolation surrogate models used
- Returns:
- interp_coeff : np.ndarray[n_modes]
List of the kriging models used for the POD coefficients.
- predict_values(xn) ndarray [source]¶
Predict the output values at a set of points.
- Parameters:
- xn : np.ndarray[n_new, nx]
Input values for the prediction points.
- Returns:
- yn : np.ndarray[ny, n_new]
Output values at the prediction points.
- predict_variances(xn) ndarray [source]¶
Predict the variances at a set of points.
- Parameters:
- xn : np.ndarray[n_new, nx]
Input values for the prediction points.
- Returns:
- s2 : np.ndarray[ny, n_new]
Variances.
- predict_derivatives(xn, kx) ndarray [source]¶
Predict the dy_dx derivatives at a set of points.
- Parameters:
- xn : np.ndarray[n_new, nx]
Input values for the prediction points.
- kx : int
The 0-based index of the input variable with respect to which derivative is desired.
- Returns:
- dy_dx : np.ndarray[ny, n_new]
Derivatives.
- predict_variance_derivatives(xn, kx) ndarray [source]¶
Predict the derivatives of the variances at a set of points.
- Parameters:
- xn : np.ndarray[n_new, nx]
Input values for the prediction points.
- kx : int
The 0-based index of the input variable with respect to which derivative is desired.
- Returns:
- dv_dx : np.ndarray[ny, n_new]
Derivatives of the variances.
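A short usage sketch of the prediction methods, assuming a PODI model sm that has already been through compute_pod, set_training_values and train (as in the examples above), and prediction inputs xn of shape [n_new, nx]:

>>> yn = sm.predict_values(xn)  # mean predictions, shape [ny, n_new]
>>> s2 = sm.predict_variances(xn)  # prediction variances, shape [ny, n_new]
>>> dy_dx = sm.predict_derivatives(xn, kx=0)  # derivatives with respect to the first input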