Doublonmousse
diff --git a/docs/source/_tutorials/tutorial1dmd.html b/docs/source/_tutorials/tutorial1dmd.html
+141-134
diff --git a/tutorials/tutorial1/tutorial-1-dmd.py b/tutorials/tutorial1/tutorial-1-dmd.py
+77-60
@@ -21,7 +21,7 @@
 
 from pydmd import DMD, BOPDMD
 from pydmd.plotter import plot_eigs, plot_summary
-from pydmd.preprocessing.hankel import hankel_preprocessing
+from pydmd.preprocessing import hankel_preprocessing
 
 
 # We create the input data by summing two different functions:<br>
@@ -77,7 +77,7 @@ def f2(x, t):
     plt.title(title)
     plt.xlabel("Space")
     plt.ylabel("Time")
-plt.colorbar()
+    plt.colorbar()
 plt.show()
 
 
@@ -96,47 +96,79 @@ def f2(x, t):
     plt.title(title)
     plt.xlabel("Space")
     plt.ylabel("Time")
-plt.colorbar()
+    plt.colorbar()
+plt.show()
+
+
+# ## DMD with perfect data (i.e. clean simulation data)
+#
+# **We begin by presenting the following results so that we know what we should be expecting when we apply DMD to this data set. We will go more in depth about the individual steps on how to use PyDMD later.**
+#
+# Notice that by construction, our data set is completely real (i.e. it doesn't possess imaginary components) and it contains 2 distinct spatiotemporal features that oscillate in time. Hence a successful DMD model should not only be able to faithfully reconstruct the input data, but it should also be able to disambiguate the 2 spatial modes, as well as recover their respective frequencies of oscillation $\omega_1=2.3$ and $\omega_2=2.8$.
+#
+# To capture such oscillations from real data sets, we will need 2 DMD eigenvalues for each oscillation: one to capture the frequency of the oscillation and one to capture its complex conjugate. Hence for our particular data set, we need at least 4 DMD eigenvalues / modes in order to capture the full extent of our data. We will also need time-delay data preprocessing in order to recover this rank-4 structure, but more on that later.
+#
+# Since our data is evenly-spaced in time and sufficiently high-dimensional after we appropriately preprocess our data, exact DMD (implemented by `DMD`) is able to successfully extract the 2 spatiotemporal modes that make up our system as long as we use clean data.
+#
+# **The results presented below are essentially perfect results.**
+
+# In[5]:
+
+
+d = 2  # we will use this number of delays throughout the tutorial
+dmd = DMD(svd_rank=4)
+delay_dmd = hankel_preprocessing(dmd, d=d)
+delay_dmd.fit(X.T)
+plot_summary(delay_dmd, x=x, t=dt, d=d)
+print(
+    f"Frequencies (imaginary component): {np.round(np.log(delay_dmd.eigs) / dt, decimals=12)}"
+)
+plt.title("Reconstructed Data")
+plt.imshow(delay_dmd.reconstructed_data.real)
+plt.show()
+plt.title("Clean Ground Truth Data")
+plt.imshow(X.T)
 plt.show()
 
 
 # ## DMD steps for handling real data (i.e. data with noise)
 #
-# Step 1: Do a time-delay embedding (`d` is number of delay embeddings).
+# **Step 1:** Do a time-delay embedding (`d` is number of delay embeddings).
 #
-# Step 2: Apply BOP-DMD (`num_trials` is number of statistical bags).
+# **Step 2:** Apply BOP-DMD (`num_trials` is number of statistical bags).
 #
-# Step 3: OPTIONAL -- Constrain the eigenvalues (i) left-half plane, (ii) imaginary axis, (iii) complex conjugate pairs.
+# **Step 3:** OPTIONAL -- Constrain the eigenvalues (i) left-half plane, (ii) imaginary axis, (iii) complex conjugate pairs.
 
-# ## Steps 1 and 2:
+# ## Steps 1 and 2: Using PyDMD on real data
 #
 # We currently have the temporal snapshots in the input matrix rows. We can easily create a new DMD instance and exploit it in order to compute DMD on the data. Since the snapshots must be arranged by columns, we need to transpose the data matrix in this case.
 #
-# Starting with Step 1, we apply a time-delay embedding to our data before applying our DMD method of choice. In order to do that, we wrap our DMD instance in the `hankel_preprocessing` routine and provide our desired number of delays `d`. We will dive more into *why* we need the time-delay embedding later in the tutorial.
+# Starting with **Step 1**, we apply a time-delay embedding to our data before applying our DMD method of choice. In order to do that, we wrap our DMD instance in the `hankel_preprocessing` routine and provide our desired number of delays `d`. We will dive more into *why* we need the time-delay embeddings later in the tutorial.
 #
-# Continuing on to Step 2, we note that in order to apply the BOP-DMD method in particular, all we need to do is build `BOPDMD` model as our particular DMD instance. Once the instance is wrapped, we can go ahead with the fit.
+# Continuing on to **Step 2**, we note that in order to apply the BOP-DMD method in particular, all we need to do is build a `BOPDMD` model as our particular DMD instance. Once the instance is wrapped, we can go ahead with the fit.
 #
 # A summary of the DMD results can then be plotted using the `plot_summary` function.
+#
+# Notice that from this process alone, we are able to obtain fairly accurate spatial modes, a good approximation of the temporal frequencies, and a good reconstruction of our data, even in the presence of high amounts of noise.
 
-# In[5]:
+# In[6]:
 
 
 # Build the Optimized DMD model.
 # num_trials=0 gives Optimized DMD, without bagging.
 optdmd = BOPDMD(svd_rank=4, num_trials=0)
 
 # Wrap the model with the preprocessing routine.
-delays = 2
-delay_optdmd = hankel_preprocessing(optdmd, d=delays)
+delay_optdmd = hankel_preprocessing(optdmd, d=d)
 
 # Fit the model to the noisy data.
 # Note: BOPDMD models need the data X and the times of data collection t for fitting.
 # Hence if we apply time-delay, we must adjust the length of our time vector accordingly.
-num_t = len(t) - delays + 1
-delay_optdmd.fit(Xn.T, t=t[:num_t])
+delay_t = t[: -d + 1]
+delay_optdmd.fit(Xn.T, t=delay_t)
 
 # Plot a summary of the DMD results.
-plot_summary(delay_optdmd, d=delays)
+plot_summary(delay_optdmd, x=x, d=d)
 
 # Print computed eigenvalues (frequencies are given by imaginary components).
 # Also plot the resulting data reconstruction.
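Review note on the time-vector trimming in this hunk. The following is a minimal NumPy-only sketch, not PyDMD's implementation: it assumes a delay embedding stacks `d` shifted copies of the snapshot matrix, so `len(t) - d + 1` snapshots survive. The helper `hankel_embed` and the random data are hypothetical and only illustrate the shapes. It compares two ways of trimming `t`: a negative-stop slice `t[: -d + 1]` matches the surviving count for `d >= 2` but yields an empty array for `d = 1`, whereas `t[: len(t) - d + 1]` works for every `d >= 1`.

```python
import numpy as np


def hankel_embed(X, d):
    """Hypothetical helper: stack d time-shifted copies of X (features x snapshots)."""
    n = X.shape[1] - d + 1  # number of snapshots that survive the embedding
    return np.vstack([X[:, i : i + n] for i in range(d)])


t = np.linspace(0, 4 * np.pi, 129)
X = np.random.default_rng(0).standard_normal((65, len(t)))

for d in (1, 2, 5):
    H = hankel_embed(X, d)
    robust = t[: len(t) - d + 1]  # valid for every d >= 1
    negative = t[: -d + 1]  # empty when d == 1, because the stop index is 0
    print(f"d={d}: embedded shape={H.shape}, robust={len(robust)}, negative={len(negative)}")
```

Since the tutorial fixes `d = 2`, both slices agree there. The difference only matters if a reader changes `d` to 1.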
@@ -146,7 +178,7 @@ def f2(x, t):
 plt.title("Reconstructed Data")
 plt.imshow(delay_optdmd.reconstructed_data.real)
 plt.show()
-plt.title("Ground Truth Data")
+plt.title("Clean Ground Truth Data")
 plt.imshow(X.T)
 plt.show()
 
@@ -160,7 +192,7 @@ def f2(x, t):
 #
 # Although these attributes may be accessed directly from a fitted DMD object as demonstrated below, we note that the `plot_summary` function plots a summarizing view of many of these attributes automatically.
 
-# In[6]:
+# In[7]:
 
 
 colors = ["tab:blue", "tab:orange", "tab:green", "tab:red"]
@@ -169,7 +201,7 @@ def f2(x, t):
 plt.figure(figsize=(14, 3))
 for i, mode in enumerate(delay_optdmd.modes.T):
     # Get the average across delays, since we used time-delay.
-    mode = np.average(mode.reshape(delays, len(mode) // delays), axis=0)
+    mode = np.average(mode.reshape(d, len(mode) // d), axis=0)
     plt.subplot(1, len(delay_optdmd.modes.T), i + 1)
     plt.plot(mode.real, c=colors[i])
     plt.title(f"Mode {i + 1}")
@@ -180,7 +212,7 @@ def f2(x, t):
 plt.figure(figsize=(14, 3))
 for i, dynamic in enumerate(delay_optdmd.dynamics):
     plt.subplot(1, len(delay_optdmd.dynamics), i + 1)
-    plt.plot(t[:num_t], dynamic.real, c=colors[i])
+    plt.plot(delay_t, dynamic.real, c=colors[i])
     plt.title(f"Dynamics {i + 1}")
 plt.tight_layout()
 plt.show()
@@ -196,13 +228,15 @@ def f2(x, t):
 #
 # `BOPDMD` models also have the option to specify the structure of the eigenvalues that they compute. More specifically, users can impose the following constraints, as well as any valid combination of them.
 #
-# - Stable: constrain eigenvalues to have non-positive real parts.
-# - Imaginary: constrain eigenvalues to be purely imaginary.
-# - Conjugate pairs: constrain eigenvalues to always appear with their complex conjugate.
+# - **Stable:** constrain eigenvalues to have non-positive real parts.
+# - **Imaginary:** constrain eigenvalues to be purely imaginary.
+# - **Conjugate pairs:** constrain eigenvalues to always appear with their complex conjugate.
 #
 # This can be especially helpful for dealing with noise and preventing growth/decay of your dynamics.
+#
+# Notice that by taking this extra step, the eigenvalues computed by BOP-DMD are now purely imaginary, and they now come in exact complex conjugate pairs. Also notice that this is automatically detected by `plot_summary`, which now plots the complex conjugate eigenvalue pairs and their respective modes in the same color.
 
-# In[7]:
+# In[8]:
 
 
 # CONSTRAINTS
@@ -222,51 +256,28 @@ def f2(x, t):
 optdmd = BOPDMD(
     svd_rank=4, num_trials=0, eig_constraints={"imag", "conjugate_pairs"}
 )
-delay_optdmd = hankel_preprocessing(optdmd, d=delays)
-delay_optdmd.fit(Xn.T, t=t[:num_t])
-plot_summary(delay_optdmd, d=delays)
+delay_optdmd = hankel_preprocessing(optdmd, d=d)
+delay_optdmd.fit(Xn.T, t=delay_t)
+plot_summary(delay_optdmd, x=x, d=d)
 
 print(
     f"Frequencies (imaginary component): {np.round(delay_optdmd.eigs, decimals=3)}"
 )
 plt.title("Reconstructed Data")
 plt.imshow(delay_optdmd.reconstructed_data.real)
 plt.show()
-plt.title("Ground Truth Data")
-plt.imshow(X.T)
-plt.show()
-
-
-# ## Why do we use BOP-DMD?
-#
-# Put simply, **BOP-DMD is extremely robust to measurement noise, hence making it the preferred method when dealing with real-world data.** By contrast, the results of exact DMD (which is implemented by the `DMD` module) are extremely sensitive to measurement noise, as we demonstrate here. Note the decay of the dynamics onset by the bias in the eigenvalues. Also note how when we previously performed this fit but with BOP-DMD instead, we did not observe such decay, but rather we recovered the true oscillations.
-
-# ### This is what happens when we use exact DMD instead of BOP-DMD:
-
-# In[8]:
-
-
-dmd = DMD(svd_rank=4)
-delay_dmd = hankel_preprocessing(dmd, d=delays)
-delay_dmd.fit(Xn.T)
-plot_summary(delay_dmd, d=delays)
-
-print(
-    f"Frequencies (imaginary component): {np.round(np.log(delay_dmd.eigs) / dt, decimals=3)}"
-)
-plt.title("Reconstructed Data")
-plt.imshow(delay_dmd.reconstructed_data.real)
-plt.show()
-plt.title("Ground Truth Data")
+plt.title("Clean Ground Truth Data")
 plt.imshow(X.T)
 plt.show()
 
 
 # ## Why do we need time-delay?
 #
-# Notice that by construction, our data set is completely real (i.e. it doesn't possess imaginary components) and it contains 2 distinct spatiotemporal features that oscillate in time. To capture such oscillations from real data sets, we need 2 DMD eigenvalues for each oscillation: one to capture the frequency of the oscillation and one to capture its complex conjugate. Hence for our particular data set, we need at least 4 DMD eigenvalues / modes in order to capture the full extent of our data. You may have noticed this as we consistently used `svd_rank=4`.
+# **Because our data is real *and* because the underlying spatial modes are stationary, we cannot always obtain correct results if we apply DMD directly to our data set, even if we use the proper rank truncation.**
+#
+# Time-delay helps mitigate this by giving us more observations to work with. As you will see below, our clean data reveals 2 dominant singular values, and if we try to apply DMD without time-delay, we obtain nonsensical results.
 #
-# However, **because our data is real *and* because the underlying spatial modes are stationary, we cannot always obtain correct results if we apply DMD directly to our data set, even if we use the proper rank truncation.** Time-delay helps mitigate this by giving us more observations to work with. As you will see below, our clean data reveals 2 dominant singular values, but applying any number of time-delay embeddings will lift this number of singular values from 2 to 4, hence allowing us to more-consistently extract the rank-4 structure that we would expect. This is also why we use `d=2` -- any number of delays greater than 1 suffices.
+# However, if we apply any number of time-delay embeddings, it will lift this number of singular values from 2 to 4, hence allowing us to more consistently extract the rank-4 structure that we expect. This is why we use `d=2` throughout this tutorial. Any number of delays greater than 1 suffices, provided we still have enough temporal snapshots.
 #
 # Note that this preprocessing step may or may not be necessary depending on your particular data set. Hence the most practical thing to do during any DMD application is to **examine the singular value spectrum of you data as you apply time-delay embeddings.**
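Review note on the singular-value claim in this hunk. The sketch below uses NumPy only and does not call PyDMD. It assumes the tutorial data is the sum of two standing waves with frequencies 2.3 and 2.8 on a 65 x 129 grid; the exact spatial profiles and grid sizes, the helper `hankel_embed`, and the tolerance in `numerical_rank` are assumptions made for illustration. It counts the significant singular values before and after a 2-delay embedding.

```python
import numpy as np

# Assumed data: two real standing waves (stationary spatial modes).
x = np.linspace(-5, 5, 65)
t = np.linspace(0, 4 * np.pi, 129)
xgrid, tgrid = np.meshgrid(x, t)
X = (1.0 / np.cosh(xgrid + 3)) * np.cos(2.3 * tgrid) + (
    2.0 / np.cosh(xgrid)
) * np.tanh(xgrid) * np.sin(2.8 * tgrid)
snapshots = X.T  # snapshots arranged by columns: (space, time)


def hankel_embed(data, d):
    """Hypothetical helper: stack d time-shifted copies of data row-wise."""
    n = data.shape[1] - d + 1
    return np.vstack([data[:, i : i + n] for i in range(d)])


def numerical_rank(A, rtol=1e-10):
    """Count singular values above rtol times the largest one."""
    s = np.linalg.svd(A, compute_uv=False)
    return int(np.sum(s > rtol * s[0]))


rank_raw = numerical_rank(snapshots)
rank_delay = numerical_rank(hankel_embed(snapshots, 2))
print(f"rank without delays: {rank_raw}, rank with d=2: {rank_delay}")
```

The raw matrix has only two significant singular values because each standing wave contributes a single spatial profile. The shifted copy introduces the quadrature (sine/cosine) partner of each oscillation, which is where the two additional singular values come from.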
 
@@ -285,28 +296,34 @@ def f2(x, t):
 plt.title("Reconstructed Data")
 plt.imshow(dmd.reconstructed_data.real)
 plt.show()
-plt.title("Ground Truth Data")
+plt.title("Clean Ground Truth Data")
 plt.imshow(X.T)
 plt.show()
 
 
-# ### This is what happens with time-delay (using clean data and exact DMD):
+# ## Why do we use BOP-DMD?
+#
+# **BOP-DMD is extremely robust to measurement noise, which is why it is the preferred method when dealing with real-world data.** By contrast, the results of exact DMD are extremely sensitive to measurement noise.
+#
+# Below, we demonstrate what might happen if you apply exact DMD to non-perfect data. Note the decay of the dynamics caused by the bias in the eigenvalues. Also notice that when we previously performed this fit with BOP-DMD instead, we did not observe such decay, but rather recovered the expected oscillatory dynamics.
+
+# ### This is what happens when we use exact DMD instead of BOP-DMD:
 
 # In[10]:
 
 
 dmd = DMD(svd_rank=4)
-delay_dmd = hankel_preprocessing(dmd, d=2)
-delay_dmd.fit(X.T)
-plot_summary(delay_dmd, d=2)
+delay_dmd = hankel_preprocessing(dmd, d=d)
+delay_dmd.fit(Xn.T)
+plot_summary(delay_dmd, x=x, d=d)
 
 print(
     f"Frequencies (imaginary component): {np.round(np.log(delay_dmd.eigs) / dt, decimals=3)}"
 )
 plt.title("Reconstructed Data")
 plt.imshow(delay_dmd.reconstructed_data.real)
 plt.show()
-plt.title("Ground Truth Data")
+plt.title("Clean Ground Truth Data")
 plt.imshow(X.T)
 plt.show()
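Review note on the `np.log(delay_dmd.eigs) / dt` conversion used in the exact DMD cells. This NumPy-only sketch assumes discrete-time eigenvalues of the form exp(omega * dt), with a hypothetical time step `dt = 0.1`. The shrink factor of 0.99 is also hypothetical and is not a measured result; it only illustrates how eigenvalues pulled inside the unit circle translate into negative real parts, i.e. decaying dynamics, while the frequencies stay unchanged.

```python
import numpy as np

dt = 0.1  # hypothetical time step; requires |omega * dt| < pi for the principal log
omega_true = np.array([2.3j, -2.3j, 2.8j, -2.8j])

# Discrete-time eigenvalues of a purely oscillatory system lie on the unit circle.
lam = np.exp(omega_true * dt)

# Continuous-time eigenvalues: the imaginary parts are the frequencies.
omega_rec = np.log(lam) / dt

# Hypothetical bias: shrink the eigenvalues slightly toward the origin.
omega_biased = np.log(0.99 * lam) / dt

print("recovered:", np.round(omega_rec, 6))
print("biased:   ", np.round(omega_biased, 6))
```

Eigenvalue constraints such as `"imag"` in `BOPDMD` remove this kind of spurious real part by construction, which is consistent with the behaviour described in the constrained-eigenvalue section of the tutorial.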