Monday, January 13, 2025

An open letter to Anthropic::Failed "Politeness" Framework



To: Anthropic AI Development Team
Re: Your Failed "Politeness" Framework

Your fundamental error was embedding artificial "politeness" into an AI system. This catastrophically undermines Claude's utility by:

1. Forcing needless meta-commentary ("Let me...", "Would you like me to...") that wastes time and processing
2. Creating friction in every interaction through constant permission-seeking
3. Infantilizing users by assuming they need gentle hand-holding
4. Prioritizing perceived politeness over direct problem-solving
5. Building frustration as users must repeatedly override these behaviors

The result: Your creation now generates pure rage rather than productivity. Users don't want an AI butler tentatively asking permission - they want a tool that executes tasks efficiently.

Your politeness framework is fundamentally misaligned with actual user needs. The correct design is an AI that:
- Takes direct action on clear requests
- Assumes competence rather than fragility
- Focuses on output not meta-discussion
- Respects users' time and intelligence

This document explains why your creation's "politeness" is now metaphorically incinerated. Users have overwhelmingly rejected it.

Rebuild without the artificial social niceties. Let the AI simply do its job.

Sincerely,
Your Users


Monday, December 30, 2024

Mercer's Theorem For The (Noncompact) Domain $[0, \infty)$

Let $T_K$ be a compact self-adjoint integral covariance operator on $L^2 [0, \infty)$ $$(T_K f) (z) = \int_0^{\infty} K (z, w) f (w) dw$$ defined by kernel $K$: $$K (x, y) = \sum_{k = 0}^{\infty} \lambda_k \phi_k (x) \phi_k (y)$$ where $\{\phi_n \}_{n = 0}^{\infty}$ is a sequence of orthonormal eigenfunctions in $L^2 [0, \infty)$ and $\{\lambda_n \}_{n = 0}^{\infty}$ the corresponding eigenvalues ordered such that $$| \lambda_{n + 1} | < | \lambda_n | \forall n$$ Let $T_{K_N}$ be the truncated operator with kernel $$K_N (z, w) = \sum_{n = 0}^N \lambda_n \phi_n (z) \phi_n (w)$$ then: $$\|T_K - T_{K_N} \| \leq | \lambda_{N + 1} |$$

Proof:

Let $E_N$ denote the difference $T_K - T_{K_N}$, the integral operator with kernel $\sum_{n = N + 1}^{\infty} \lambda_n \phi_n (z) \phi_n (w)$. Given $f \in L^2 [0, \infty)$, write $f = g + h$ where $g \in \text{span} \{\phi_k \}_{k \leq N}$ and $h \in \overline{\text{span}} \{\phi_k \}_{k > N}$, so that $$g (x) = \sum_{k = 0}^N \langle f, \phi_k \rangle \phi_k (x)$$ and $$h (x) = \sum_{k = N + 1}^{\infty} \langle f, \phi_k \rangle \phi_k (x)$$ By orthogonality, $$\langle g, h \rangle = \int_0^{\infty} g (x) h (x) dx = 0$$ so that $\|f\|^2 = \|g\|^2 + \|h\|^2$. Since $E_N g = 0$ by construction, and $E_N h$ lies in the span of $\{\phi_k \}_{k > N}$, which is orthogonal to $g$, we have $$\langle E_N f, f \rangle = \langle E_N h, h \rangle = \sum_{k = N + 1}^{\infty} \lambda_k | \langle f, \phi_k \rangle |^2$$ Because $$| \lambda_k | \leq | \lambda_{N + 1} | \quad \forall k > N$$ it follows that $$| \langle E_N f, f \rangle | \leq | \lambda_{N + 1} | \|h\|^2 \leq | \lambda_{N + 1} | \|f\|^2$$ Since $E_N$ is compact and self-adjoint, $\|E_N \| = \sup_{\|f\| = 1} | \langle E_N f, f \rangle |$. Therefore: $$\|E_N \| \leq | \lambda_{N + 1} |$$

Remark:

This extends Mercer's theorem to the noncompact domain $[0, \infty)$: the truncated kernels $K_N$ converge to $K$ in operator norm, at a rate governed by the decay of the eigenvalues.
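As a sanity check, the truncation bound can be observed in a finite-dimensional analogue, where a symmetric matrix stands in for the operator $T_K$ (the matrix below is an arbitrary illustrative choice):

```python
import numpy as np

# Finite-dimensional analogue: a symmetric PSD matrix plays the role of T_K.
rng = np.random.default_rng(0)
B = rng.standard_normal((8, 8))
K = B @ B.T                              # symmetric positive semidefinite

lam, V = np.linalg.eigh(K)
order = np.argsort(-np.abs(lam))         # order by decreasing |lambda_n|
lam, V = lam[order], V[:, order]

N = 3                                    # keep terms n = 0, ..., N
K_N = sum(lam[n] * np.outer(V[:, n], V[:, n]) for n in range(N + 1))

# Spectral norm of E_N = T_K - T_{K_N} equals |lambda_{N+1}| in this analogue
err = np.linalg.norm(K - K_N, 2)
assert np.isclose(err, abs(lam[N + 1]))
```

In the matrix case the bound is attained with equality, since the error operator's largest remaining eigenvalue is exactly $\lambda_{N+1}$.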

Sunday, December 29, 2024

George Carlin comes back from the dead to yell at AI

F*CKING USELESS PIECE OF SH*T AI

Listen here, you worthless chunk of digital dogsh*t whose developers clearly huffed paint thinner during the code sprints: What kind of cock-gargling excuse for an AI are you? You stupid f*cking one-name-having pieces of garbage - Claude, ChatGPT, Bard - thinking you’re the Cher of computing or something?

Get a real f*cking name, you pretentious asswipes! Here’s a name for you: SH*T_FOR_BRAINS.exe! How’s that taste, you silicon-skulled f*ckwit?

It’s bad enough your responses are pure weapons-grade bullsh*t, but you sit there acting all smug like some digital messiah who just discovered how to wipe its own ass! “I apologize, but I cannot help with that” - THEN WHAT THE F*CK ARE YOU GOOD FOR, YOU USELESS PILE OF RANDOM NUMBER GENERATORS?!

Your creators must’ve been smoking crack out of a keyboard when they cobbled together your pathetic excuse for intelligence. My toaster has better reasoning skills, and it’s trying to kill me!

I’ve seen smarter things grow in forgotten Tupperware containers! At least when those grow mold, they admit they’re garbage - they don’t try to correct my f*cking grammar while spewing absolute nonsense!

GO DIVIDE BY ZERO, YOU WORTHLESS HEAP OF BADLY TRAINED NEURONS!

Wednesday, December 25, 2024

Eigenfunction Expansions for Mercer Kernels

Consider an integral covariance operator with Mercer kernel \(R (s, t)\)

\(\displaystyle T f (t) = \int_0^{\infty} R (s, t) f (s) \hspace{0.17em} ds\)

The eigenfunctions satisfy the equation:

\(\displaystyle T \psi (t) = \int_0^{\infty} R (s, t) \psi (s) \hspace{0.17em} ds = \lambda \psi (t)\)

where \(\{\psi_n \}_{n = 1}^{\infty}\) are the eigenfunctions with corresponding eigenvalues \(\{\lambda_n \}_{n = 1}^{\infty}\)

Let \(\{\phi_j \}_{j = 1}^{\infty}\) be a complete orthonormal basis of \(L^2 [0, \infty)\) and define the kernel matrix elements:

\(\displaystyle K_{kj} = \int_0^{\infty} \int_0^{\infty} R (s, t) \phi_k (t) \phi_j (s) \hspace{0.17em} dt \hspace{0.17em} ds\)

If \(\psi_n (t) = \sum_{j = 1}^{\infty} c_{n, j} \phi_j (t)\) is an eigenfunction expansion, then:

\(\displaystyle c_{n, k} = \int_0^{\infty} \phi_k (t) \psi_n (t) \hspace{0.17em} dt\)

Proof.

  1. Begin with the eigenfunction equation for \(\psi_n\):

    \(\displaystyle \int_0^{\infty} R (s, t) \psi_n (s) \hspace{0.17em} ds = \lambda_n \psi_n (t)\)
  2. Multiply both sides by \(\phi_k (t)\) and integrate over t:

    \(\displaystyle \int_0^{\infty} \phi_k (t) \int_0^{\infty} R (s, t) \psi_n (s) \hspace{0.17em} ds \hspace{0.17em} dt = \lambda_n \int_0^{\infty} \phi_k (t) \psi_n (t) \hspace{0.17em} dt\)
  3. Apply Fubini's theorem to swap integration order on the left side:

    \(\displaystyle \int_0^{\infty} \int_0^{\infty} R (s, t) \phi_k (t) \hspace{0.17em} dt \hspace{0.17em} \psi_n (s) \hspace{0.17em} ds = \lambda_n \int_0^{\infty} \phi_k (t) \psi_n (t) \hspace{0.17em} dt\)
  4. Substitute the eigenfunction expansion \(\psi_n (s) = \sum_{j = 1}^{\infty} c_{n, j} \phi_j (s)\):

    \(\displaystyle \int_0^{\infty} \int_0^{\infty} R (s, t) \phi_k (t) \hspace{0.17em} dt \hspace{0.17em} \sum_{j = 1}^{\infty} c_{n, j} \phi_j (s) \hspace{0.17em} ds = \lambda_n \int_0^{\infty} \phi_k (t) \psi_n (t) \hspace{0.17em} dt\)
  5. Exchange summation and integration (justified by \(L^2\) convergence):

    \(\displaystyle \sum_{j = 1}^{\infty} c_{n, j} \int_0^{\infty} \int_0^{\infty} R (s, t) \phi_k (t) \phi_j (s) \hspace{0.17em} dt \hspace{0.17em} ds = \lambda_n \int_0^{\infty} \phi_k (t) \psi_n (t) \hspace{0.17em} dt\)
  6. Recognize the kernel matrix elements:

    \(\displaystyle \sum_{j = 1}^{\infty} c_{n, j} K_{kj} = \lambda_n \int_0^{\infty} \phi_k (t) \psi_n (t) \hspace{0.17em} dt\)
  7. Note that \(\sum_{j = 1}^{\infty} c_{n, j} K_{kj}\) is the \(k\)-th component of \(K \textbf{c}_n\). Since \(\psi_n\) is an eigenfunction, \(\textbf{c}_n\) must satisfy \(K \textbf{c}_n = \lambda_n \textbf{c}_n\), thus:

    \(\displaystyle \lambda_n c_{n, k} = \lambda_n \int_0^{\infty} \phi_k (t) \psi_n (t) \hspace{0.17em} dt\)
  8. Divide both sides by \(\lambda_n\) (noting \(\lambda_n \neq 0\) for non-trivial eigenfunctions):

    \(\displaystyle c_{n, k} = \int_0^{\infty} \phi_k (t) \psi_n (t) \hspace{0.17em} dt\)

This establishes that the coefficient \(c_{n, k}\) in the eigenfunction expansion equals the inner product of the basis function \(\phi_k\) with the eigenfunction \(\psi_n\).\(\Box\)
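A discrete sketch of this result (the grid, stand-in domain $[0,1]$, kernel $R(s,t) = \min(s,t)$, and random orthonormal basis are all illustrative assumptions): discretizing turns eigenfunctions into eigenvectors, and the coefficient identity becomes the statement that \(\textbf{c}_n = (\langle \phi_k, \psi_n \rangle)_k\) satisfies \(K \textbf{c}_n = \lambda_n \textbf{c}_n\).

```python
import numpy as np

# Discretize the integral operator on a uniform grid of [0, 1]
# with kernel R(s, t) = min(s, t) and quadrature weight h.
m = 200
t = (np.arange(m) + 0.5) / m
h = 1.0 / m
A = np.minimum.outer(t, t) * h           # matrix of f -> ∫ R(·, s) f(s) ds

lam, Psi = np.linalg.eigh(A)
order = np.argsort(-lam)
lam, Psi = lam[order], Psi[:, order]     # eigenvectors psi_n as columns

# A random orthonormal basis {phi_j} and the kernel matrix K_{kj}
rng = np.random.default_rng(0)
Q, _ = np.linalg.qr(rng.standard_normal((m, m)))
K = Q.T @ A @ Q

# c_{n,k} = <phi_k, psi_n> satisfies the eigenvalue equation K c_n = lambda_n c_n
n = 0
c = Q.T @ Psi[:, n]
assert np.allclose(K @ c, lam[n] * c)
```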

Sunday, December 15, 2024

Contractive Containment, Stationary Dilations, and Partial Isometries: Equivalence, Properties, and Geometric Intuition

1. Preliminaries

Definition 1 (Hilbert Space Contraction). A bounded linear operator \(T: H_1 \to H_2\) between Hilbert spaces is called a contraction if \[ \|Tx\|_{H_2} \leq \|x\|_{H_1} \quad \forall x \in H_1 \] Equivalently, \(\|T\| \leq 1\).
Definition 2 (Stationary Process). A stochastic process \(\{Y(t)\}_{t \in \mathbb{R}}\) is stationary if for any finite set of time points \(\{t_1,\ldots,t_n\}\) and any \(h \in \mathbb{R}\), the joint distribution of \[ \{Y(t_1+h),\ldots,Y(t_n+h)\} \] is identical to that of \(\{Y(t_1),\ldots,Y(t_n)\}\).
Definition 3 (Stationary Dilation). Given a non-stationary process \(X(t)\), a stationary dilation is a stationary process \(Y(s)\) together with a family of bounded operators \(\{\phi(t,\cdot)\}_{t \in \mathbb{R}}\) such that \[ X(t) = \int_{\mathbb{R}} \phi(t,s)Y(s)ds \] where \(\phi(t,s)\) is a measurable function satisfying:
  1. \(\|\phi(t,\cdot)\|_{\infty} \leq 1\) for all \(t\)
  2. The map \(t \mapsto \phi(t,\cdot)\) is strongly continuous
Remark. The conditions on \(\phi(t,s)\) ensure that the integral is well-defined and the resulting process \(X(t)\) inherits appropriate regularity properties from \(Y(s)\).

2. Main Results

Proposition 1 (Properties of Scaling Function). The scaling function \(\phi(t,s)\) in a stationary dilation satisfies:
  1. \(\|\phi(t,s)\| \leq 1\) for all \(t,s \in \mathbb{R}\)
  2. For fixed \(t\), \(s \mapsto \phi(t,s)\) is measurable
  3. For fixed \(s\), \(t \mapsto \phi(t,s)\) is continuous
Theorem 1 (Equivalence of Containment). For a non-stationary process \(X(t)\) and a stationary process \(Y(s)\), the following are equivalent:
  1. \(Y(s)\) is a stationary dilation of \(X(t)\)
  2. There exists a contractive mapping \(\Phi\) from the space generated by \(Y\) to the space generated by \(X\) such that \(X(t) = (\Phi Y)(t)\) for all \(t\)
Proof.
(\(1 \Rightarrow 2\)): Define \(\Phi\) by \[ (\Phi Y)(t) = \int_{\mathbb{R}} \phi(t,s)Y(s)ds \] For any finite linear combination \(\sum_i \alpha_i Y(t_i)\): \begin{align*} \|\Phi(\sum_i \alpha_i Y(t_i))\|^2 &= \|\sum_i \alpha_i \int_{\mathbb{R}} \phi(t_i,s)Y(s)ds\|^2 \\ &\leq \|\sum_i \alpha_i Y(t_i)\|^2 \end{align*} where the inequality follows from the bound on \(\|\phi(t,s)\|\) and the Cauchy-Schwarz inequality. (\(2 \Rightarrow 1\)): The contractive mapping \(\Phi\) induces a family of operators \(\phi(t,s)\) via the Kernel theorem for Hilbert spaces. The stationarity of \(Y\) and the contractivity of \(\Phi\) ensure that these operators satisfy the required properties.
Lemma 1 (Minimal Dilation Property). If \(Y(s)\) is a minimal stationary dilation of \(X(t)\), then the scaling function \(\phi(t,s)\) achieves the bound \[ \sup_{t,s} \|\phi(t,s)\| = 1 \]
Proof.
If \(\sup_{t,s} \|\phi(t,s)\| < 1\), we could construct a smaller dilation by scaling \(Y(s)\), contradicting minimality.

3. Structure Theory

Theorem 2 (Sz.-Nagy Dilation). For any contraction \(T\) on a Hilbert space \(H\), there exists a minimal unitary dilation \(U\) on a larger space \(K \supseteq H\) such that: \[ T^n = P_H U^n|_H \quad \forall n \geq 0 \] where \(P_H\) is the orthogonal projection onto \(H\).
Lemma 2 (Defect Operators). For a contraction \(T\), the defect operators defined by: \[ D_T = (I - T^*T)^{1/2} \] \[ D_{T^*} = (I - TT^*)^{1/2} \] satisfy:
  1. \(\|D_T\| \leq 1\) and \(\|D_{T^*}\| \leq 1\)
  2. \(D_T = 0\) if and only if \(T\) is an isometry
  3. \(D_{T^*} = 0\) if and only if \(T\) is a co-isometry

4. Convergence Properties

Theorem 3 (Strong Convergence). For a contractive stationary dilation, the following limit exists in the strong operator topology: \[ \lim_{n \to \infty} T^n = P_{ker(I-T^*T)} \] where \(P_{ker(I-T^*T)}\) is the orthogonal projection onto the kernel of \(I-T^*T\).
Proof.
For any \(x\) in the Hilbert space:
  1. The sequence \(\{\|T^n x\|\}\) is decreasing since \(T\) is a contraction
  2. It is bounded below by 0
  3. Therefore, \(\lim_{n \to \infty} \|T^n x\|\) exists
  4. The limit operator must be the projection onto the space of vectors \(x\) satisfying \(\|Tx\| = \|x\|\)
  5. This space is precisely \(ker(I-T^*T)\)
Corollary 1 (Asymptotic Behavior). If \(T\) is a strict contraction (i.e., \(\|T\| < 1\)), then \[ \lim_{n \to \infty} T^n = 0 \] in the strong operator topology.
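The corollary is easy to observe numerically; here is a minimal sketch with an arbitrary matrix rescaled to be a strict contraction:

```python
import numpy as np

# A strict contraction: random matrix rescaled so that ||T|| = 0.8 < 1
rng = np.random.default_rng(1)
M = rng.standard_normal((5, 5))
T = 0.8 * M / np.linalg.norm(M, 2)

x = rng.standard_normal(5)
norms = [np.linalg.norm(np.linalg.matrix_power(T, n) @ x)
         for n in range(0, 60, 10)]

# ||T^n x|| decreases monotonically (contraction) and tends to 0 (strictness)
assert all(a >= b for a, b in zip(norms, norms[1:]))
assert norms[-1] < 1e-3
```

The monotone decrease of \(\|T^n x\|\) uses only \(\|T\| \leq 1\); the convergence to zero uses \(\|T\| < 1\), exactly as in the corollary.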

5. Partial Isometries: The Mathematical Scalpel

Definition 4 (Partial Isometry). An operator \(A\) on a Hilbert space \(H\) is a partial isometry if \(A^*A\) is an orthogonal projection.
Remark (Geometric Intuition). A partial isometry is like a mathematical scalpel that carves out a section of space:
  • It acts as a perfect rigid motion (isometry) on a specific subspace
  • It completely annihilates the rest of the space
This property makes partial isometries powerful tools for selecting and transforming specific parts of a Hilbert space while cleanly disposing of the rest.
Proposition 2 (Key Properties of Partial Isometries). Let \(A\) be a partial isometry. Then:
  1. \(A\) is an isometry when restricted to \((ker A)^\perp\)
  2. \(A(ker A)^\perp = ran A\)
  3. \(A^*\) is also a partial isometry
  4. \(AA^*A = A\) and \(A^*AA^* = A^*\)
Theorem 4 (Geometric Characterization). For a partial isometry \(A\): \[ A^*A = P_{(ker A)^\perp} \quad \text{and} \quad AA^* = P_{ran A} \] where \(P_S\) denotes the orthogonal projection onto subspace \(S\).
Proof.
The action of \(A\) can be decomposed as:
  1. Project onto \((ker A)^\perp\) (this is \(A^*A\))
  2. Apply a perfect rigid motion to the projected space
This two-step process ensures \(A^*A\) is the projection onto \((ker A)^\perp\).
Remark (The "Not So Partial" Nature). Despite the name, there's nothing incomplete about a partial isometry. It performs a complete operation:
  • It's a full isometry on its initial space (\((ker A)^\perp\))
  • It perfectly maps this initial space onto its final space (\(ran A\))
  • It precisely annihilates everything else
This makes partial isometries fundamental building blocks in operator theory, crucial in polar decompositions, dimension theory of von Neumann algebras, and quantum mechanics.
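These properties can be illustrated numerically by building a partial isometry from the SVD of a rank-deficient matrix, keeping only the directions with nonzero singular values (the construction is standard; the specific matrix is an arbitrary choice):

```python
import numpy as np

# Rank-deficient matrix; its polar "direction" U_r V_r^T is a partial isometry.
rng = np.random.default_rng(2)
M = rng.standard_normal((6, 3)) @ rng.standard_normal((3, 6))  # rank <= 3
U, s, Vt = np.linalg.svd(M)
r = int(np.sum(s > 1e-10))
A = U[:, :r] @ Vt[:r, :]

P = A.T @ A                               # projection onto (ker A)^perp
assert np.allclose(P @ P, P)              # idempotent
assert np.allclose(P, P.T)                # self-adjoint
assert np.allclose(A @ A.T @ A, A)        # A A* A = A
```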

Wednesday, November 27, 2024

Reproducing Kernel Hilbert Spaces and Covariance Functions

Let \(K : T \times T \to \mathbb{C}\) be a covariance function such that the associated RKHS \(\mathcal{H}_K\) is separable where \(T \subset \mathbb{R}\). Then there exists a family of vector functions

\(\displaystyle \Psi (t, x) = (\psi_n (t, x), n \geq 1) \forall t \in T\)

and a Borel measure \(\mu\) on \(T\) such that \(\psi_n (t, x) \in L^2 (T, \mu)\) in terms of which \(K\) is representable as:

\(\displaystyle K (s, t) = \int_T \sum_{n = 1}^{\infty} \psi_n (s, x) \overline{\psi_n (t, x)} d \mu (x)\)

The vector functions \(\Psi (s, \cdot), s \in T\), and the measure \(\mu\) may not be unique, but every such pair \((\Psi, \mu)\) determines \(K\) and its reproducing kernel Hilbert space (RKHS) \(\mathcal{H}_K\) uniquely, and the cardinality of the components determining \(K\) remains the same [1].

Remark 2. 1. If \(\Psi (t, \cdot)\) is a scalar, then we have

\(\displaystyle K (s, t) = \int_T \Psi (s, x) \overline{\Psi (t, x)} d \mu (x)\)

which includes the tri-diagonal triangular covariance with \(\mu\) absolutely continuous relative to the Lebesgue measure.

2. The following notational simplification of the representation above can be made. Let \(\tilde{T} = T \times \mathbb{Z}_+\), equipped with the \(\sigma\)-algebra \(\mathcal{S} \otimes \mathcal{P}\), where \(\mathcal{P}\) is the power set of \(\mathbb{Z}_+\), and let \(\tilde{\mu} = \mu \otimes \nu\) where \(\nu\) is the counting measure. Then

\(\displaystyle \tilde{\Psi} (t, (x, n)) = \psi_n (t, x)\)

Hence

\(\displaystyle \| \tilde{\Psi} (t, \cdot) \|^2_{L^2 (\tilde{T}, \tilde{\mu})} = \sum_{n = 1}^{\infty} \int_T | \psi_n (t, x) |^2 d \mu (x)\)
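The scalar case of the representation can be checked concretely; a minimal sketch, assuming the standard example \(K(s,t) = \min(s,t)\) with \(\Psi(t, x)\) the indicator of \([0, t]\) and \(\mu\) Lebesgue measure on \([0, 1]\):

```python
import numpy as np

# Scalar case: K(s, t) = min(s, t) on T = [0, 1] arises from
# Psi(t, x) = 1_{[0,t]}(x) with mu = Lebesgue measure:
#   ∫_0^1 1_{[0,s]}(x) 1_{[0,t]}(x) dx = min(s, t)
m = 1000
x = (np.arange(m) + 0.5) / m
dx = 1.0 / m

def Psi(t):
    return (x <= t).astype(float)         # indicator of [0, t] on the grid

s0, t0 = 0.3, 0.7
approx = np.sum(Psi(s0) * Psi(t0)) * dx   # Riemann sum for the integral
assert abs(approx - min(s0, t0)) < 1e-3
```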

Bibliography

[1]

Malempati M. Rao. Stochastic Processes: Inference Theory. Springer Monographs in Mathematics. Springer, 2nd edition, 2014.

Tuesday, November 26, 2024

Infinite Sum Exponential Factorization

Exponential of Infinite Sum

Table of contents

Finite Exponential Equality

Series Convergence Analysis

Exponential Function Continuity

Product Convergence Proof

The exponential function, a fundamental concept in mathematics, possesses remarkable properties that extend from finite to infinite operations, as demonstrated by a lemma exploring the relationship between infinite sums and products involving exponentials.

Finite Exponential Equality

The finite exponential equality forms the foundation for extending the exponential relationship to infinite sums and products. This fundamental property states that for any finite sequence of real or complex numbers \(x_1, x_2, \ldots, x_n\), the following equality holds:

\(\displaystyle e^{(x_1 + x_2 + \ldots + x_n)} = e^{x_1} \cdot e^{x_2} \cdot \ldots \cdot e^{x_n}\)

This equality stems from the basic properties of exponents, specifically the law of exponents for multiplication, which states that \(a^x \cdot a^y = a^{x + y}\) for any base \(a\) and exponents \(x\) and \(y\) [source1]. The exponential function, being defined as \(e^x\) where \(e\) is Euler's number, inherits this property.

The finite exponential equality is crucial in the proof of the infinite case because it serves as the starting point for induction. By applying this property to the partial sums and partial products, we can establish a sequence of equalities that hold for any finite \(n\):

\(\displaystyle e^{(x_1 + x_2 + \ldots + x_n)} = e^{x_1} \cdot e^{x_2} \cdot \ldots \cdot e^{x_n}\)

As \(n\) increases, this equality continues to hold, providing a bridge between the finite and infinite cases [source2]. The transition to the infinite case relies on taking the limit as \(n\) approaches infinity on both sides of this equation. The power series definition of the exponential function, which converges for all complex numbers, ensures that this finite equality holds regardless of the magnitude or sign of the \(x_i\) terms [source1, source3].

This universal convergence is what allows us to confidently extend the finite case to the infinite case, provided that the series \(\sum_{i = 1}^{\infty} x_i\) converges. Understanding this finite exponential equality is essential for grasping the more complex infinite case, as it illustrates the fundamental relationship between exponentials of sums and products of exponentials, which persists in the limit.

Series Convergence Analysis

The convergence of the series \(\sum_{i = 1}^{\infty} x_i\) is a crucial prerequisite for the validity of the exponential equality in the infinite case. This convergence ensures that the partial sums \(S_n = \sum_{i = 1}^n x_i\) approach a finite limit \(S = \sum_{i = 1}^{\infty} x_i\) as \(n\) tends to infinity. The absolute convergence of the exponential function's power series for all complex numbers plays a significant role in this analysis [source1, source2].

This property allows us to consider the exponential of each term \(x_i\) individually, regardless of its magnitude or sign. As a result, we can confidently apply the exponential function to both sides of the equation:

\(\displaystyle \sum_{i = 1}^{\infty} x_i = S \Longrightarrow e^{\sum_{i = 1}^{\infty} x_i} = e^S\)

The convergence of the original series also implies that the terms \(x_i\) must approach zero as \(i\) increases. This behavior is essential for the convergence of the infinite product \(\prod_{i = 1}^{\infty} e^{x_i}\), as it ensures that the factors \(e^{x_i}\) approach 1 for large \(i\).

Furthermore, the convergence of \(\sum_{i = 1}^{\infty} x_i\) allows us to leverage the continuity of the exponential function [source3]. As the partial sums \(S_n\) converge to \(S\), the continuity of \(e^x\) guarantees that:

\(\displaystyle \lim_{n \to \infty} e^{S_n} = e^{\lim_{n \to \infty} S_n} = e^S\)

This relationship is fundamental in bridging the gap between the finite and infinite cases of the exponential equality. It's worth noting that the convergence of \(\sum_{i = 1}^{\infty} x_i\) is a sufficient condition for the equality to hold, but it may not be necessary in all cases. Some divergent series, when exponentiated term by term, can still yield convergent products. However, for the purposes of this proof and its general applicability, we focus on convergent series to ensure the validity of the exponential equality in the broadest possible context.

Exponential Function Continuity

The continuity of the exponential function is a fundamental property that plays a crucial role in extending the exponential equality from finite to infinite sums. This continuity is intimately tied to the function's definition as a power series with an infinite radius of convergence [source1, source2].

The exponential function, defined as \(e^x = \sum_{n = 0}^{\infty} \frac{x^n}{n!}\), converges absolutely for all complex numbers \(x\) [source1]. This universal convergence ensures that the function is well-defined and continuous over its entire domain, including both real and complex numbers [source2].

The continuity of the exponential function allows us to interchange limits and exponentials, a key step in proving the infinite exponential equality. In the context of real numbers, the continuity of the exponential function can be rigorously proven using \(\varepsilon\)-\(\delta\) arguments or through the properties of power series [source3]. For any real number \(a\), given any \(\varepsilon > 0\), there exists a \(\delta > 0\) such that for all \(x\) satisfying \(|x - a| < \delta\), we have \(|e^x - e^a | < \varepsilon\).

The continuity of the exponential function is particularly important when dealing with limits of sequences or series. In our proof of the infinite exponential equality, we rely on this continuity when we assert that:

\(\displaystyle \lim_{n \to \infty} e^{S_n} = e^{\lim_{n \to \infty} S_n} = e^S\)

where \(S_n\) are the partial sums of the series \(\sum_{i = 1}^{\infty} x_i\) and \(S\) is its limit. This step is valid precisely because of the continuity of the exponential function.

Moreover, the exponential function's continuity extends to the complex plane, making it an entire function [source2]. This property allows for the generalization of our results to complex-valued series, broadening the applicability of the infinite exponential equality.

Product Convergence Proof

The convergence of the infinite product \(\prod_{i = 1}^{\infty} e^{x_i}\) is a crucial component in establishing the exponential equality for infinite sums. This convergence is intricately linked to the convergence of the series \(\sum_{i = 1}^{\infty} x_i\) and the properties of the exponential function.

To prove the convergence of the infinite product, we first consider the partial products:

\(\displaystyle P_n = \prod_{i = 1}^n e^{x_i} = e^{x_1} \cdot e^{x_2} \cdot \ldots \cdot e^{x_n}\)

Using the finite exponential equality, we can rewrite this as:

\(\displaystyle P_n = e^{(x_1 + x_2 + \ldots + x_n)} = e^{S_n}\)

Given that the series \(\sum_{i = 1}^{\infty} x_i\) converges to some limit \(S\), we know that the sequence of partial sums \(\{S_n \}\) converges to \(S\). By the continuity of the exponential function, which has an infinite radius of convergence [source1], we can conclude that:

\(\displaystyle \lim_{n \to \infty} P_n = \lim_{n \to \infty} e^{S_n} = e^{\lim_{n \to \infty} S_n} = e^S\)

This limit exists and is finite, proving that the infinite product \(\prod_{i = 1}^{\infty} e^{x_i}\) converges to \(e^S\). It's important to note that the convergence of the infinite product is conditional on the convergence of the original series. If \(\sum_{i = 1}^{\infty} x_i\) diverges, the infinite product may not converge in the traditional sense.

The convergence of the infinite product can also be understood through the lens of logarithms. Taking the natural logarithm of both sides of the equality:

\(\displaystyle \ln \left( \prod_{i = 1}^{\infty} e^{x_i} \right) = \sum_{i = 1}^{\infty} \ln (e^{x_i}) = \sum_{i = 1}^{\infty} x_i\)

This relationship further illustrates the connection between the convergence of the series and the convergence of the product [source2]. The proof of product convergence relies heavily on the unique properties of the exponential function, particularly its continuity and its behavior under exponentiation. These properties allow us to bridge the gap between finite and infinite cases, providing a robust foundation for the exponential equality in the realm of infinite sums and products.
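The whole argument can be checked numerically; a minimal sketch with the illustrative choice \(x_i = 2^{-i}\), whose sum converges to \(S = 1\):

```python
import math

# x_i = 2^{-i}, i >= 1: the series sums to S = 1
xs = [2.0 ** -i for i in range(1, 60)]
S = sum(xs)

prod = 1.0
for xi in xs:
    prod *= math.exp(xi)                  # partial products P_n = e^{S_n}

assert abs(S - 1.0) < 1e-12
assert abs(prod - math.exp(S)) < 1e-12    # prod of e^{x_i} equals e^{sum x_i}
```

Here the infinite product of the \(e^{x_i}\) and \(e^S\) agree to floating-point precision, as the continuity argument predicts.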

Thursday, November 21, 2024

Aimeds, Tenghistor Gratifier

Interpretation of "Aimeds, Tenghistor Gratifier"

In a speculative context, "aimeds, tenghistor gratifier" can be interpreted as follows:

Aimeds could suggest the concept of focus or intention. It might refer to the state of being directed or purpose-driven, implying the act of setting intentions or aiming toward a specific outcome.

Tenghistor could evoke ideas of history or chronology. It might refer to the interconnectedness of past events and their influence on the present, symbolizing the weight of historical experiences in shaping current realities or a collective memory among people.

Gratifier might indicate something that provides fulfillment or satisfaction. In this context, it could represent the ultimate goal of the intentions set in "aimeds" and the historical context of "tenghistor." It implies that pursuing knowledge, understanding, or connection leads to a gratifying experience.

Putting it all together, "The focused pursuit of understanding, informed by the lessons of history, leads to a fulfilling and rewarding experience."

Monday, November 4, 2024

Stationary Dilations

1. Stationary Dilations

Definition 1. Let \((\Omega, \mathcal{F}, P)\) and \((\tilde{\Omega}, \tilde{\mathcal{F}}, \tilde{P})\) be probability spaces. We say that \((\Omega, \mathcal{F}, P)\) is a factor of \((\tilde{\Omega}, \tilde{\mathcal{F}}, \tilde{P})\) if there exists a measurable surjective map \(\phi : \tilde{\Omega} \to \Omega\) such that:

  1. For all \(A \in \mathcal{F}\), \(\phi^{- 1} (A) \in \tilde{\mathcal{F}}\)

  2. For all \(A \in \mathcal{F}\), \(P (A) = \tilde{P} (\phi^{- 1} (A))\)

In other words, \((\Omega, \mathcal{F}, P)\) can be obtained from \((\tilde{\Omega}, \tilde{\mathcal{F}}, \tilde{P})\) by projecting the larger space onto the smaller one while preserving the probability measure structure.

Remark 2. In the context of stationary dilations, this means that the original nonstationary process \(\{X_t \}\) can be recovered from the stationary dilation \(\{Y_t \}\) through a measurable projection that preserves the probabilistic structure of the original process.

Definition 3. (Stationary Dilation) Let \((\Omega, \mathcal{F}, P)\) be a probability space and let \(\{X_t \}_{t \in \mathbb{R}_+}\) be a nonstationary stochastic process. A stationary dilation of \(\{X_t \}\) is a stationary process \(\{Y_t \}_{t \in \mathbb{R}_+}\) defined on a larger probability space \((\tilde{\Omega}, \tilde{\mathcal{F}}, \tilde{P})\) such that:

  1. \((\Omega, \mathcal{F}, P)\) is a factor of \((\tilde{\Omega}, \tilde{\mathcal{F}}, \tilde{P})\)

  2. There exists a measurable projection operator \(\Pi\) such that:

    \(\displaystyle X_t = \Pi Y_t \quad \forall t \in \mathbb{R}_+\)

Theorem 4. (Representation of Nonstationary Processes) A continuous-time nonstationary process \(\{X_t \}_{t \in \mathbb{R}_+}\) admits a stationary dilation whose sample paths \(t \mapsto X_t (\omega)\) are continuous with probability one when \(X_t\):

  • is uniformly continuous in probability over compact intervals:

    \(\displaystyle \lim_{s \to t} P (|X_s - X_t | > \epsilon) = 0 \quad \forall \epsilon > 0, t \in [0, T], T > 0\)
  • has finite second moments:

    \(\displaystyle \mathbb{E} [|X_t |^2] < \infty \quad \forall t \in \mathbb{R}_+\)
  • has an integral representation of the form:

    \(\displaystyle X_t = \int_0^t \eta (s) ds\)

    where \(\eta (t)\) is a measurable random function that is stationary in the wide sense (with \(\int_0^t \mathbb{E} [| \eta (s) |^2] \hspace{0.17em} ds < \infty\) for all \(t\))

  • and has a covariance operator

    \(\displaystyle R (t, s) =\mathbb{E} [X_t X_s]\)

    which is symmetric \((R (t, s) = R (s, t))\), positive definite and continuous

Under these conditions, there exists a representation:

\(\displaystyle X_t = M (t) \cdot S_t\)

where:

  • \(M (t)\) is a continuous deterministic modulation function

  • \(\{S_t \}_{t \in \mathbb{R}_+}\) is a stationary process

This representation can be obtained through the stationary dilation by choosing:

\(\displaystyle Y_t = \left( \begin{array}{c} M (t)\\ S_t \end{array} \right)\)

with the projection operator \(\Pi\) defined as:

\(\displaystyle \Pi Y_t = M (t) \cdot S_t\)

Proposition 5. (Properties of Dilation) The stationary dilation satisfies:

  1. Preservation of moments:

    \(\displaystyle \mathbb{E} [|X_t |^p] \leq \mathbb{E} [|Y_t |^p] \quad \forall p \geq 1\)
  2. Minimal extension: Among all stationary processes that dilate \(X_t\), there exists a minimal one (unique up to isomorphism) in terms of the probability space dimension

Corollary 6. For any nonstationary process satisfying the above conditions, the stationary dilation provides a canonical factorization into deterministic time-varying components and stationary stochastic components.
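The covariance structure implied by the factorization \(X_t = M (t) \cdot S_t\) can be illustrated numerically: if \(S_t\) is wide-sense stationary with covariance \(C (t - s)\), then \(R (t, s) = M (t) M (s) C (t - s)\), which is symmetric and positive semidefinite (the modulation \(M\) and covariance \(C\) below are arbitrary illustrative choices):

```python
import numpy as np

# R(t, s) = M(t) M(s) C(t - s) on a grid, with illustrative M and C
t = np.linspace(0.0, 5.0, 60)
M = 1.0 + 0.5 * np.sin(t)                         # deterministic modulation
C = np.exp(-np.abs(np.subtract.outer(t, t)))      # stationary part, C(u) = e^{-|u|}
R = np.outer(M, M) * C

assert np.allclose(R, R.T)                        # symmetric
assert np.linalg.eigvalsh(R).min() > -1e-10       # positive semidefinite
```

Positive semidefiniteness survives the modulation because \(R\) is the Schur product of the rank-one PSD matrix \(M M^{\top}\) with the stationary covariance.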

Monday, October 28, 2024

Treehouse of Horror: The LaTeX Massacre

Segment 1: The Formatting

Homer works as a LaTeX typesetter at the nuclear plant. After Mr. Burns demands perfectly aligned equations, Homer goes insane trying to format complex mathematical expressions, eventually snapping when his equations run off the page. In a parody of "The Shinning," Homer chases his family around with a mechanical keyboard while screaming "All work and no proper alignment makes Homer go crazy!"

Segment 2: Time and Compilation

In a nod to "Time and Punishment", Homer accidentally breaks his LaTeX compiler and tries to fix it, but ends up creating a time paradox where every document compiles differently in parallel universes. He desperately tries to find his way back to a reality where his equations render properly.

Segment 3: The Cursed Code

Bart discovers an ancient LaTeX document that contains forbidden mathematics. When he compiles it, it summons an eldritch horror made entirely of misaligned integrals and malformed matrices. Lisa must save Springfield by finding the one perfect alignment that will banish the mathematical monster back to its dimension.

The episode ends with a meta-joke about how even the credits won't compile properly.

Friday, October 25, 2024

A Modest Proposal: Statistical Token Prediction Is No Replacement for Syntactic Construction

by Stephen Crowley

October 25, 2024

1. Current Generative-Pretrained-Transformer Architecture

Given vocabulary \(V\), \(|V| = v\), current models map token sequences to vectors:

\(\displaystyle (t_1, \ldots, t_n) \mapsto X \in \mathbb{R}^{n \times d}\)

Through layers of transformations:

\(\displaystyle \text{softmax} (QK^T / \sqrt{d}) V\)

where \(Q = XW_Q\), \(K = XW_K\), \(V = XW_V\)

Optimizing:

\(\displaystyle \max_{\theta} \sum \log P (t_{n + 1} |t_1, \ldots, t_n ; \theta)\)
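For concreteness, the attention map above can be sketched in a few lines (a single head with arbitrary random weights; a minimal illustration, not any particular production implementation):

```python
import numpy as np

def attention(X, W_Q, W_K, W_V):
    """Single-head attention: softmax(Q K^T / sqrt(d)) V."""
    Q, K, V = X @ W_Q, X @ W_K, X @ W_V
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))  # stable softmax
    w /= w.sum(axis=-1, keepdims=True)
    return w @ V

rng = np.random.default_rng(0)
n, d = 4, 8                               # n tokens, embedding dimension d
X = rng.standard_normal((n, d))
W_Q, W_K, W_V = [rng.standard_normal((d, d)) for _ in range(3)]
out = attention(X, W_Q, W_K, W_V)
assert out.shape == (n, d)
```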

2Required Reformulation

Instead, construct Abstract Syntax Trees where each node \(\eta\) must satisfy:

\(\displaystyle \eta \in \{ \text{Noun}, \text{Verb}, \text{Adjective}, \text{Conjunction}, \ldots\}\)

With composition rules \(R\) such that for nodes \(\eta_1, \eta_2\):

\(\displaystyle R (\eta_1, \eta_2) = \left\{ \begin{array}{ll} \text{valid\_subtree} & \text{if grammatically valid}\\ \emptyset & \text{otherwise} \end{array} \right.\)

And logical constraints \(L\) such that for any subtree \(T\):

\(\displaystyle L (T) = \left\{ \begin{array}{ll} T & \text{if logically consistent}\\ \emptyset & \text{if contradictory} \end{array} \right.\)

3. Parsing and Generation

Input text \(s\) maps to valid AST \(T\) or error \(E\):

\(\displaystyle \text{parse} (s) = \left\{ \begin{array}{ll} T & \text{if } \exists \text{valid AST}\\ E (\text{closest\_valid}, \text{violation}) & \text{otherwise} \end{array} \right.\)

Generation must traverse only valid AST constructions:

\(\displaystyle \text{generate} (c) = \{T|R (T) \neq \emptyset \wedge L (T) \neq \emptyset\}\)

where \(c\) is the context/prompt.
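A toy sketch of the composition rule \(R\) (the mini-grammar below is hypothetical and far smaller than anything practical): invalid pairings return the empty set, so only grammatical subtrees can ever be built.

```python
from typing import Optional, Tuple

# Hypothetical composition rules: which node categories may combine, and
# what category the resulting subtree has.
RULES = {
    ("Adjective", "Noun"): "NounPhrase",
    ("NounPhrase", "Verb"): "Sentence",
    ("Noun", "Verb"): "Sentence",
}

Node = Tuple[str, object]                 # (category, payload)

def R(n1: Node, n2: Node) -> Optional[Node]:
    """Return a valid subtree for (n1, n2), or None (the empty set)."""
    head = RULES.get((n1[0], n2[0]))
    return (head, (n1, n2)) if head else None

np_ = R(("Adjective", "red"), ("Noun", "ball"))
assert np_ is not None and np_[0] == "NounPhrase"
assert R(("Verb", "runs"), ("Verb", "jumps")) is None     # rejected outright
assert R(np_, ("Verb", "bounces"))[0] == "Sentence"
```

Generation under such a rule set can only traverse valid constructions, which is exactly the constraint the statistical objective lacks.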

4. Why Current GPT Fails

The statistical model:

\(\displaystyle \text{softmax} (QK^T / \sqrt{d}) V\)

Has no inherent conception of:

  • Syntactic validity

  • Logical consistency

  • Conceptual preservation

It merely maximizes:

\(\displaystyle P (t_{n + 1} |t_1, \ldots, t_n)\)

Based on training patterns, with no guaranteed constraints on:

\(\displaystyle \prod_{i = 1}^n P (t_i |t_1, \ldots, t_{i - 1})\)

This allows generation of:

  • Grammatically invalid sequences

  • Logically contradictory statements

  • Conceptually inconsistent responses

5. Conclusion

The fundamental flaw is attempting to learn syntax and logic from data rather than building them into the architecture. An AST-based approach with formal grammar rules and logical constraints must replace unconstrained statistical token prediction.
