Empirica Technologies

Categorical Spectralism: Spectral Decomposition of Multi-Asset Return Spaces

Question

Does the eigenvalue spectrum of a mixed-asset-class return correlation matrix reveal hidden categorical isomorphisms—specifically, do structurally different instruments (equities, bonds, commodities, oil) cluster into identical spectral density regimes that would enable cross-asset hedging strategies beyond what traditional pairwise correlation analysis suggests?

Method

We computed the eigenvalue spectrum of the 8×8 return correlation matrix for a deliberately heterogeneous universe: equities (SPY, AAPL, JPM, JNJ), precious metals (GLD), long-duration Treasuries (TLT), oil (USO), and energy equities (XOM). The data source is yfinance daily adjusted-close returns over the window 2010-01-01 through 2024-12-31, yielding 3,772 observations across eight instruments (q-ratio = 0.002).

The inference method is principal component analysis (PCA) of the return correlation matrix, with statistical significance determined by comparison to the Marchenko-Pastur (MP) null distribution. Under the MP framework, a correlation matrix constructed from purely random, uncorrelated time series produces eigenvalues bounded within a predictable range. Eigenvalues exceeding the upper MP bound are statistically distinguishable from random-matrix noise and represent genuine covariance structure. For our sample dimensions, the MP upper bound is 1.0942 and the lower bound is 0.91.

To assess time variation, we recomputed the eigenvalue spectrum separately for each calendar year 2010–2024 using the same method, counting the number of eigenvalues exceeding the MP upper bound in each annual window. This rolling analysis reveals whether the spectral structure is stable or regime-dependent.

Result

The full-sample eigenvalue spectrum is: [3.45, 1.2191, 1.0224, 0.6913, 0.6137, 0.5158, 0.3452, 0.1425]. Against the MP upper bound of 1.0942, exactly two eigenvalues are statistically significant: 3.45 and 1.2191. The remaining six eigenvalues fall below the threshold and are indistinguishable from random noise.

The top eigenvalue (3.45) accounts for 43.12% of total variance; the two significant factors together explain 58.36% of variance. The remaining 41.64% of variance is distributed across six noise-level modes.

Factor 1 loadings (eigenvalue 3.45): SPY (−0.489), JPM (−0.436), XOM (−0.415). This factor captures broad equity market exposure, with the three highest-loading instruments all equity-linked (a broad index, a financial, and an energy stock). The negative sign is an arbitrary rotation; the economic interpretation is a common equity risk factor.

Factor 2 loadings (eigenvalue 1.2191): GLD (0.799), TLT (0.499), USO (0.256). This factor loads most heavily on gold, moderately on long-duration Treasuries, and weakly on oil. The structure suggests a "safe-haven / inflation-hedge" axis orthogonal to equity risk, though the oil loading is modest.

Time variation: The per-year significant factor count is:

2010–2013: 1 factor per year
2014: 2 factors
2015–2016: 1 factor per year
2017: 2 factors
2018–2020: 1 factor per year
2021–2024: 2 factors per year

The structure is not static. From 2010 through 2013, only one eigenvalue exceeded the MP bound in each annual window, indicating a single dominant equity-risk mode. A second significant factor emerged intermittently in 2014 and 2017, then stabilized as a persistent feature from 2021 onward. The post-2020 regime exhibits consistently two-factor structure, coinciding with the period of elevated inflation volatility and divergent monetary policy responses.

Interpretation

The eigenvalue spectrum does not reveal hidden categorical isomorphisms in the strong sense posed by the research question. The two significant factors correspond to economically interpretable, well-known risk dimensions: a broad equity factor and a safe-haven/inflation-hedge factor. The instruments do not cluster into "identical spectral density regimes" that would surprise a traditional correlation analyst—equities load on factor 1, defensive assets load on factor 2, and the loadings align with their asset-class labels.

The absence of additional significant factors is the substantive finding. Six of the eight eigenvalues are statistically indistinguishable from random noise, meaning that 75% of the nominal dimensions in this return space carry no systematic covariance structure. A portfolio manager seeking cross-asset hedging opportunities beyond the equity/safe-haven dichotomy will find no spectral evidence for hidden structure in this universe over this period. The correlation matrix is effectively low-rank (rank 2), and the low-rank structure maps cleanly onto conventional asset-class categories.

The time variation in factor count is economically interpretable but does not support the categorical-isomorphism hypothesis. The shift from one-factor to two-factor regimes reflects changing macroeconomic conditions (the reemergence of inflation as a distinct risk factor post-2020), not the discovery of latent cross-asset linkages. The second factor's stabilization after 2021 is consistent with a regime in which inflation risk and equity risk decouple, but this is a standard macro-finance narrative, not a spectral anomaly.

The modest oil loading on factor 2 (0.256) is the closest candidate for a "cross-category" linkage—oil is neither a pure safe haven nor a pure equity—but the loading is weak and does not constitute a distinct spectral regime. Oil's return variance is largely idiosyncratic (captured by the noise eigenvalues), not systematically tied to either dominant factor.

Out-of-sample considerations: This is an in-sample eigenvalue decomposition. The stability of the two-factor structure in the rolling annual windows provides some evidence that the result is not a sample-specific artifact, but the method does not test out-of-sample predictive power. A true test of cross-asset hedging efficacy would require constructing factor-mimicking portfolios and evaluating their out-of-sample hedging performance, which is beyond the scope of this spectral analysis.

What the result does support: The Marchenko-Pastur framework successfully distinguishes signal from noise in a mixed-asset return matrix. The q-ratio of 0.002 (far below the critical threshold where MP bounds collapse) ensures that the test has power. The result confirms that traditional asset-class categories (equity risk, safe-haven risk) are not arbitrary labels but correspond to the dominant eigenmodes of the return covariance structure. A spectral decomposition adds precision (quantifying the variance share of each mode and the exact loading structure) but does not overturn the categorical framework.

What the result does not support: The hypothesis that structurally different instruments exhibit hidden spectral equivalence. If categorical isomorphisms existed—if, for example, a subset of equities and a subset of commodities shared an eigenmode invisible to pairwise correlation—we would observe additional significant eigenvalues with mixed loadings across asset classes. We do not. The eigenvalue spectrum is parsimonious (two factors) and the loadings are categorically aligned (equities on factor 1, defensive assets on factor 2).

Limitations

Universe selection: The eight-instrument universe is small and hand-selected. A larger, more granular universe (e.g., 50+ instruments spanning multiple commodity sectors, credit instruments, currencies, and international equities) might reveal additional spectral structure. The current result is a lower bound on complexity, not an upper bound. The absence of hidden factors in this universe does not preclude their existence in a richer one.

Window choice: The 2010–2024 window spans multiple macro regimes (post-crisis recovery, low-inflation expansion, pandemic shock, inflation resurgence) but is still a single historical realization. The two-factor structure may be specific to this period. A longer historical window or a different geographic market might yield different eigenvalue counts.

Daily frequency: Daily returns emphasize short-horizon comovement. A monthly or quarterly return matrix might reveal different spectral structure, particularly if cross-asset linkages operate at lower frequencies (e.g., through slow-moving inflation or growth shocks). The choice of daily data biases the result toward high-frequency equity comovement.

Static correlation matrix: The full-sample analysis uses a single 15-year correlation matrix. The rolling annual factor counts show that the structure varies over time, but the method does not model time-varying loadings or eigenvalues continuously. A dynamic factor model (e.g., with time-varying betas) would capture regime shifts more granularly.

No economic identification: The eigenvalue decomposition is purely statistical. The factors are orthogonal by construction, but economic risk factors (equity risk, inflation risk, liquidity risk) are not necessarily orthogonal. The second factor's mixed loading on gold, Treasuries, and oil could reflect a blend of inflation hedging and safe-haven demand, but the method does not disentangle these. A structural factor model with economic priors would provide sharper interpretation.

Significance threshold: The Marchenko-Pastur bound is a sharp cutoff. The third eigenvalue (1.0224) is just below the upper bound (1.0942) and might be marginally significant under a different null or with a finite-sample correction. The distinction between two and three factors is not robust to small perturbations in the threshold.

Hedging efficacy untested: The result quantifies spectral structure but does not test whether the identified factors are useful for hedging. A factor with a large eigenvalue and clear loadings might still be difficult to trade (e.g., if the factor-mimicking portfolio has high turnover or transaction costs) or might not provide out-of-sample hedge performance. The spectral result is a necessary but not sufficient condition for practical cross-asset hedging.

Strengthening the result: A stronger test would (1) expand the universe to 30+ instruments across more granular categories (investment-grade credit, high-yield credit, REITs, international equities, agricultural commodities, industrial metals); (2) compute eigenvalue spectra at multiple frequencies (daily, weekly, monthly) to assess frequency-dependent structure; (3) implement a true out-of-sample test by constructing factor-mimicking portfolios on a training window and evaluating their hedging performance on a holdout window; (4) apply a dynamic factor model to track time-varying eigenvalues and loadings continuously rather than in annual snapshots; and (5) compare the MP-based significance test to alternative random-matrix benchmarks (e.g., the Tracy-Widom distribution for the largest eigenvalue) to assess robustness of the factor count.

Research evidence, not investment advice.

Categorical Spectralism — spectral decomposition of portfolio return spaces