Part I: Foundations

Integration Measures


Let’s define precise measures of integration that will play a central role in the phenomenological analysis.

The first is transfer entropy, which captures directed causal influence between components. The transfer entropy from process $X$ to process $Y$ measures the information that $X$ provides about the future of $Y$ beyond what $Y$'s own past provides:

$$\text{TE}_{X \to Y} = I(X_t; Y_{t+1} \mid Y_{1:t})$$
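For discrete time series, this quantity can be estimated directly from empirical frequencies. Below is a minimal plug-in sketch that truncates the history $Y_{1:t}$ to a single step; the function name and the toy copy process are illustrative, not from the text:

```python
import numpy as np
from collections import Counter

def transfer_entropy(x, y):
    """Plug-in estimate of TE_{X->Y} = I(X_t; Y_{t+1} | Y_t) for
    discrete sequences, truncating the history Y_{1:t} to one step."""
    n = len(x) - 1
    c_xyz = Counter(zip(x[:-1], y[1:], y[:-1]))   # (x_t, y_{t+1}, y_t)
    c_yz = Counter(zip(y[1:], y[:-1]))            # (y_{t+1}, y_t)
    c_xz = Counter(zip(x[:-1], y[:-1]))           # (x_t, y_t)
    c_z = Counter(y[:-1])                         # (y_t,)
    te = 0.0
    for (xt, yn, yp), c in c_xyz.items():
        # Conditional mutual information term; sample sizes cancel.
        te += (c / n) * np.log2(c * c_z[yp] / (c_xz[(xt, yp)] * c_yz[(yn, yp)]))
    return te

rng = np.random.default_rng(0)
x = rng.integers(0, 2, 10_000)
y = np.empty_like(x)
y[0] = 0
y[1:] = x[:-1]                    # y copies x with a one-step delay
print(transfer_entropy(x, y))     # close to 1 bit
print(transfer_entropy(y, x))     # close to 0: y carries no news about x
```

The asymmetry of the estimate on the copy process illustrates the directedness of the measure: information flows from $X$ to $Y$ but not back.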

The deepest measure is integrated information ($\Phi$). Following IIT, the integrated information of a system in state $s$ is the extent to which the system's causal structure exceeds the sum of its parts:

$$\Phi(s) = \min_{\text{partitions } P} D\left[\, p(s_{t+1} \mid s_t) \,\middle\|\, \prod_{p \in P} p\!\left(s^p_{t+1} \mid s^p_t\right) \right]$$

where the minimum is over all bipartitions of the system, and $D$ is an appropriate divergence (typically the Earth Mover's distance in IIT 4.0).
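To make the definition concrete, here is a minimal sketch for a two-unit binary system. It simplifies the full IIT computation in ways flagged in the comments: KL divergence stands in for the Earth Mover's distance, only the next-state distribution is factored (the conditioning keeps the full current state), and with two units there is a single nontrivial bipartition, so no explicit minimum over partitions is needed. All names and the example systems are illustrative:

```python
import numpy as np
from itertools import product

def integration_kl(T):
    """Divergence between a 2-unit system's joint transition
    distribution and its factored (partitioned) approximation.
    T[s, s'] = p(joint next state s' | joint current state s),
    with joint states indexed as s = 2*unit1 + unit2.
    Simplification: KL divergence instead of EMD, and only the
    next-state distribution is factored."""
    divs = []
    for s in range(4):
        p = T[s]
        row = p.reshape(2, 2)          # p(next1, next2 | s)
        q1 = row.sum(axis=1)           # marginal p(next1 | s)
        q2 = row.sum(axis=0)           # marginal p(next2 | s)
        q = np.outer(q1, q2).ravel()   # factored approximation
        mask = p > 0
        divs.append(np.sum(p[mask] * np.log2(p[mask] / q[mask])))
    return max(divs)                   # worst-case current state

# Two units that each flip independently with probability 0.1:
# the joint transition factorizes exactly, so the divergence is 0.
f = np.array([[0.9, 0.1], [0.1, 0.9]])
T_ind = np.zeros((4, 4))
for (a, b), (a2, b2) in product(product(range(2), repeat=2), repeat=2):
    T_ind[2*a + b, 2*a2 + b2] = f[a, a2] * f[b, b2]

# Two units driven by shared noise: both jump together to a common
# random bit, so the joint transition cannot be factored.
T_cor = np.tile([0.5, 0.0, 0.0, 0.5], (4, 1))

print(integration_kl(T_ind))   # 0.0
print(integration_kl(T_cor))   # 1.0
```

The contrast between the two systems is the point: independent dynamics lose nothing under partitioning, while shared-noise dynamics lose a full bit.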

In practice, computing $\Phi$ exactly is intractable. Three proxies make it operational:

  1. Transfer entropy density: the average transfer entropy across all directed pairs:
     $$\bar{\text{TE}} = \frac{1}{n(n-1)} \sum_{i \neq j} \text{TE}_{i \to j}$$
  2. Partition prediction loss: the cost of factoring the model:
     $$\Delta_P = \mathcal{L}_{\text{pred}}[\text{partitioned model}] - \mathcal{L}_{\text{pred}}[\text{full model}]$$
  3. Synergy: the information that components provide jointly beyond their individual contributions:
     $$\text{Syn}(X_1, \dots, X_k \to Y) = I(X_1, \dots, X_k; Y) - \sum_i I(X_i; Y \mid X_{-i})$$
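The second proxy is easy to sketch for a linear system: fit a one-step least-squares predictor with and without cross-couplings between the parts, and compare in-sample losses. The coupling matrix, partition, and function names below are hypothetical illustrations, not from the text:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 4-dim linear system x_{t+1} = A x_t + noise; the
# off-block 0.2 and 0.1 entries couple the halves {0,1} and {2,3}.
A = np.array([[0.4, 0.1, 0.2, 0.0],
              [0.1, 0.4, 0.0, 0.2],
              [0.2, 0.0, 0.4, 0.1],
              [0.0, 0.2, 0.1, 0.4]])
T = 5000
X = np.zeros((T, 4))
for t in range(T - 1):
    X[t + 1] = A @ X[t] + 0.1 * rng.standard_normal(4)
past, future = X[:-1], X[1:]

def pred_loss(groups):
    """Per-element one-step MSE when each group of coordinates is
    predicted (by least squares) from its own past only."""
    sq, cnt = 0.0, 0
    for g in groups:
        P = past[:, g]
        W, *_ = np.linalg.lstsq(P, future[:, g], rcond=None)
        r = future[:, g] - P @ W
        sq += np.sum(r ** 2)
        cnt += r.size
    return sq / cnt

full_loss = pred_loss([[0, 1, 2, 3]])     # full model
part_loss = pred_loss([[0, 1], [2, 3]])   # factored model
delta_P = part_loss - full_loss           # > 0: factoring hurts
print(delta_P)
```

Because the factored model is a restricted least-squares fit, its in-sample loss can only be larger; the gap $\Delta_P$ is strictly positive exactly when the cross-couplings carry predictive information.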

A complementary measure captures the system's representational breadth rather than its causal coupling. The effective rank of a system with state covariance matrix $C$ measures how many dimensions it actually uses:

$$r_{\text{eff}} = \frac{(\operatorname{tr} C)^2}{\operatorname{tr}(C^2)} = \frac{\left(\sum_i \lambda_i\right)^2}{\sum_i \lambda_i^2}$$

where $\lambda_i$ are the eigenvalues of $C$. This is bounded by $1 \leq r_{\text{eff}} \leq \operatorname{rank}(C)$, with $r_{\text{eff}} = 1$ when all variance is in one dimension (maximally concentrated) and $r_{\text{eff}} = \operatorname{rank}(C)$ when variance is uniformly distributed across all active dimensions.
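The two boundary cases are easy to check numerically; this is a minimal sketch and the function name is illustrative:

```python
import numpy as np

def effective_rank(C):
    """Participation ratio (tr C)^2 / tr(C^2), computed from the
    eigenvalues of a symmetric covariance matrix C."""
    lam = np.linalg.eigvalsh(C)
    return lam.sum() ** 2 / np.sum(lam ** 2)

C_concentrated = np.diag([1.0, 0.0, 0.0])   # all variance on one axis
C_uniform = np.eye(3)                        # variance spread evenly

print(effective_rank(C_concentrated))   # 1.0
print(effective_rank(C_uniform))        # 3.0
```

Intermediate spectra interpolate smoothly between these extremes, which is what makes the measure useful as a continuous notion of dimensionality.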