Experiments

The Emergence Experiment Program

The Emergence Experiment Program

Eleven measurement experiments on V13 snapshots, testing whether world modeling, abstraction, communication, counterfactual reasoning, self-modeling, affect structure, perceptual mode, normativity, and social integration emerge in a substrate with zero exposure to human affect concepts. All experiments: 3 seeds, 7 snapshots per seed, 50 recording steps per snapshot.

Experiment 0: Substrate Engineering

Status: Complete. V13 content-based coupling Lenia with lethal resource dynamics. Foundation for all subsequent experiments.

Experiment 1: Emergent Existence

Status: Complete. Patterns persist, maintain boundaries, respond to perturbation. Established by V11-V12, confirmed in V13.

Experiment 2: Emergent World Model

Question: When does a pattern's internal state carry predictive information about the environment beyond current observations?

Method: Prediction gap W(τ)=MSE[fenv]MSE[ffull]\mathcal{W}(\tau) = \text{MSE}[f_{\text{env}}] - \text{MSE}[f_{\text{full}}] using Ridge regression with 5-fold CV.

SeedCwm\mathcal{C}_{\text{wm}} (early)Cwm\mathcal{C}_{\text{wm}} (late)HwmH_{\text{wm}} (late)% with WM
1230.00040.028220.0100%
420.00020.00025.340%
70.00100.00027.960%

Finding: World model signal present but weak. Seed 123 at bottleneck shows 100x amplification. World models are amplified by bottleneck selection, not gradual evolution. To be clear about magnitude: Cwm0.0002\mathcal{C}_{\text{wm}} \approx 0.0002 for most seeds means the internal state predicts the environment barely better than the environment alone. Only seed 123 at maximum bottleneck pressure reaches 0.028 — detectable but still small. These patterns are not building substantial world models; they carry a faint trace of environmental predictive information, amplified briefly under extreme selection.

Experiment 2 world model summary
Experiment 2: World model summary. (a) World model capacity over evolution — note the y-axis scale (0.000–0.030). Seed 123 shows a dramatic late spike; seeds 42 and 7 remain near zero throughout. (b) World model horizon in recording steps. (c) Prediction gap at late evolution — seed 123 maintains a flat, elevated prediction gap across all horizons, consistent with a genuine (if weak) internal model.
World model capacity vs pattern longevity
World model vs pattern longevity (r = 0.084). Near-zero correlation: having a world model does not help a pattern survive longer. Most points cluster at C_wm ≈ 0 regardless of lifetime. The few high-C_wm outliers are long-lived patterns at cycle 29 — the world model emerges as a byproduct of bottleneck survival, not as a survival advantage.

Source code

Experiment 3: Internal Representation Structure

Question: When do patterns develop low-dimensional, compositional representations?

Seeddeffd_{\text{eff}} (early to late)A\mathcal{A}D\mathcal{D}KcompK_{\text{comp}}
1236.6 to 5.60.90 to 0.920.27 to 0.380.20 to 0.12
427.3 to 7.50.89 to 0.890.23 to 0.230.23 to 0.25
77.7 to 8.80.89 to 0.870.24 to 0.220.20 to 0.27

Finding: Compression is cheap — deff7d_{\text{eff}} \approx 7/68 from cycle 0. But quality only improves under bottleneck selection. Note the asymmetry: abstraction (A0.89\mathcal{A} \approx 0.89) is high and stable from the start — the system compresses efficiently without effort. But disentanglement (D0.25\mathcal{D} \approx 0.25) remains low — the compressed representations are tangled, not cleanly factored. Disentanglement requires active information-seeking that this substrate lacks.

Experiment 3 representation structure summary
Experiment 3: Representation structure summary. (a) Effective dimensionality: 5.6–8.8 out of 68 possible — strong compression from cycle 0. Seed 123 compresses further over evolution. (b) Abstraction (A ≈ 0.89) is high and stable; disentanglement (D ≈ 0.25) remains low. The gap confirms the theory: compression is cheap but clean factoring requires agency. (c) Compositionality error — lowest for seed 123 at bottleneck, consistent with the bottleneck amplification pattern from Exp 2.
Eigenspectrum of internal state: early vs late evolution
Internal state eigenspectrum, early vs late. Log-scale variance fraction by PCA dimension. Seeds 123 and 42 show the late eigenspectrum becoming more concentrated in the top components — genuine compression under evolutionary pressure. Seed 7 stays relatively flat, consistent with its lack of world-model development.

Source code

Experiment 4: Emergent Language

Question: When do patterns develop structured, compositional communication?

SeedMI significantMI rangeρtopo\rho_{\text{topo}} significant
1234/60.019-0.0390/6
427/70.024-0.0300/7
74/70.023-0.0550/7

Finding: Chemical commons, not proto-language. MI above baseline in 15/20 snapshots but ρtopo0\rho_{\text{topo}} \approx 0 everywhere. Unstructured broadcast, not language.

Source code

Experiment 5: Counterfactual Detachment

Question: When do patterns decouple from external driving and run offline world model rollouts?

Result: Null. ρsync0\rho_{\text{sync}} \approx 0 from cycle 0. Patterns are inherently internally driven. The FFT convolution kernel integrates over the full grid — there is no reactive-to-autonomous transition because the starting point is already autonomous.

Source code

Experiment 6: Self-Model Emergence

Question: When does a pattern predict itself better than an external observer can?

Result: Weak signal at bottleneck only. ρself0\rho_{\text{self}} \approx 0 everywhere. SMsal>1\text{SM}_{\text{sal}} > 1 appears once: seed 123, cycle 20, one pattern at SMsal=2.28\text{SM}_{\text{sal}} = 2.28.

Source code

Experiment 7: Affect Geometry Verification

Question: Does the geometric affect structure predicted by the thesis actually appear? RSA between structural affect (Space A) and behavioral affect (Space C).

Seedρ(A,C)\rho(A,C) rangeSignificantTrend
123-0.09 to 0.722/5Strong at low pop
42-0.17 to 0.394/7Mixed
70.01 to 0.385/7Increasing (0.01 to 0.24)

Finding: Structure-behavior alignment in 8/19 snapshots. Seed 7 shows evolutionary trend. A-B alignment null (structure maps to behavior but not communication).

Source code

Experiment 8: Perceptual Mode and Computational Animism

Question: Do patterns develop modulable perceptual coupling?

MetricSeed 123Seed 42Seed 7
ι\iota (mean)0.27-0.440.27-0.410.31-0.35
ι\iota trajectory0.32 to 0.290.41 to 0.270.31 to 0.32
Animism score1.28-2.101.60-2.161.10-2.02

Confirmed. Default is participatory (ι0.30\iota \approx 0.30). Animism score > 1.0 in all 20 snapshots. Patterns model resources using the same dynamics they use to model other agents. Computational animism is the default because reusing the agent-model template is the cheapest compression.

Source code

Experiment 9: Proto-Normativity

Question: Does the viability gradient generate structural normativity?

Result: Null. No ΔV\Delta V asymmetry between cooperative and competitive contexts. But ΦsocialΦisolated\intinfo_{\text{social}} \gg \intinfo_{\text{isolated}} (~4.9 vs ~3.1). Social context increases integration regardless of interaction type. Normativity requires agency — the capacity to act otherwise.

Source code

Experiment 10: Social-Scale Integration

Question: Does ΦG>iΦi\intinfo_G > \sum_i \intinfo_i?

Finding: No superorganism. Ratio 1-12% but increasing. Seed 7: 6.1% to 12.3% over evolution. Moving toward threshold but not reaching it.

Source code

Experiment 11: Entanglement Analysis

Question: Are world models, abstraction, communication, detachment, and self-modeling separable or entangled?

Finding: Four clusters — but not the ones predicted. Most measures cluster into one large group driven by population-mediated selection. Overall entanglement increases (mean |r| from 0.68 to 0.91). Everything becomes more correlated, just not in the clusters the theory expected.

Source code

Experiment 12: Identity Thesis Capstone

Question: Does the full program hold in a system with zero human contamination?

CriterionStatusStrength
World modelsMetWeak (strong at bottleneck)
Self-modelsMetWeak (n=1 event)
CommunicationMetModerate (15/21 sig)
Affect dimensionsMetStrong (84/84)
Affect geometryMetModerate (9/19 sig)
Tripartite alignmentMetPartial (A-C pos, A-B null)
Perturbation responseMetModerate (rob 0.923)

Verdict: All seven criteria met, most at moderate/weak strength. Geometry confirmed; dynamics undertested, blocked by the coupling wall.

Source code