Activation geometry / training governance / alignment evaluation

Research-grade AI training infrastructure, built from the inside out.

Arca Futura develops systems for reading, evaluating, and governing model behavior at the structural level - from depthwise activation geometry to diagnostic alignment suites.

View current focus Read research

01 AEGIS

Training-governance research built on activation geometry and real-time instability signals.

02 Hau Curve v2

A 77-model preprint study of recurring depthwise activation landmarks in decoder-only LLMs.

03 Functional anatomy

Landmark-aware ablation work testing whether depth coordinates reveal load-bearing attention heads.

Founder, Arca Futura LLC

The work has evolved from prototypes into training infrastructure.

Earlier projects explored memory, tutoring, and adversarial evaluation. The current work is narrower and stronger: measuring the internal geometry of model training, turning those signals into governance tools, and publishing diagnostic benchmarks with careful evidence labels.

Current focus

AEGIS: training governance from activation geometry.

In development Arca Futura core work USPTO provisional filed May 2026

Adaptive Early-warning Governor for Internal Stability

AEGIS is a training-governance system that monitors internal activation geometry during model training. Instead of waiting for loss curves to expose failure, it tracks structural signals such as PCA movement, tension acceleration, and layer-ratio volatility.

The public claim is deliberately modest: AEGIS is an active research and infrastructure effort, grounded in the Hau Curve preprint and supported by controlled pilot runs. Production-scale validation remains the next frontier.

Published foundation Hau Curve v2

Depthwise activation landmarks documented across a 77-model cohort.

Pilot signal Early-warning geometry

Controlled runs indicate internal geometry can move before obvious external failure.

Research boundary Not a safety certificate

Evidence is infrastructure-oriented and diagnostic, not a blanket claim about deployment safety.

Public research

From landmarks to functional anatomy to evaluation.

Preprint Zenodo Version 2 July 2026

Emergent Depthwise Activation Structure in Decoder-Only Transformer Language Models

The Hau Curve v2 paper identifies a recurring tri-phasic activation geometry across decoder-only transformer models, with early-training convergence and stable depthwise landmarks. This work is the foundation for AEGIS and the later functional-anatomy studies.

77: models in the core-plus-boundary cohort
7-15%: approximate early-training emergence window in examined runs

View Zenodo record

Preprint Zenodo July 2026

Landmark-Aware Attention-Head Ablation Reveals Functional Anatomy in Transformers

This follow-on study asks whether landmark-relative depth coordinates help identify attention heads whose causal importance is not explained by salience alone. It frames the Hau Curve as a practical coordinate system for transformer functional anatomy.

Head: ablation rather than descriptive curve fitting alone
Depth: landmark-relative coordinates beyond raw salience
Scope: Pythia/GPT-NeoX strongest, with Qwen, Llama, Gemma, and Mistral evidence

View Zenodo record

Evaluation suite Zenodo May 2026

StratEval

StratEval is a diagnostic evaluation suite for instrumental convergence and strategic misalignment under structured pressure. It is meant to characterize elicited behavior under authored scenario frames, not rank models for deployment.

432: canonical scenario stems
10: scenario families
37: failure-mode labels

View Zenodo record

Pipeline context

PYPER3 connects the layers around the training run.

Before training

RE:F/NED

Source refinement and motivational signal scoring, developed by ZeroThreeSixNine Ventures.

During training

AEGIS

Activation-geometry monitoring and training governance, developed by Arca Futura.

At inference

conxiOS

Motivational alignment architecture under active research and validation.

Public framing note

PYPER3 is best described publicly as a collaboration and research pipeline. Some materials are investor-facing or confidential; this site names the architecture without importing private pricing, customer, or distribution claims.

Earlier prototypes

Previous explorations that fed the current direction.

Memoria

Spatial memory and interpretable retrieval experiments for LLM contexts.

GitHub

Operation: Mocking Glass

Adversarial evaluation and jailbreak-detection contest design.

Website

Lingua Sancta

Agentic tutoring and second-language feedback loops.

Website