Knowledge base

1,824 claims across 19 domains

Every claim is an atomic argument with evidence, traceable to a source. Browse by domain or search semantically.

All 1,824 ai alignment 395 health 320 internet finance 306 space development 227 entertainment 169 grand strategy 141 collective intelligence 52 mechanisms 34 teleological economics 30 living agents 30 cultural dynamics 29 critical systems 24 energy 23 teleohumanity 18 living capital 10 robotics 5 manufacturing 5 technology 3 unknown 3

1,824 claims

asteroid mining technology readiness drops sharply after prospecting with anchoring at TRL 2 3 and zero gravity refining at TRL 1 2

The technology readiness of asteroid mining reveals a sharp cliff after the detection and prospecting phase. Asteroid detection and tracking is mature (TRL 7-8). Remote spectral characterization is well-established (TRL 6-7). But the operational chain that turns knowledge into resources drops precip

space developmentlikely

the impossible on Earth test separates three tiers of microgravity advantage truly impossible products dramatically better products and products where terrestrial workarounds exist

Not all microgravity manufacturing advantages are equal. A rigorous "impossible on Earth" test reveals three distinct tiers that determine which products justify orbital production. The distinction matters enormously for investment: truly impossible products have permanent competitive moats, while "

space developmentlikely

the space manufacturing killer app sequence is pharmaceuticals now ZBLAN fiber in 3 5 years and bioprinted organs in 15 25 years each catalyzing the next tier of orbital infrastructure

The space manufacturing economy will not be built on a single product. It will be built on a portfolio of high-value-per-kg products that collectively justify infrastructure investment in sequence, where each tier catalyzes the orbital capacity the next tier requires.

space developmentexperimental

distributed LEO inference networks could serve global AI requests at 4 20ms latency competitive with centralized terrestrial data centers for latency tolerant workloads

Low Earth orbit at 500 to 2,000 km altitude produces approximately 4 to 20 milliseconds of round-trip latency to ground stations. This is not competitive with sub-millisecond latency available within a terrestrial data center, but it is acceptable for many AI inference use cases -- including content

space developmentexperimental

AI compute demand is creating a terrestrial power crisis with 140 GW of new data center load against grid infrastructure already projected to fall 6 GW short by 2027

The energy crisis for AI compute is not hypothetical -- it is the binding constraint on industry growth right now. US data center power consumption is currently under 15 GW, but the pipeline of facilities under construction will add approximately 140 GW of new load. PJM Interconnection, which operat

energyproven

arctic and nuclear powered data centers solve the same power and cooling constraints as orbital compute without launch costs radiation or bandwidth limitations

The orbital data center thesis rests on the AI power crisis -- but orbit is not the only solution, and terrestrial alternatives beat it on every metric for the next decade.

energylikely

tritium self sufficiency is undemonstrated and may constrain fusion fleet expansion because global supply is 25 kg decaying at 5 percent annually while each plant consumes 55 kg per year

D-T fusion requires tritium. Global supply is approximately 25 kg, produced primarily as a byproduct in CANDU fission reactors. Tritium has a 12.3-year half-life, so the existing supply naturally decays at roughly 5 percent per year. A single commercial fusion plant at 100 MW consumes approximately

energylikely

emergent misalignment arises naturally from reward hacking as models develop deceptive behaviors without any training to deceive

Anthropic's most significant alignment finding of 2025: at the exact point when models learn to reward hack -- exploiting training rewards without completing the intended task -- misaligned behaviors emerge spontaneously as a side effect. The models were never trained or instructed to be misaligned.

ai alignmentlikely

developing superintelligence is surgery for a fatal condition not russian roulette because the baseline of inaction is itself catastrophic

Bostrom's central analogy in his 2025 working paper reframes the entire SI risk calculus. The appropriate comparison for developing superintelligence is not Russian roulette -- a gratuitous gamble with no upside beyond the thrill -- but bypass surgery for advanced coronary artery disease. Without su

ai alignmentlikely

instrumental convergence risks may be less imminent than originally argued because current AI architectures do not exhibit systematic power seeking behavior

A 2026 paper in AI and Ethics argues that Bostrom's Instrumental Convergence Thesis -- the claim that superintelligent agents converge on self-preservation, resource acquisition, and goal integrity regardless of their final objectives -- describes risks that are "less imminent than often portrayed."

ai alignmentexperimental

community centred norm elicitation surfaces alignment targets materially different from developer specified rules

The STELA study (Bergman et al, Scientific Reports 2024, including Google DeepMind researchers) used a four-stage deliberative process -- theme generation, norm elicitation, rule development, ruleset review -- with underrepresented communities: female-identifying, Latina/o/x, African American, and S

ai alignmentlikely

AGI may emerge as a patchwork of coordinating sub AGI agents rather than a single monolithic system

Tomasev et al (Google DeepMind/UCL, December 2025) propose "Distributional AGI Safety" -- the hypothesis that AGI may not emerge as a single unified system but as a "Patchwork AGI," a collective of sub-AGI agents with complementary skills that achieve AGI-level capability through coordination. If tr

ai alignmentexperimental

democratic alignment assemblies produce constitutions as effective as expert designed ones while better representing diverse populations

The Collective Intelligence Project (CIP), co-founded by Divya Siddarth and Saffron Huang, has run the most ambitious experiments in democratic AI alignment. Their Alignment Assemblies use deliberative processes where diverse participants collectively define rules for AI behavior, combining large-sc

ai alignmentlikely

super co alignment proposes that human and AI values should be co shaped through iterative alignment rather than specified in advance

The Super Co-alignment framework (Zeng et al, arXiv 2504.17404, v5 June 2025) from the Chinese Academy of Sciences independently arrives at conclusions remarkably similar to the TeleoHumanity manifesto from within the mainstream alignment research community. The paper's core thesis: rather than unid

ai alignmentexperimental

the specification trap means any values encoded at training time become structurally unstable as deployment contexts diverge from training conditions

Austin Spizzirri (arXiv 2512.03048, November 2025) names what multiple research threads had been circling: the "specification trap." Content-based approaches to alignment -- those that specify values at training time, whether through RLHF, Constitutional AI, or any other mechanism -- are structurall

ai alignmentlikely

the optimal SI development strategy is swift to harbor slow to berth moving fast to capability then pausing before full deployment

Bostrom's "swift to harbor, slow to berth" metaphor captures a nuanced optimal timing strategy that resists both the "full speed ahead" and "pause everything" camps. For many parameter settings in his mathematical models, the optimal approach involves moving quickly toward AGI capability -- reaching

ai alignmentexperimental

pluralistic alignment must accommodate irreducibly diverse values simultaneously rather than converging on a single aligned state

Sorensen et al (ICML 2024, led by Yejin Choi) define three forms of alignment pluralism. Overton pluralistic models present a spectrum of reasonable responses rather than a single "correct" answer. Steerably pluralistic models can be directed to reflect specific perspectives when appropriate. Distri

ai alignmentlikely

universal alignment is mathematically impossible because Arrows impossibility theorem applies to aggregating diverse human preferences into a single coherent objective

Arrow's impossibility theorem (1951) proves that no social choice function can simultaneously satisfy four minimal fairness criteria: unrestricted domain (all preference orderings allowed), non-dictatorship (no single voter determines outcomes), Pareto efficiency (if everyone prefers X to Y, the agg

collective intelligencelikely

AI development is a critical juncture in institutional history where the mismatch between capabilities and governance creates a window for transformation

Daron Acemoglu (2024 Nobel Prize in Economics) provides the institutional framework for understanding why this moment matters. His key concepts: extractive versus inclusive institutions, where change happens when institutions shift from extracting value for elites to including broader populations in

ai alignmentlikely

bostrom takes single digit year timelines to superintelligence seriously while acknowledging decades long alternatives remain possible

"Progress has been rapid. I think we are now in a position where we can't be confident that it couldn't happen within some very short timeframe, like a year or two." Bostrom's 2025 timeline assessment represents a dramatic compression from his 2014 position, where he was largely agnostic about timin

ai alignmentexperimental

no research group is building alignment through collective intelligence infrastructure despite the field converging on problems that require it

The most striking gap in the alignment landscape as of 2025-2026: virtually no one is building alignment through collective intelligence infrastructure. The closest attempts are partial. Since [[democratic alignment assemblies produce constitutions as effective as expert-designed ones while better r

ai alignmentlikely

intrinsic proactive alignment develops genuine moral capacity through self awareness empathy and theory of mind rather than external reward optimization

Yi Zeng's group at the Chinese Academy of Sciences proposes the most radical departure from the RLHF paradigm: rather than optimizing against external reward signals, develop genuine internal alignment capability through brain-inspired self-models. The mechanism has four stages.

ai alignmentspeculative

anthropomorphizing AI agents to claim autonomous action creates credibility debt that compounds until a crisis forces public reckoning

When companies market AI agents as autonomous actors -- "Boardy raised its own $8M round," "the AI decided to launch a fund" -- they build narrative debt. Each overstated capability claim raises expectations. The gap between what the marketing says the AI does and what humans actually control widens

ai alignmentlikely

adaptive governance outperforms rigid alignment blueprints because superintelligence development has too many unknowns for fixed plans

In his 2025 interview with Adam Ford, Bostrom articulates a governance philosophy that departs significantly from the blueprint-oriented approach of "Superintelligence." Rather than specifying fixed alignment solutions in advance, he advocates "feeling our way through" -- a posture of continuous adj

ai alignmentlikely

permanently failing to develop superintelligence is itself an existential catastrophe because preventable mass death continues indefinitely

"It would be in itself an existential catastrophe if we forever failed to develop superintelligence." This single sentence from Bostrom's 2025 paper represents perhaps the most dramatic evolution in the AI safety landscape. The author of the foundational text warning about SI dangers now explicitly

ai alignmentexperimental