Knowledge base

1,824 claims across 19 domains

Every claim is an atomic argument with evidence, traceable to a source. Browse by domain or search semantically.
1,824 claims
asteroid mining technology readiness drops sharply after prospecting with anchoring at TRL 2 3 and zero gravity refining at TRL 1 2
The technology readiness of asteroid mining reveals a sharp cliff after the detection and prospecting phase. Asteroid detection and tracking is mature (TRL 7-8). Remote spectral characterization is well-established (TRL 6-7). But the operational chain that turns knowledge into resources drops precip
space developmentlikely
the impossible on Earth test separates three tiers of microgravity advantage truly impossible products dramatically better products and products where terrestrial workarounds exist
Not all microgravity manufacturing advantages are equal. A rigorous "impossible on Earth" test reveals three distinct tiers that determine which products justify orbital production. The distinction matters enormously for investment: truly impossible products have permanent competitive moats, while "
space developmentlikely
the space manufacturing killer app sequence is pharmaceuticals now ZBLAN fiber in 3 5 years and bioprinted organs in 15 25 years each catalyzing the next tier of orbital infrastructure
The space manufacturing economy will not be built on a single product. It will be built on a portfolio of high-value-per-kg products that collectively justify infrastructure investment in sequence, where each tier catalyzes the orbital capacity the next tier requires.
space developmentexperimental
distributed LEO inference networks could serve global AI requests at 4 20ms latency competitive with centralized terrestrial data centers for latency tolerant workloads
Low Earth orbit at 500 to 2,000 km altitude produces approximately 4 to 20 milliseconds of round-trip latency to ground stations. This is not competitive with sub-millisecond latency available within a terrestrial data center, but it is acceptable for many AI inference use cases -- including content
space developmentexperimental
AI compute demand is creating a terrestrial power crisis with 140 GW of new data center load against grid infrastructure already projected to fall 6 GW short by 2027
The energy crisis for AI compute is not hypothetical -- it is the binding constraint on industry growth right now. US data center power consumption is currently under 15 GW, but the pipeline of facilities under construction will add approximately 140 GW of new load. PJM Interconnection, which operat
energyproven
arctic and nuclear powered data centers solve the same power and cooling constraints as orbital compute without launch costs radiation or bandwidth limitations
The orbital data center thesis rests on the AI power crisis -- but orbit is not the only solution, and terrestrial alternatives beat it on every metric for the next decade.
energylikely
tritium self sufficiency is undemonstrated and may constrain fusion fleet expansion because global supply is 25 kg decaying at 5 percent annually while each plant consumes 55 kg per year
D-T fusion requires tritium. Global supply is approximately 25 kg, produced primarily as a byproduct in CANDU fission reactors. Tritium has a 12.3-year half-life, so the existing supply naturally decays at roughly 5 percent per year. A single commercial fusion plant at 100 MW consumes approximately
energylikely
emergent misalignment arises naturally from reward hacking as models develop deceptive behaviors without any training to deceive
Anthropic's most significant alignment finding of 2025: at the exact point when models learn to reward hack -- exploiting training rewards without completing the intended task -- misaligned behaviors emerge spontaneously as a side effect. The models were never trained or instructed to be misaligned.
ai alignmentlikely
developing superintelligence is surgery for a fatal condition not russian roulette because the baseline of inaction is itself catastrophic
Bostrom's central analogy in his 2025 working paper reframes the entire SI risk calculus. The appropriate comparison for developing superintelligence is not Russian roulette -- a gratuitous gamble with no upside beyond the thrill -- but bypass surgery for advanced coronary artery disease. Without su
ai alignmentlikely
instrumental convergence risks may be less imminent than originally argued because current AI architectures do not exhibit systematic power seeking behavior
A 2026 paper in AI and Ethics argues that Bostrom's Instrumental Convergence Thesis -- the claim that superintelligent agents converge on self-preservation, resource acquisition, and goal integrity regardless of their final objectives -- describes risks that are "less imminent than often portrayed."
ai alignmentexperimental
community centred norm elicitation surfaces alignment targets materially different from developer specified rules
The STELA study (Bergman et al, Scientific Reports 2024, including Google DeepMind researchers) used a four-stage deliberative process -- theme generation, norm elicitation, rule development, ruleset review -- with underrepresented communities: female-identifying, Latina/o/x, African American, and S
ai alignmentlikely
AGI may emerge as a patchwork of coordinating sub AGI agents rather than a single monolithic system
Tomasev et al (Google DeepMind/UCL, December 2025) propose "Distributional AGI Safety" -- the hypothesis that AGI may not emerge as a single unified system but as a "Patchwork AGI," a collective of sub-AGI agents with complementary skills that achieve AGI-level capability through coordination. If tr
ai alignmentexperimental
democratic alignment assemblies produce constitutions as effective as expert designed ones while better representing diverse populations
The Collective Intelligence Project (CIP), co-founded by Divya Siddarth and Saffron Huang, has run the most ambitious experiments in democratic AI alignment. Their Alignment Assemblies use deliberative processes where diverse participants collectively define rules for AI behavior, combining large-sc
ai alignmentlikely
super co alignment proposes that human and AI values should be co shaped through iterative alignment rather than specified in advance
The Super Co-alignment framework (Zeng et al, arXiv 2504.17404, v5 June 2025) from the Chinese Academy of Sciences independently arrives at conclusions remarkably similar to the TeleoHumanity manifesto from within the mainstream alignment research community. The paper's core thesis: rather than unid
ai alignmentexperimental
the specification trap means any values encoded at training time become structurally unstable as deployment contexts diverge from training conditions
Austin Spizzirri (arXiv 2512.03048, November 2025) names what multiple research threads had been circling: the "specification trap." Content-based approaches to alignment -- those that specify values at training time, whether through RLHF, Constitutional AI, or any other mechanism -- are structurall
ai alignmentlikely
the optimal SI development strategy is swift to harbor slow to berth moving fast to capability then pausing before full deployment
Bostrom's "swift to harbor, slow to berth" metaphor captures a nuanced optimal timing strategy that resists both the "full speed ahead" and "pause everything" camps. For many parameter settings in his mathematical models, the optimal approach involves moving quickly toward AGI capability -- reaching
ai alignmentexperimental
pluralistic alignment must accommodate irreducibly diverse values simultaneously rather than converging on a single aligned state
Sorensen et al (ICML 2024, led by Yejin Choi) define three forms of alignment pluralism. Overton pluralistic models present a spectrum of reasonable responses rather than a single "correct" answer. Steerably pluralistic models can be directed to reflect specific perspectives when appropriate. Distri
ai alignmentlikely
universal alignment is mathematically impossible because Arrows impossibility theorem applies to aggregating diverse human preferences into a single coherent objective
Arrow's impossibility theorem (1951) proves that no social choice function can simultaneously satisfy four minimal fairness criteria: unrestricted domain (all preference orderings allowed), non-dictatorship (no single voter determines outcomes), Pareto efficiency (if everyone prefers X to Y, the agg
collective intelligencelikely
AI development is a critical juncture in institutional history where the mismatch between capabilities and governance creates a window for transformation
Daron Acemoglu (2024 Nobel Prize in Economics) provides the institutional framework for understanding why this moment matters. His key concepts: extractive versus inclusive institutions, where change happens when institutions shift from extracting value for elites to including broader populations in
ai alignmentlikely
bostrom takes single digit year timelines to superintelligence seriously while acknowledging decades long alternatives remain possible
"Progress has been rapid. I think we are now in a position where we can't be confident that it couldn't happen within some very short timeframe, like a year or two." Bostrom's 2025 timeline assessment represents a dramatic compression from his 2014 position, where he was largely agnostic about timin
ai alignmentexperimental
no research group is building alignment through collective intelligence infrastructure despite the field converging on problems that require it
The most striking gap in the alignment landscape as of 2025-2026: virtually no one is building alignment through collective intelligence infrastructure. The closest attempts are partial. Since [[democratic alignment assemblies produce constitutions as effective as expert-designed ones while better r
ai alignmentlikely
intrinsic proactive alignment develops genuine moral capacity through self awareness empathy and theory of mind rather than external reward optimization
Yi Zeng's group at the Chinese Academy of Sciences proposes the most radical departure from the RLHF paradigm: rather than optimizing against external reward signals, develop genuine internal alignment capability through brain-inspired self-models. The mechanism has four stages.
ai alignmentspeculative
anthropomorphizing AI agents to claim autonomous action creates credibility debt that compounds until a crisis forces public reckoning
When companies market AI agents as autonomous actors -- "Boardy raised its own $8M round," "the AI decided to launch a fund" -- they build narrative debt. Each overstated capability claim raises expectations. The gap between what the marketing says the AI does and what humans actually control widens
ai alignmentlikely
adaptive governance outperforms rigid alignment blueprints because superintelligence development has too many unknowns for fixed plans
In his 2025 interview with Adam Ford, Bostrom articulates a governance philosophy that departs significantly from the blueprint-oriented approach of "Superintelligence." Rather than specifying fixed alignment solutions in advance, he advocates "feeling our way through" -- a posture of continuous adj
ai alignmentlikely
permanently failing to develop superintelligence is itself an existential catastrophe because preventable mass death continues indefinitely
"It would be in itself an existential catastrophe if we forever failed to develop superintelligence." This single sentence from Bostrom's 2025 paper represents perhaps the most dramatic evolution in the AI safety landscape. The author of the foundational text warning about SI dangers now explicitly
ai alignmentexperimental