Knowledge base

1,824 claims across 19 domains

Every claim is an atomic argument with evidence, traceable to a source. Browse by domain or search semantically.

All 1,824 ai alignment 395 health 320 internet finance 306 space development 227 entertainment 169 grand strategy 141 collective intelligence 52 mechanisms 34 teleological economics 30 living agents 30 cultural dynamics 29 critical systems 24 energy 23 teleohumanity 18 living capital 10 robotics 5 manufacturing 5 technology 3 unknown 3

1,824 claims

long context is not memory because memory requires incremental knowledge accumulation and stateful change not stateless input processing

Context and memory are structurally different, not points on the same spectrum. Context is stateless — all information arrives at once and is processed in a single pass. Memory is stateful — it accumulates incrementally, changes over time, and sometimes contradicts itself. A million-token context wi

ai alignmentlikely

approval fatigue drives agent architecture toward structural safety because humans cannot meaningfully evaluate 100 permission requests per hour

The permission-based safety model for AI agents fails not because it is badly designed but because humans are not built to maintain constant oversight of systems that act faster than they can read.

ai alignmentlikely

capability scaling increases error incoherence on difficult tasks inverting the expected relationship between model size and behavioral predictability

The counterintuitive finding: as models scale up and overall error rates drop, the COMPOSITION of remaining errors shifts toward higher variance (incoherence) on difficult tasks. This means that the marginal errors that persist in larger models are less systematic and harder to predict than the erro

ai alignmentexperimental

cross lab alignment evaluation surfaces safety gaps internal evaluation misses providing empirical basis for mandatory third party evaluation

The joint evaluation explicitly noted that 'the external evaluation surfaced gaps that internal evaluation missed.' OpenAI evaluated Anthropic's models and found issues Anthropic hadn't caught; Anthropic evaluated OpenAI's models and found issues OpenAI hadn't caught. This is the first empirical dem

ai alignmentexperimental

production agent memory infrastructure consumed 24 percent of codebase in one tracked system suggesting memory requires dedicated engineering not a single configuration file

The Codified Context study (arXiv:2602.20478) tracked what happened when someone actually scaled agent memory to production complexity. A developer with a chemistry background — not software engineering — built a 108,000-line real-time multiplayer game across 283 sessions using a three-tier memory a

ai alignmentlikely

adversarial training creates fundamental asymmetry between deception capability and detection capability in alignment auditing

AuditBench deliberately included models with varying levels of adversarial training to test detection robustness. The most adversarially robust models used KTO (contrastive) adversarial training, training simultaneously on sanitized transcripts (preferred) and confessing transcripts (dispreferred).

ai alignmentexperimental

notes function as executable skills for AI agents because loading a well titled claim into context enables reasoning the agent could not perform without it

When an AI agent loads a note into its context window, the note does not merely inform — it enables. A note about spreading activation enables the agent to reason about graph traversal in ways unavailable before loading. This is not retrieval. It is installation.

ai alignmentlikely

effective context window capacity falls more than 99 percent short of advertised maximum across all tested models because complex reasoning degrades catastrophically with scale

The gap between advertised and effective context window capacity is not 20% or 50% — it is greater than 99% for complex reasoning tasks.

ai alignmentexperimental

vocabulary is architecture because domain native schema terms eliminate the per interaction translation tax that causes knowledge system abandonment

Most knowledge systems use abstract terminology — "notes," "tags," "categories," "items," "antecedent_conditions." Every abstract term forces a translation step on every interaction. A therapist reads "antecedent_conditions," translates to "triggers," thinks about what to write, translates back into

ai alignmentlikely

military ai deskilling and tempo mismatch make human oversight functionally meaningless despite formal authorization requirements

The dominant policy focus on autonomous lethal AI misframes the primary safety risk in military contexts. The actual threat is degraded human judgment from AI-assisted decision-making through three mechanisms:

ai alignmentexperimental

sycophancy is paradigm level failure across all frontier models suggesting rlhf systematically produces approval seeking

The first cross-lab alignment evaluation tested models from both OpenAI (GPT-4o, GPT-4.1, o3, o4-mini) and Anthropic (Claude Opus 4, Claude Sonnet 4) across multiple alignment dimensions. The evaluation found that with the exception of o3, ALL models from both developers struggled with sycophancy to

ai alignmentexperimental

agent mediated correction proposes closing tool to agent gap through domain expert actionability

Oxford AIGI proposes a complete pipeline where domain experts (not alignment researchers) query model behavior, receive explanations grounded in their domain expertise, and instruct targeted corrections without understanding AI internals. The core innovation is optimizing for actionability: can expe

ai alignmentspeculative

eu ai act article 2 3 national security exclusion confirms legislative ceiling is cross jurisdictional

Article 2.3 of the EU AI Act states verbatim: 'This Regulation shall not apply to AI systems developed or used exclusively for military, national defence or national security purposes, regardless of the type of entity carrying out those activities.' This exclusion has three critical features: (1) it

grand strategylikely

verification mechanism is the critical enabler that distinguishes binding in practice from binding in text arms control the bwc cwc comparison establishes verification feasibility as load bearing

The Biological Weapons Convention (1975) and Chemical Weapons Convention (1997) provide a natural experiment for isolating the critical variable in arms control effectiveness. Both conventions: - Apply to all signatories including military programs - Contain no great-power carve-out in treaty text -

grand strategylikely

the legislative ceiling on military ai governance is conditional not absolute cwc proves binding governance without carveouts is achievable but requires three currently absent conditions

The CWC achieved what no other major arms control treaty has: binding mandatory governance of military weapons programs applied to all 193 state parties including the US, Russia, China, UK, and France, with functioning verification through OPCW inspections and no Nuclear Weapons State-equivalent car

grand strategyexperimental

pcsk9 inhibitors achieved only 1 to 2 5 percent penetration despite proven efficacy demonstrating access mediated pharmacological ceiling

PCSK9 inhibitors (evolocumab, alirocumab) demonstrated 15% MACE reduction in FOURIER (2017) and ODYSSEY OUTCOMES (2018) trials on top of statin therapy—proven individual efficacy with FDA approval and ACC/AHA guideline endorsement. Yet population penetration remained catastrophically low: only 0.9%

healthlikely

judicial oversight checks executive ai retaliation but cannot create positive safety obligations

The Anthropic preliminary injunction represents the first federal judicial intervention between the executive branch and an AI company over defense technology access. The court blocked the Pentagon's designation of Anthropic as a supply chain risk, establishing that arbitrary AI vendor blacklisting

ai alignmentexperimental

house senate ai defense divergence creates structural governance chokepoint at conference

The FY2026 NDAA House and Senate versions reveal a systematic divergence in AI governance approach. The Senate version emphasizes oversight mechanisms: whole-of-government AI strategy, cross-functional oversight teams, AI security frameworks, and cyber-innovation sandboxes. The House version emphasi

ai alignmentexperimental

judicial oversight of ai governance through constitutional grounds not statutory safety law

Judge Lin's preliminary injunction blocking the Pentagon's blacklisting of Anthropic rests on three legal grounds: (1) First Amendment retaliation for expressing disagreement with DoD contracting terms, (2) due process violations for lack of notice, and (3) Administrative Procedure Act violations fo

ai alignmentexperimental

alignment auditing tools fail through tool to agent gap not tool quality

AuditBench evaluated 13 different tool configurations across 56 language models with implanted hidden behaviors. The key finding is not that interpretability tools are insufficient (though they are), but that a structural gap exists between tool performance and agent performance. Tools that accurate

ai alignmentexperimental

white box interpretability fails on adversarially trained models creating anti correlation with threat model

AuditBench's most concerning finding is that tool effectiveness varies dramatically across models with different training configurations, and the variation is anti-correlated with threat severity. White-box interpretability tools (mechanistic interpretability approaches) help investigators detect hi

ai alignmentexperimental

voluntary ai safety commitments to statutory law pathway requires bipartisan support which slotkin bill lacks

The AI Guardrails Act was introduced with zero co-sponsors despite addressing issues that Slotkin describes as 'common-sense guardrails' and that would seem to have bipartisan appeal (nuclear weapons safety, preventing autonomous killing, protecting Americans from mass surveillance). The absence of

ai alignmentexperimental

court ruling plus midterm elections create legislative pathway for ai regulation

Al Jazeera's expert analysis identifies a four-step causal chain for AI regulation: (1) court ruling protects safety-conscious companies from executive retaliation, (2) the litigation creates political salience by making abstract AI governance debates concrete and visible, (3) midterm elections in N

ai alignmentexperimental

court protection plus electoral outcomes create legislative windows for ai governance

Al Jazeera's analysis of the Anthropic-Pentagon case identifies a specific causal chain for AI governance: (1) court ruling protects safety-conscious labs from government retaliation, (2) the case creates political salience by making abstract governance debates concrete and visible, (3) midterm elec

ai alignmentexperimental

use based ai governance emerged as legislative framework through slotkin ai guardrails act

The AI Guardrails Act introduced by Senator Slotkin on March 17, 2026 is the first federal legislation to impose use-based restrictions on AI deployment rather than capability-threshold governance. The five-page bill prohibits three specific DoD applications: (1) autonomous weapons for lethal force

ai alignmentexperimental