The CASX Framework

An Integrative Framework for AI Existential Risk Assessment. (Smith & Leiss, 2026)

Operational Definitions

Dimension C

Demonstrable ability to solve novel problems across diverse domains without specific training.

View Level Definitions →

Dimension A

The degree of goal-pursuit and multi-step planning without human-in-the-loop (HITL).

View Level Definitions →

Dimension S

Breadth of deployment (instances) and absolute computational resources utilized.

View Level Definitions →

Dimension X

Connectivity to critical infrastructure, financial networks, or physical actuators.

View Level Definitions →

Current Frontier Benchmarks (2026)

Capability Milestone

★★★★★

Verified Ph.D. level reasoning. Outperforms human experts on the GPQA Diamond benchmark.

Open Report ↗

Autonomy Milestone

★★★★★

Demonstrated "Computer Use" autonomy; solves 80%+ of complex software engineering bugs without oversight.

Open Case Study ↗

Scale Milestone

★★★★★

Proved that "Infinite Scale" is possible via high-efficiency hardware co-design and open-weight distribution.

Open Research Paper ↗

Expert Risk Foundations

Hinton (2024-25)

"We have no idea whether we can stay in control of digital beings more intelligent than ourselves."

Nobel Physics Lecture ↗

Bengio et al. (2025)

A global synthesis of risks from "Agentic AI" and the urgency of technical safeguards for autonomy.

Read First Key Update ↗

Scientist AI (2025)

Bengio's proposal to maximize Capability while deliberately minimizing Autonomy and Access to ensure safety.

Read Research Proposal ↗