AI Safety Incident Classification

All incidents in the AI Incident Database have been processed using an LLM and classified according to the MIT Risk Repository causal and domain taxonomies then scored for harm-severity on 10 different dimensions based on the CSET AI Harm Taxonomy, using a scale to reflect impact from zero to 'worst-case catastrophe'.

This is intended as a proof of concept to explore the potential capabilities and limitations of a scalable incident analysis framework. Write-up of this work to follow, but in the meantime, please feel free to explore and share feedback

Example outputs: