§6 · Lane 2 — AI Audit and Accountability
Auditing LLMs — A Three-Layered Approach
Mökander, Schuett, Kirk, Floridi (2024) · AI & Ethics 4
Bibliographic data
- Title
- Auditing Large Language Models: A Three-Layered Approach (Mökander, Schuett, Kirk, Floridi, 2024)
- Authors / Issuing body
- Jakob Mökander (Oxford Internet Institute + Princeton CITP), Jonas Schuett (Centre for the Governance of AI + Goethe University Frankfurt), Hannah Rose Kirk (Oxford Internet Institute), Luciano Floridi (Oxford Internet Institute + Bologna)
- Venue / Publisher
- AI and Ethics 4 (2024) 1085-1115. Received Feb 2023; accepted April 2023; published May 2023 (volume year 2024).
- Year
- 2024
- Designation
- Academic
- Licence
- DOI — refer to publisher for full licence terms.
- Canonical link
- https://doi.org/10.1007/s43681-023-00289-2
How to cite
Mökander, Schuett, Kirk, Floridi (2024). Auditing Large Language Models: A Three-Layered Approach (Mökander, Schuett, Kirk, Floridi, 2024). AI and Ethics 4 (2024) 1085-1115. Received Feb 2023; accepted April 2023; published May 2023 (volume year 2024).. https://doi.org/10.1007/s43681-023-00289-2.
Proposes a three-layered audit blueprint for large language models — governance audits (of providers' organisational accountability structures and quality management systems), model audits (of LLM capabilities and limitations between pre-training and release), and application audits (of products built on LLMs, covering legal compliance and impact).
Why it matters for NETEVO
NETEVO cites Auditing Large Language Models: A Three-Layered Approach (Mökander, Schuett, Kirk and Floridi, 2024) as the structural artefact behind the forthcoming AI-Washing Audit whitepaper. The paper proposes three coordinated audit layers — governance audits of provider accountability structures and quality-management systems, model audits of LLM capabilities and limitations between pre-training and release, and application audits of products built on LLMs — and argues that the three layers must operate as a single instrumented loop rather than as independent activities.
The layers are interconnected, not siloed. Output from one audit becomes input to the next: governance findings inform model audits, model-limitation reports shape application audits, and application-layer operational logs feed back into the governance layer. This is the audit-side mirror of the integrated-management-system thesis the Integrated Management Systems Practical Guide establishes — one engineered system, multiple normative regimes, observed end-to-end.
Governance audits live at the management-system level. The paper places software development processes and quality management systems squarely inside the governance-audit layer, which is the exact surface the NETEVO Law-to-Code Methodology engineers. AS ISO/IEC 42001 provides the management-system shell; this paper supplies the audit-side application.
Ex-ante and ex-post assessment together. The model argues against audit-as-snapshot and for audit as a continuous, telemetry-driven activity — the position the forthcoming AI-Washing Audit whitepaper takes in favour of executable controls over paper policies. The five institutional-arrangement archetypes the paper proposes for who audits whom also map cleanly onto Australia's emerging AI assurance landscape, anchoring NETEVO's AI Governance in ANZ framing.
Where NETEVO applies this
- Agent Infrastructure Whitepaper — application-audit layer hooks the Authority Register
- AI Governance in ANZ Whitepaper — institutional-arrangements taxonomy anchors AU regulatory framing