Document Type

Conference Paper

Publication Date

2026

DOI

10.34190/iccws.21.1.4524

Publication Title

Proceedings of the 21st International Conference on Cyber Warfare and Security (ICCWS 2026)

Pages

743-750

Conference Name

21st International Conference on Cyber Warfare and Security, March 5-6, 2026, Wilmington, North Carolina, U.S.A.

Abstract

Large Language Models (LLMs) are becoming critical infrastructure in scientific, healthcare, and governmental contexts. As frontier AI laboratories increasingly partner with government agencies, a fundamental question arises: Who should control the safety and policy-enforcement layers that constrain model behavior? Current safety mechanisms (LLM guardrails) are typically designed for generic "harmlessness" and operate by detecting semantic patterns and refusing requests. However, they are inadequate governance instruments because they cannot implement auditable, domain-specific controls tied to external regulatory policy objects (e.g., control lists or rules governing personally identifying information). Even a perfectly aligned model is not able to express institution-specific policy without an external control layer. This paper argues that the logical separability of policy enforcement from model inference, demonstrated by firewall-style architectures, demands corresponding institutional separability as well. Concentrating both model development and safety governance within the same commercial entities creates unacceptable conflicts of interest, regulatory capture risks, and accountability gaps. We propose that the policy control layers must be housed within independent regulatory bodies, governmental agencies, or trusted third parties rather than the organizations that build and profit from the underlying models. Drawing on the Biosecure-LLM framework as a technical proof-of-concept, we demonstrate that such separation is architecturally feasible and argue it is well-suited for verifiable compliance.

Rights

Copyright © 2026 Xavier-Lewis Palmer, Lucas Potter, Srdjan Lesaja, Sotirios Karathanasis, Mohammad Ghasemigol.

This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International (CC BY-ND 4.0) License.

Original Publication Citation

Palmer, X.-L., Potter, L., Lesaja, S., Karathanasis, S., & Ghasemigol, M. (2026). Biosecure-LLM framework: Protecting LLMs from cyberbiosecurity threats and the case for independent AI safety governance. In U. Clark, T. Pence, & B. Karabacak (Eds.), Proceedings of the 21st International Conference on Cyber Warfare and Security (ICCWS 2026) (pp. 743-750). Academic Conferences and Publishing International Limited. https://doi.org/10.34190/iccws.21.1.4524

ORCID

0000-0001-6661-0942 (Ghasemigol)

Share

COinS