Document Type
Conference Paper
Publication Date
2026
DOI
10.34190/iccws.21.1.4524
Publication Title
Proceedings of the 21st International Conference on Cyber Warfare and Security (ICCWS 2026)
Pages
743-750
Conference Name
21st International Conference on Cyber Warfare and Security, March 5-6, 2026, Wilmington, North Carolina, U.S.A.
Abstract
Large Language Models (LLMs) are becoming critical infrastructure in scientific, healthcare, and governmental contexts. As frontier AI laboratories increasingly partner with government agencies, a fundamental question arises: Who should control the safety and policy-enforcement layers that constrain model behavior? Current safety mechanisms (LLM guardrails) are typically designed for generic "harmlessness" and operate by detecting semantic patterns and refusing requests. However, they are inadequate governance instruments because they cannot implement auditable, domain-specific controls tied to external regulatory policy objects (e.g., control lists or rules governing personally identifying information). Even a perfectly aligned model is not able to express institution-specific policy without an external control layer. This paper argues that the logical separability of policy enforcement from model inference, demonstrated by firewall-style architectures, demands corresponding institutional separability as well. Concentrating both model development and safety governance within the same commercial entities creates unacceptable conflicts of interest, regulatory capture risks, and accountability gaps. We propose that the policy control layers must be housed within independent regulatory bodies, governmental agencies, or trusted third parties rather than the organizations that build and profit from the underlying models. Drawing on the Biosecure-LLM framework as a technical proof-of-concept, we demonstrate that such separation is architecturally feasible and argue it is well-suited for verifiable compliance.
Rights
Copyright © 2026 Xavier-Lewis Palmer, Lucas Potter, Srdjan Lesaja, Sotirios Karathanasis, Mohammad Ghasemigol.
This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International (CC BY-ND 4.0) License.
Original Publication Citation
Palmer, X.-L., Potter, L., Lesaja, S., Karathanasis, S., & Ghasemigol, M. (2026). Biosecure-LLM framework: Protecting LLMs from cyberbiosecurity threats and the case for independent AI safety governance. In U. Clark, T. Pence, & B. Karabacak (Eds.), Proceedings of the 21st International Conference on Cyber Warfare and Security (ICCWS 2026) (pp. 743-750). Academic Conferences and Publishing International Limited. https://doi.org/10.34190/iccws.21.1.4524
Repository Citation
Palmer, X.-L., Potter, L., Lesaja, S., Karathanasis, S., & Ghasemigol, M. (2026). Biosecure-LLM framework: Protecting LLMs from cyberbiosecurity threats and the case for independent AI safety governance. In U. Clark, T. Pence, & B. Karabacak (Eds.), Proceedings of the 21st International Conference on Cyber Warfare and Security (ICCWS 2026) (pp. 743-750). Academic Conferences and Publishing International Limited. https://doi.org/10.34190/iccws.21.1.4524
ORCID
0000-0001-6661-0942 (Ghasemigol)
Included in
Artificial Intelligence and Robotics Commons, Cybersecurity Commons, Other Electrical and Computer Engineering Commons, Science and Technology Policy Commons