For enterprise technology leaders, the current debate around Europe’s AI rules can sound deceptively simple. According to TechRadar, if regulators delay some high-risk AI requirements or soften parts of the compliance burden, it can appear as though deployment should become easier, with fewer reporting layers and fewer obstacles between proof of concept and production. However, the risk does not disappear; it moves downstream, onto the organizations actually deploying the systems. Regulatory flexibility on paper does not translate into reduced accountability in practice.
The burden shift: from compliance paperwork to operational proof
Most organizations are not deploying AI tools in a vacuum. According to TechRadar, they are using it in customer communications, operational workflows, compliance checks, document handling, claims processes and internal decision support, where outputs have real consequences and where “the model got it wrong” is not a defensible answer. Boards, risk teams and operational owners will still need answers to the same basic questions: why did the system produce this output, what shaped that decision, what happens when it is uncertain, and can its reasoning be reviewed after the fact?
The old assumption was that regulation would tell businesses exactly what “responsible AI” looked like. Now, many technology leaders are discovering that compliance is only part of the problem. The harder challenge is proving that an AI system is dependable enough to use in workflows where mistakes carry serious consequences.
Why large language models alone fall short
Most of the AI now being deployed in enterprise settings is built on large language models (LLMs). These systems are powerful but probabilistic by design: they generate the most likely next output based on patterns in data, rather than reasoning through a problem in a transparent, rule-bound way. That makes them useful for drafting, summarizing and handling ambiguity, but much less suited to workflows where decisions need to be consistent, traceable and easy to justify after the fact.
TechRadar notes that human-in-the-loop is often a weaker safeguard than it first appears. If the human reviewer is simply being asked to sense-check an output from a black-box model that cannot explain how it reached its answer, then the organization has not solved the trust problem. It has just inserted a manual backstop into an unreliable process. This may reduce legal exposure in the short term, but it does not improve productivity, accountability or confidence. It also scales poorly, as human oversight of every AI output defeats the purpose of automation.
Four questions for enterprise buyers
TechRadar outlines four questions worth bringing into any procurement or deployment decision:
- Can the system explain how it arrived at an answer in a way a non-specialist reviewer can follow? Not just a plausible summary, but expose the logic, rules or constraints that shaped the outcome.
- Does it know when not to answer? In high-stakes settings, a useful AI system is one that can recognize ambiguity, defer, escalate or say “don’t know” when confidence is too low. An LLM will rarely do this as it is optimized to respond, even when it doesn’t know the answer.
- Can it be audited after the fact? If a regulator, customer or internal reviewer asks why a decision was made, teams need more than a confidence score and a generic disclaimer. They need a trail.
- Is the architecture suited to the type of problem being solved? This is where neurosymbolic AI becomes directly relevant. Neural systems – LLMs – are powerful at pattern recognition and language flexibility. Symbolic systems are strong at rules, constraints, consistency and auditability.
The article draws a clear analogy: when a spreadsheet calculates a result of a formula, nobody checks twice whether it may have hallucinated an alternative answer. That is the standard enterprises need from AI in regulated workflows.
| Question | Key consideration |
|---|---|
| Explainability | Can logic be followed by non-specialists? |
| Uncertainty handling | Can system say “don’t know”? |
| Auditability | Is there a decision trail? |
| Architectural fit | Is neurosymbolic AI needed? |
Neurosymbolic AI combines both, using neural capability to interpret language and extract information, while applying symbolic reasoning to determine and explain outcomes. According to TechRadar, organizations including Lloyds Banking Group are already piloting these approaches in regulated environments.
The real enterprise risk is opaque outputs, not regulation
For years, the industry has tended to treat transparency as something that can be added once a system has already been built – through disclosures, warnings or compliance dashboards. The reality of enterprise deployment is exposing the limits of that approach. If a system is opaque by design, no amount of paperwork will make it truly trustworthy.
As TechRadar emphasizes: "If transparency obligations weaken, enterprises don’t escape accountability, they absorb it." The organizations that move most effectively from pilot to production will not be the ones taking the most permissive view of compliance. They will be the ones choosing architectures, controls and operating models that can stand up to scrutiny from the start.
This article was produced as part of TechRadar Pro Perspectives, featuring the views of a Solutions Lead at UnlikelyAI. The views expressed are those of the author and not necessarily those of TechRadarPro or Future plc.