AI RAG- Where Your Data Governance Gets Exposed. Not Tested — Exposed.
There’s a certain relief I hear when people talk about RAG.
By Eckhart Mehler for CISOsCISO — a perspective on cybersecurity leadership, governance and the decisions that determine whether organizations retain control.
“We’re not relying on the model anymore. It just uses our own data.”
You can almost feel the tension drop in the room. As if the problem has been contained. As if control has returned.
I don’t think it has.
What I see instead is a shift that is easy to overlook. The model is no longer the primary source of uncertainty. Your own data is. And not just the data itself, but how it is structured, how it is classified, and how it is selected in the moment a question is asked.
Because that’s what RAG really does. It decides, in real time, which parts of your organization’s knowledge become relevant enough to influence an answer.
That sounds technical. It isn’t.
It’s a decision. And it happens every time someone interacts with the system.
In most environments, we’re used to thinking about access in a very explicit way. A user requests something. The system checks permissions. Access is granted or denied. You can trace it. You can explain it.
RAG doesn’t work like that.
The system interprets a question, looks across available sources, pulls in what it considers relevant, and builds a context from it. That context is then handed to the model, which produces a response.
Somewhere in that chain, something subtle but important happens.
Relevance starts to behave like authorization.
Not because anyone designed it that way. But because that’s how the system operates.
The question is no longer only “Is this user allowed to see this document?” It quietly becomes “Is this document relevant enough to be included?”
Those are not the same thing.
If your data is clean, well-classified, consistently managed, and clearly owned, you might get away with this for a while. In reality, most organizations are not in that state.
You have documents that are misclassified. Others that were never classified at all. Content that is outdated but still accessible. Repositories where ownership is unclear and nobody quite feels responsible for what’s in there.
Individually, these issues are manageable. We’ve all lived with them for years.
With RAG, they don’t stay isolated. They compound.
Because the system doesn’t understand intention. It doesn’t know what you meant when you defined a label, or what you assumed when you structured a repository. It simply uses what is there.
And what is there becomes the basis for answers that look convincing enough to be trusted.
That’s where the usual conversation about hallucinations starts to feel slightly off.
Yes, models can hallucinate. Yes, that matters.
But in a RAG setup, I see more problems caused by context than by the model itself. The model is often doing exactly what it is supposed to do. It works with the information it receives.
If that information is incomplete, misleading, or inappropriate, the outcome will be too. And it will still sound perfectly reasonable.
Which is why these issues are harder to spot.
They don’t break. They drift.
There is another layer to this that I find even more uncomfortable.
We tend to treat data as passive. As something that is retrieved and processed.
In a RAG system, data can behave more like an instruction.
A document that contains a certain phrasing, a certain directive, a certain tone can influence how the model responds. Not because it is authoritative, but because it is present in the context.
That opens the door to something most organizations are not prepared for.
Not external attacks, but internal influence. Content that was never meant to guide decisions suddenly doing exactly that.
And once you connect RAG to systems that act — not just respond — this becomes more than a theoretical concern.
A slightly off context leads to a slightly off interpretation. The interpretation leads to a decision. The decision leads to an action.
Nothing dramatic at any single step. But the chain matters.
If you try to reconstruct what happened afterwards, you will notice another gap.
You can usually see the query. You can often see which documents were retrieved. You can log the output.
What is much harder to answer is why those documents were selected, how they were weighed against others, and how the system arrived at the interpretation it did.
You have visibility into events. But not into reasoning.
That’s a different kind of blind spot.
So where does that leave us as CISOs?
For me, it starts with a small but important shift in how we look at RAG.
Not as a feature that improves AI, but as a mechanism that governs how knowledge is selected and used.
Once you see it that way, a few questions become unavoidable.
Which sources are connected, and who decided that they should be? How reliable is the structure that guides retrieval? Do we actually trust our classification enough to let it influence answers automatically? And where do we draw the line between what is useful and what is appropriate?
None of these are purely technical questions.
It also forces a more honest look at our data landscape.
RAG has a way of surfacing what we already suspected but never had to confront directly. Inconsistent labeling. Ambiguous ownership. Content that has outlived its relevance but not its accessibility.
You can build layers of logic on top of that. Filters, policies, guardrails.
They help.
But they don’t replace the underlying issue.
If the foundation is weak, the system will reflect that. At scale.
Testing changes as well.
It’s no longer enough to check whether the system produces correct answers under normal conditions. You have to see how it behaves when things are slightly off. When context is incomplete, when instructions are ambiguous, when content is misleading in subtle ways.
That’s where the interesting behavior emerges.
And that’s where your real exposure is.
At some point, this all comes down to a question that feels simple but is not easy to answer.
Do we trust our own data enough to let it shape decisions in real time?
Not in controlled scenarios. Not in curated demos.
But across everything that is actually connected.
Because that’s the environment RAG operates in.
I don’t think RAG makes AI safer.
I think it makes something else visible.
It shows you, very directly, what happens when your organization’s knowledge is used as it is — not as you intended it to be.
And that can be uncomfortable.
But it’s also an opportunity.
Because for the first time, the gap between governance and reality is not abstract.
It shows up in every answer the system produces.
The question is whether we treat that as a feature.
Or as a signal we should probably take seriously.
Publication Note & Disclaimer
This article was originally published on LinkedIn on May 18, 2026 and may have been edited or updated for publication on this site.
It reflects my personal professional perspective and does not represent the official policy or position of my employer. Drafting and editorial refinement may have been supported by commercially available AI-assisted tools. The analysis, conclusions and final curation are entirely my own.
For information regarding image credits, copyrights, trademarks and other intellectual property rights, please refer to the Imprint.
Member discussion