Data March 30, 2012 4 min read

Self-service BI: how not to turn reporting freedom into a contradiction factory

Self-service analytics only works with a shared metrics vocabulary and data trust - otherwise every department arrives at the meeting with its own version of the truth.

The self-service BI idea looks attractive: analysts and managers build the reports they need, IT is not a bottleneck, decisions happen faster. On paper it is a win for everyone.

In practice I have watched it turn into a situation where every department has its own numbers, no set of numbers matches another, and meetings are spent arguing about data rather than making decisions. This is not a new problem - the root is usually dirty master data and the absence of a shared reference layer, which self-service tooling alone cannot fix.

Why freedom without a shared vocabulary breaks analytics

When everyone builds reports independently, discrepancies in definitions are inevitable. Is "revenue" inclusive or exclusive of tax? Is an "active customer" someone who bought this month, or in the last 90 days? Is "conversion" calculated from unique visitors or from sessions?

If these questions are not resolved centrally, each analyst answers them on their own. The result: finance shows one revenue figure, sales shows another, and both are correct by their own logic. But the company cannot operate with two versions of the truth.

This is not a tool problem. It is the absence of a shared vocabulary.

What a shared metrics vocabulary is

A metrics vocabulary is a document or registry in which each key metric has a recorded:

exact definition (what is included, what is not);
calculation formula;
data source;
owner - the person responsible for its correctness;
date of last definition update.

This is not academic work. It is a pragmatic tool that lets two people from different departments reference the same definition and arrive at the same number.

Without this, self-service BI produces not analytics but chaos with a polished interface.

Data trust is not an obvious problem

Even when metrics are defined, people will build reports outside the shared system if they do not trust the data inside it. This happens more often than it appears.

Typical signs of missing trust:

"The system says one thing, but we all know the real number is different."
"That figure is wrong - Maria counts it differently."
"Last month's data hasn't updated yet, so I pulled it from my own spreadsheet."

If phrases like these come up regularly, the reporting system is not the source of truth. It is one source among several, equal in standing to personal spreadsheets.

The solution is not to force everyone to use only the system. The solution is to understand why the data in the system does not earn trust, and fix that.

How to structure self-service so it works

A few principles that separate working implementations from broken ones:

A semantic layer. Instead of direct table access - a managed layer with pre-built metrics and dimensions. Analysts choose from verified definitions instead of creating their own.
Certified reports. A subset of reports is reviewed, versioned, and marked as "official". The rest are working drafts, not for decision-making.
A metric change process. If a metric definition changes, it goes through the owner, with notification to dependent reports.
Data lineage. An analyst should be able to answer "where did this number come from" in a reasonable amount of time.

Self-service is not "everyone is their own analyst". It is "everyone can independently ask questions of a shared, verified knowledge base".

A practical maturity check

A few questions that help assess where a company stands:

If two managers independently calculate "revenue for last quarter", do they get the same number?
Is there a single place where the official definition of each key metric is recorded?
When numbers diverge in a meeting, is there a clear process for reconciling them?
Who is responsible for the correctness of data in the BI system?

If at least two of these questions have no clear answer, self-service BI is currently producing more questions than answers.

Back to all posts

Contact

Why freedom without a shared vocabulary breaks analytics

What a shared metrics vocabulary is

Data trust is not an obvious problem

How to structure self-service so it works

A practical maturity check

If this resonated, write to me. I reply personally.