Defining a Data Governor (2022)

Last modified: August 09, 2021 • Reading Time: 6 minutes

As more people depend on data in their daily workflow, organizations are pressured to think critically about the quality of data being provided. Having a small team field all data questions will not scale, so companies must move from a centralized data organization to a decentralized one.

Why you need a Data Governor

According to Gartner, Data Governance is an effective program to manage and control the ever-growing amount of data in order to improve business outcomes. It helps ensure that the quality of data is high and compliance standards are adhered to. This does not happen with process alone; you need a Data Governor to drive and maintain Data Governance principles.

If your organization has built a Data Warehouse and has any of the following data services or restrictions, we strongly recommend appointing at least one Data Governor:

  • Self-service dashboards
  • Operates in an industry with regulations and compliance procedures
  • Has large data sources spanning different departments
  • Strives for operational intelligence

Without oversight, employees will misinterpret data, sensitive data may be shared inappropriately, employees will lack access to necessary data, and employees’ analysis will often be incorrect. A Data Governor will maintain and improve the quality of data and ensure your company is compliant with any regulations. It is a vital role to have for any informed company.

Data Governors for Data Governance

With the exploding volume of data within companies, it has become extremely difficult for a small technical team to govern an entire organization’s data. As this trend continues, these Data Scientists and Analysts should transition themselves from their traditional reporting responsibilities to those of Data Governors.

(Video) What is Data Governance?

In a traditional reporting role, their day was filled with answering questions for various business groups around their needed metrics. The shift to Data Governors finds them instead creating cleaned, documented data products for those end business groups to explore themselves.

This is called Democratized Data Governance, where the technical team (traditionally data gatekeepers) handles the technical aspects of governance and share the responsibilities of analytics with the end business groups.

The Roles of the Data Governor

As the Data Governor, everything addressed in all the chapters of this book is your responsibility. This is your manual. Your role changes at each stage of sophistication. You bravely lead your company from struggling to get value out of its data to producing accurate insights consistently. Let’s step through each of the roles you will play.

1. Data Cleanup and Maintenance

The majority of the technical work of data governance is around collecting, cleaning, and maintaining various data sets. This is a many-part activity that’s broken out here in subtypes.

Data Piping (ETL) and Warehousing

Data is going to exist in many different places inside of your organization. A big part of your job may consist of bringing those disparate sets of data together, where people can query across various sources. These combined places are data warehouses such as Google BigQuery or Amazon Redshift, and there are various Extract, Transform and Load (ETL) tools out there such as Stitch and Fivetran.

(Video) Introduction to Data Governance (Data Architecture | Data Governance)

Schema Cleanup/Modeling

For most companies the team collecting the data is also the team reporting on the data. The people on the team know all the ins and outs of the data. They can, for the most part, remember where the data they needed was and what tricky conditionals they’d have to put in each query (for example, not to count deleted or expired accounts).

But when organizations grow and their need to have access grows as well, the people exploring aren’t always the ones that put the data there. So, you have to clean up that data with the non-technical data explorer in mind.

Some BI products have ways to do this internally, but often it’s best and more reusable to do this on the database level. Just create new schemas in your database with a file full of your chosen views for that consumer. This is both a usability best practice and a security best practice.

Process and Auditing

Manually created data, such as that coming from your CRM, has a large margin for error. Also, how this data is recorded in CRMs is often determined by business users, not by data teams, so governance and data integrity can be less than ideal. For example, there may be two places sales reps need to manually enter the date of a call, or cancellation tracking may change when a new cancellation policy is put in place. Whenever there’s manually entered data, there will likely be discrepancies.

The way of handling this is to audit the data, ensure that it’s being recorded properly for the needed reports, and identify and develop missing processes with the managers of the relevant teams.

(Video) What is Data Governance? How Does it Impact Businesses?

Documentation

Again, the people exploring the data are no longer the people who put it there in the first place. Ideally you’ve now created clean, curated, and simple models for specific teams. Even so, you’ll still find a lot of benefit in documenting each table and column.

This can be done with a Wiki or leaving comments inside the database schema.

2. Permissions and Organization

Data security is obviously incredibly important. But besides that, permissions can be leveraged for proper organization. Data projects can get messy fast. Not everyone needs access to absolutely everything, especially if there is a clear process for requesting whatever additional information is needed.

Organizations today often strive to be highly transparent, but when over-transparency leads to confusion, it’s time to make the tradeoff for curating your team’s data experience.

3. Integrity Handling

It happens all the time: two people exploring data end up with two different values for the same metric. This can be one of the must frustrating moments for anyone working with data and can lead to some serious mistrust in the integrity of the data.

(Video) Why are there so many different Data Governance definitions?

There’s no way of stopping this, but it can be minimized. If the data is kept clean and well documented this problem should come up much less often. The best way to deal with it is to educate everyone on the fact that the problem does happen and they should expect and embrace it. Just as every product has bugs, every dataset does as well. When these inconsistencies are discovered, you have an opportunity to fix/solve/clarify them as soon as possible. One method of clarification is to build standardized metrics in your Data Warehouse model and point people to them when these discrepancies arise.

Ensure there’s a clear process for people to resolve these integrity issues. Be available to them and helpful when it’s reported. Maintaining a dataset is like maintaining a garden. There will always be weeds growing and more to do. It will never be perfect, but it can be beautiful.

4. Tool Selection

The Data Governor has to make decisions on what best fits your organization’s needs. Be mindful of tools that have high learning curves or have proprietary languages that lock you into a tool. Consider all the pieces of your data analytics stack and make sure tools you are selecting work well together.

5. Education/Enablement

No matter how well you’ve done your data cleaning, documentation and tool selection, you’re still going to have to educate your organization on how to use the data to get accurate and actionable insights.

Here are the things you must educate your organization on:

(Video) Defining data governance

  1. What’s available in the models
  2. How to use the BI tool
  3. Your process for prioritizing data requests, data sharing, and access
  4. Data Basics in databases, tables, data structures and SQL
  5. Quality versus Vanity Metrics
  6. Chart best practices

Written by: Matt David
Reviewed by: Dave Fowler

FAQs

How do you define data governance? ›

Data governance (DG) is the process of managing the availability, usability, integrity and security of the data in enterprise systems, based on internal data standards and policies that also control data usage. Effective data governance ensures that data is consistent and trustworthy and doesn't get misused.

What are the 3 key elements of good data governance? ›

A good data governance program typically includes the steering committee with three main groups: data owners, data stewards, and data custodians. The three positions all work together to create the policies, process, and procedures for governing data, especially the reference data and master data elements.

What questions does data governance answer? ›

General data governance interview questions
  • Tell me about yourself.
  • What are your biggest strengths?
  • Why did you choose to work in this field?
  • How did you find out about our company?
  • Where do you see yourself in five years?
  • Are you willing to relocate?
  • Tell me about your strengths.
  • Tell me about your weaknesses.
Nov 9, 2021

How do you ensure good data governance? ›

Data governance best practices
  1. Think with the big picture in mind, but start small. ...
  2. Build a business case. ...
  3. Metrics and more metrics. ...
  4. Communicate early and often. ...
  5. Account for the fact data governance is a marathon, not a sprint. ...
  6. Identify related roles and responsibilities.

What are the 4 pillars of data governance? ›

There are four pillars to the data governance framework to enable organizations to get the most out of their data.
  • Identify distinct use cases. ...
  • Quantify value. ...
  • Improve data capabilities. ...
  • Develop a scalable delivery model.
Mar 9, 2022

What does successful data governance look like? ›

Any successful data governance approach requires a high level of accountability. An organization's data governance will be aimless, blunt, and useless if no one takes responsibility. Across the organization, You must implement ownership and accountability.

What are the goals of data governance? ›

Goals and Principles for Data Governance
  • Enable better decision-making.
  • Reduce operational friction.
  • Protect the needs of data stakeholders.
  • Train management and staff to adopt common approaches to data issues.
  • Build standard, repeatable processes.
  • Reduce costs and increase effectiveness through coordination of efforts.

How do you create a data governance framework? ›

Creating your data governance framework
  1. Determine your data governance strategy. Create a data strategy by bringing together existing processes, people, and workflows.
  2. Start small. Start with just one business area or data issue and expand from there. ...
  3. Pick the right framework. ...
  4. Communicate. ...
  5. Keep it updated.

What is the difference between data management and data governance? ›

In the simplest terms, data governance establishes policies and procedures around data, while data management enacts those policies and procedures to compile and use that data for decision-making.

What is data governance tools? ›

A data governance tool is defined as a tool that aids in the process of creating and maintaining a structured set of policies, procedures, and protocols that control how an organization's data is stored, used, and managed.

Which of the following is critical for successful data governance? ›

The three critical aspects of building an effective data governance strategy are the people, processes, and technology.

What are the types of data governance? ›

Let's take a look at four of the most common data governance models:
  1. De-centralized Execution – Single Business Unit. ...
  2. De-Centralized Execution – Multiple Business Units. ...
  3. Centralized Governance – Single or Multiple Business Units. ...
  4. Centralized Data Governance & Decentralized Execution.
Aug 8, 2016

Why data governance is important example? ›

Data governance helps to ensure that data is usable, accessible and protected. Effective data governance leads to better data analytics, which in turn leads to better decision making and improved operations support.

What is the difference between data governance and data management? ›

Data Governance: The Difference Explained. In the simplest terms, data governance establishes policies and procedures around data, while data management enacts those policies and procedures to compile and use that data for decision-making.

Videos

1. What is a Data Governance Council? #datagovernance
(Lights OnData)
2. Data Governance Fundamentals - Nicola Askham
(Hyperight AB)
3. Data Management - Data Governance
(Global Data Store LLC)
4. Collibra Data Governance
(Collibra)
5. Data governance: why and what?
(dataroots)
6. Explanation Of Data Governance & Data Quality || Difference Between Data Governance & Data Quality
(Lights OnData)

Top Articles

Latest Posts

Article information

Author: Edmund Hettinger DC

Last Updated: 01/03/2023

Views: 5622

Rating: 4.8 / 5 (58 voted)

Reviews: 89% of readers found this page helpful

Author information

Name: Edmund Hettinger DC

Birthday: 1994-08-17

Address: 2033 Gerhold Pine, Port Jocelyn, VA 12101-5654

Phone: +8524399971620

Job: Central Manufacturing Supervisor

Hobby: Jogging, Metalworking, Tai chi, Shopping, Puzzles, Rock climbing, Crocheting

Introduction: My name is Edmund Hettinger DC, I am a adventurous, colorful, gifted, determined, precious, open, colorful person who loves writing and wants to share my knowledge and understanding with you.