Data Quality for Business Leaders in the AI Age

13 Jun

Data Quality for Business Leaders in the AI Age

TJ

Tom Jenkin

Data quality issues are as common and ancient as data itself, with businesses continually wrestling with poor data quality and the challenges of 'broken' data. In my professional journey, I've worked with several enterprises across the energy, utility, and financial services sectors, where data presents both significant challenges and opportunities.

‍

From my direct experience, I've observed that business leaders frequently identify poor data quality as a major barrier to the adoption of AI and advanced data analytics. A research report by Mesh-AI identified data quality as the top barrier to AI adoption, amounting to over one third of the votes out of ten different factors. This fosters a “wait and see” approach as organisations delay initiatives that could deliver business value while they wait for an arbitrary data quality yardstick to be met.

‍

‍

This problem is intensified by the vast scale, complexity, and diversity of data sources that modern businesses need to consume, process, and manage. Many of these data sources originate outside the business's boundaries and are not under its control.

‍

Traditionally, businesses have tackled data quality issues in a static manner, such as through manual assessments and expensive one-off data cleansing exercises. This approach hasn't durably solved these issues, which isn't surprising since many businesses still view their data as a static asset and prioritise risk over value.

‍

However, there's growing recognition that in today's world, data is neither static nor solely owned by the organisation. The necessity to consume diverse data sources and enhance the quality and responsiveness of services using data to generate deep and actionable insights is now widely acknowledged. This is where the traditional static view of data and its quality falls short.

‍

To move forward, we must adopt a fundamentally different and pragmatic approach to data quality.

‍

A Pragmatic View of Data Quality

‍

When it comes to data quality, existing views have been shaped by how data has traditionally been utilised in enterprises, particularly to support fairly accurate, albeit not always timely, operational reporting. From this standpoint, data has often been categorised as either “broken” or “good” based on its perceived level of accuracy, completeness and validity.

‍

This approach is impractical as it evaluates data based on a perceived level of perfection within narrow dimensions. However:

Data is inherently complex and messy and can never be perfect or complete. It’s a misconception that data can be fixed once and for all.
While validity, accuracy and completeness are important aspects of data quality, data timeliness is often neglected, which diminishes what is achievable, especially in the context of modern ways of utilising data such as real time analytics or AI.

‍

Instead, we need to adopt a more pragmatic view of data quality that does away with the assumptions of single ownership and perfect completeness.

‍

We need to break down data quality into multiple measurable quality attributes that can be used to determine the suitability of a data set in meeting a real need.

‍

In practical terms, data quality is about the key features that make data useful. Each feature is a specific, measurable trait (called quality attribute) that helps us decide if the data is right for a certain job. The quality we see in the data is based on how these attributes measure up. People who use the data can articulate what they need by pointing out what attribute levels are suitable to them.

‍

A Shift in Thinking

‍

From	To
Seeing data quality as a static, binary state (good or bad).	Defining data quality as the ensemble of measurable quality attributes possessed by the data set.
Non-functional requirements as an afterthought to express data quality	Data quality attributes as an inherent part of the data.
Focusing only on validity, accuracy and completeness.	Considering timeliness to support advanced analytics.
Inferring perceived data quality indirectly through data usage and feedback rather than explicitly measuring it.	Measuring perceived data quality within the system and sharing it alongside the data itself.
Assuming data consumer needs are met through general data cleansing and management without specific attribute targets.	Articulating data consumer needs through the necessary values of data quality attributes.

‍

Data Quality in Practice

‍

Is this that simple? Having described the data in simple but clear terms, numerous powerful implications emerge:

‍

Data Quality as Part of the Data

‍

We’ve sidestepped the trap of labelling these attributes as non-functionals, thereby relegating them to an afterthought in the perception of data. They are integral to the data itself and should be central to identifying usable data (for instance, as part of data contracts visible through data catalogues). Quality attributes are a crucial factor in determining a data set's suitability, not just its content.

‍

Applied Product Thinking

‍

Now that data quality is defined by a set of universal attributes, applying product thinking and techniques to identify and prioritise quality improvements becomes much more straightforward. This approach, grounded in product risks such as usability, feasibility, viability, and value, makes it easier to incorporate this mindset into long-term data management strategies. It helps overcome the dilemma of data quality issues becoming persistent obstacles, clarifying where to begin and what to prioritise in terms of product features or data quality optimisations to enable new capabilities.

‍

Value-based Decision Making

‍

Optimising data quality certainly incurs costs and complexity, and sometimes these costs can be significant, especially if improvements necessitate adopting a new architecture, for example. This is where value-based discussions that balance needs and costs can occur, leading to changes that align with the realities and the needs of the business.

‍

Conclusion

‍

As a business leader, you know the critical role of data in creating value and have seen past efforts to enhance it fall short. What's next? Move from viewing data quality as static and binary to using a measurable framework that captures quality through specific attributes. Focus on optimising data quality to meet business needs effectively, applying product thinking. This shift in managing data quality is crucial for realising data-driven ambitions, especially with AI.

‍

Latest Stories

Why the Enterprise AI Journey Starts with Data: Welcoming Steve, Managing Director of AI

8 Jul

Most Enterprise Data is 'Good Enough' – The Real Problem is Strategic Alignment

17 Jun

Charting the Path to Frictionless Claims: Why Insurers Should Build the Capability Now

11 Jun

I would like to receive marketing communications regarding Mesh-AI news, services and events.

You may unsubscribe from these communications at any time. For more information on how to unsubscribe, our privacy practices, and how we are committed to protecting and respecting your privacy, please review Privacy Policy.

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.