Data Proliferation

Data Proliferation

Data Proliferation Jonathan Poland

Data proliferation refers to the rapid growth of data, often resulting in a large amount of replicated and low-quality data. This can be costly to manage and may pose compliance and operational risks to an organization. While it may be necessary to analyze this data in order to understand its structure, sources, and uses, it may ultimately have little value to the organization and can be difficult to discard. The following are illustrative examples of data proliferation.

Customer Data

It is common for multiple systems in an organization to maintain customer data. Such data is commonly out of sync between systems with no clear single source of truth. This can cause operational failures such as sending a bill to the wrong address.

Documents

Knowledge workers tend to create a lot of documents that get checked into a document management system. In many cases, such documents become completely unused with time but are retained as a precaution.

Communication

Communications such as emails can gather at the rate of hundreds per employee per day. Most communications lose their value almost immediately but often are retained for an extended period of time.

Backups

Backups of data, documents and communications often need to be retained in case something important was deleted from the source systems. If someone deletes a critical email, the only copy may be in a backup from a particular day last year. As such, backups are commonly stored for long periods of time. This can consume considerable resources despite the fact that backups are rarely used.

Transactional Data

Transactional data such as market trades and website purchases can grow extremely quickly. Transactional data is often viewed as valuable for historical research. For example, it is common to look at patterns in stock trades going back decades.

Social Data

Data that is shared by people on a public or private social network. Often viewed as valuable for purposes such as market research and machine learning.

Sensors & Machines

Machine and sensor generated data. Sensors have become cheap to the extent than they can be embedded in everyday objects in great numbers. Such data may be generally less valuable than human generated data. For example, video of a train tunnel or data from a tire pressure sensor isn’t interesting for long. Nevertheless, sensor data potentially represents a gigantic source of data that is far larger than all other sources combined.

Brand Quality Jonathan Poland

Brand Quality

Brand quality is the perception of the level of excellence that a brand achieves in the eyes of its customers.…

The Importance of Lobbying 150 150 Jonathan Poland

The Importance of Lobbying

Lobbying is the act of influencing or attempting to influence the decisions of government officials, legislators, or regulators on behalf…

IT Operations Jonathan Poland

IT Operations

IT operations involves the delivery and management of information technology services, including the implementation of processes and systems to support…

Mass Marketing Jonathan Poland

Mass Marketing

Mass marketing, also known as mass media marketing, refers to a marketing strategy that involves using a single marketing message…

Project Metrics Jonathan Poland

Project Metrics

Project metrics are methods for measuring the progress and performance of a project. They are typically tracked continuously in order…

Objection Handling Jonathan Poland

Objection Handling

Objection handling is the practice of addressing and overcoming concerns or hesitations that customers may have about making a purchase.…

Collective Intelligence Jonathan Poland

Collective Intelligence

Collective intelligence refers to the ability of a group to solve problems, make decisions, and generate new ideas more effectively…

What is FMCG? Jonathan Poland

What is FMCG?

Fast moving consumer goods (FMCG) are products that are sold quickly and at a relatively low cost. These products are…

What is Globalization? Jonathan Poland

What is Globalization?

Globalization refers to the increasing interconnectedness and interdependence of the world’s economies, cultures, and populations, brought about by advances in…

Learn More

Service Level Objective Jonathan Poland

Service Level Objective

An service level objective (SLO) is a standard used to measure the performance of a business or technology service. These…

Customer Expectations Jonathan Poland

Customer Expectations

Customer expectations refer to the base assumptions that customers make about a brand, its products and services, and the overall…

Sales Metrics Jonathan Poland

Sales Metrics

Sales metrics are commonly used to assess the performance of a sales team or individual salesperson. These metrics can be…

Leadership Development Jonathan Poland

Leadership Development

Leadership development is the process of helping employees develop the necessary skills and competencies to take on leadership roles within…

Management Decisions Jonathan Poland

Management Decisions

Management decisions are decisions that pertain to the direction and control of a company or organization. These decisions may cover…

First Principles Thinking Jonathan Poland

First Principles Thinking

Overview First principles thinking is a method of reasoning that involves breaking down complex problems into their most basic and…

Exit Planning 150 150 Jonathan Poland

Exit Planning

Exit planning is a comprehensive strategy for business owners to transition out of their company on their terms. It involves…

Budget Variance Jonathan Poland

Budget Variance

Budget variance is the difference between the budgeted amount and the actual amount spent on a department, team, project, or…

Adoption Lifecycle Jonathan Poland

Adoption Lifecycle

The adoption lifecycle refers to the process by which customers adopt and become familiar with a new product or technology.…