Data Proliferation

Data Proliferation

Data Proliferation Jonathan Poland

Data proliferation refers to the rapid growth of data, often resulting in a large amount of replicated and low-quality data. This can be costly to manage and may pose compliance and operational risks to an organization. While it may be necessary to analyze this data in order to understand its structure, sources, and uses, it may ultimately have little value to the organization and can be difficult to discard. The following are illustrative examples of data proliferation.

Customer Data

It is common for multiple systems in an organization to maintain customer data. Such data is commonly out of sync between systems with no clear single source of truth. This can cause operational failures such as sending a bill to the wrong address.

Documents

Knowledge workers tend to create a lot of documents that get checked into a document management system. In many cases, such documents become completely unused with time but are retained as a precaution.

Communication

Communications such as emails can gather at the rate of hundreds per employee per day. Most communications lose their value almost immediately but often are retained for an extended period of time.

Backups

Backups of data, documents and communications often need to be retained in case something important was deleted from the source systems. If someone deletes a critical email, the only copy may be in a backup from a particular day last year. As such, backups are commonly stored for long periods of time. This can consume considerable resources despite the fact that backups are rarely used.

Transactional Data

Transactional data such as market trades and website purchases can grow extremely quickly. Transactional data is often viewed as valuable for historical research. For example, it is common to look at patterns in stock trades going back decades.

Social Data

Data that is shared by people on a public or private social network. Often viewed as valuable for purposes such as market research and machine learning.

Sensors & Machines

Machine and sensor generated data. Sensors have become cheap to the extent than they can be embedded in everyday objects in great numbers. Such data may be generally less valuable than human generated data. For example, video of a train tunnel or data from a tire pressure sensor isn’t interesting for long. Nevertheless, sensor data potentially represents a gigantic source of data that is far larger than all other sources combined.

Learn More
Process Automation Jonathan Poland

Process Automation

Introduction: Process automation refers to the use of information systems to automate business processes in order to improve efficiency and…

Business Risk Jonathan Poland

Business Risk

A business risk is a potential event or situation that could negatively impact an organization’s ability to achieve its objectives.…

What is a Self-Replicating Machine? Jonathan Poland

What is a Self-Replicating Machine?

Self-replicating machines are robots or nanobots that are capable of producing copies of themselves, using scavenged materials and energy to…

Buying Behavior Jonathan Poland

Buying Behavior

Buying behavior refers to the actions and decisions made by consumers when purchasing goods or services. These are relevant to…

Latent Need Jonathan Poland

Latent Need

A latent need is a customer need that is not currently being met by the market and is not actively…

Tribes Jonathan Poland

Tribes

Tribes are groups of people who self-organize around common interests, values, communities, professions, needs, or aspirations. The concept of tribes…

Mission Statement Jonathan Poland

Mission Statement

A mission statement is a statement of purpose that defines the goals and values of an organization. It is a…

Pricing Power Jonathan Poland

Pricing Power

Pricing power refers to a company’s ability to increase prices without significantly impacting demand for their products or services. This…

Contract Awards Calendar 150 150 Jonathan Poland

Contract Awards Calendar

Governments around the world typically follow a structured and organized process for awarding contracts to suppliers, contractors, and service providers.…

Content Database

Search over 1,000 posts on topics across
business, finance, and capital markets.

Risk Prevention Jonathan Poland

Risk Prevention

Risk prevention is the process of identifying, assessing, and mitigating potential risks that may arise in a given situation. It…

Types of Win-Win Jonathan Poland

Types of Win-Win

Win-win, also known as mutually beneficial, refers to a situation or plan that has the potential to benefit all parties…

Unknown Risk Jonathan Poland

Unknown Risk

An unknown risk is a potential loss that is not recognized or identified. In the context of risk management, unknown…

What is FMCG? Jonathan Poland

What is FMCG?

Fast moving consumer goods (FMCG) are products that are sold quickly and at a relatively low cost. These products are…

Income Statement Jonathan Poland

Income Statement

An income statement is a financial statement that shows a company’s revenues, expenses, and profits over a specific period of…

What is Avoidance? Jonathan Poland

What is Avoidance?

Avoidance is the act of avoiding something that one finds unpleasant or inconvenient. This can involve a variety of different…

Communication Strengths Jonathan Poland

Communication Strengths

Communication strengths are qualities or abilities that enable an individual to communicate effectively. These can include general communication skills, such…

Operations Plan Jonathan Poland

Operations Plan

An operations plan is a document that outlines the steps a business will take to establish, improve, or expand its…

Market Environment Jonathan Poland

Market Environment

The market environment refers to all of the factors that can impact a company’s strategy, decision making, and tactics. This…