Data Proliferation

Jonathan Poland

Data proliferation is the rapid growth of data in an organization, which often produces large amounts of replicated and low-quality data. Such data can be costly to manage and may pose compliance and operational risks. An organization may need to analyze it to understand its structure, sources, and uses, yet much of it may ultimately have little value and still be difficult to discard. The following are illustrative examples of data proliferation.

Customer Data

It is common for multiple systems in an organization to maintain customer data. Such data is commonly out of sync between systems with no clear single source of truth. This can cause operational failures such as sending a bill to the wrong address.
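One way to make the out-of-sync problem concrete is to compare the same customer's address across systems. The following is a minimal sketch, not a definitive reconciliation method: the system names (`billing`, `crm`), the customer IDs, the addresses, and the function `find_address_conflicts` are all hypothetical, and the normalization (lowercasing and collapsing whitespace) is an assumed simplification that real address matching would need to go well beyond.

```python
from collections import defaultdict


def find_address_conflicts(systems):
    """Return customer IDs whose normalized address differs between systems.

    `systems` maps a system name to a dict of customer_id -> address.
    """
    seen = defaultdict(dict)  # customer_id -> {system_name: normalized address}
    for system_name, records in systems.items():
        for customer_id, address in records.items():
            # Assumed normalization: lowercase and collapse runs of whitespace.
            seen[customer_id][system_name] = " ".join(address.lower().split())
    # Keep only customers for whom at least two systems disagree.
    return {
        customer_id: by_system
        for customer_id, by_system in seen.items()
        if len(set(by_system.values())) > 1
    }


# Hypothetical records from two systems that both store customer addresses.
billing = {"C001": "12 Oak St", "C002": "9 Elm Ave"}
crm = {"C001": "12 oak st", "C002": "41 Pine Rd"}

conflicts = find_address_conflicts({"billing": billing, "crm": crm})
print(conflicts)
```

Here customer `C001` matches after normalization, while `C002` is reported as a conflict. A report like this only shows that the systems disagree; deciding which system holds the correct address still requires a designated source of truth.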

Documents

Knowledge workers tend to create a lot of documents that get checked into a document management system. In many cases, such documents become completely unused with time but are retained as a precaution.

Communication

Communications such as emails can accumulate at a rate of hundreds per employee per day. Most communications lose their value almost immediately but are often retained for an extended period of time.

Backups

Backups of data, documents and communications often need to be retained in case something important was deleted from the source systems. If someone deletes a critical email, the only copy may be in a backup from a particular day last year. As such, backups are commonly stored for long periods of time. This can consume considerable resources despite the fact that backups are rarely used.

Transactional Data

Transactional data such as market trades and website purchases can grow extremely quickly. Transactional data is often viewed as valuable for historical research. For example, it is common to look at patterns in stock trades going back decades.

Social Data

Social data is data that people share on a public or private social network. It is often viewed as valuable for purposes such as market research and machine learning.

Sensors & Machines

Machines and sensors generate data automatically. Sensors have become so cheap that they can be embedded in everyday objects in great numbers. Such data may be generally less valuable than human-generated data. For example, video of a train tunnel or data from a tire pressure sensor isn’t interesting for long. Nevertheless, sensor data potentially represents a gigantic source of data that is far larger than all other sources combined.
