Data Proliferation

Jonathan Poland

Data proliferation is the rapid growth of data in an organization, which often results in large amounts of replicated and low-quality data. Such data can be costly to manage and may pose compliance and operational risks. An organization may need to analyze the data to understand its structure, sources, and uses, only to find that it has little value and is nonetheless difficult to discard. The following are illustrative examples of data proliferation.

Customer Data

It is common for multiple systems in an organization to maintain customer data. Such data is commonly out of sync between systems with no clear single source of truth. This can cause operational failures such as sending a bill to the wrong address.

Documents

Knowledge workers tend to create a lot of documents that get checked into a document management system. In many cases, such documents become completely unused with time but are retained as a precaution.

Communication

Communications such as emails can accumulate at a rate of hundreds per employee per day. Most communications lose their value almost immediately but are often retained for an extended period of time.

Backups

Backups of data, documents and communications often need to be retained in case something important was deleted from the source systems. If someone deletes a critical email, the only copy may be in a backup from a particular day last year. As such, backups are commonly stored for long periods of time. This can consume considerable resources despite the fact that backups are rarely used.

Transactional Data

Transactional data such as market trades and website purchases can grow extremely quickly. Transactional data is often viewed as valuable for historical research. For example, it is common to look at patterns in stock trades going back decades.

Social Data

Social data is data that people share on a public or private social network. It is often viewed as valuable for purposes such as market research and machine learning.

Sensors & Machines

Machines and sensors generate data automatically. Sensors have become cheap to the extent that they can be embedded in everyday objects in great numbers. Such data is generally less valuable than human-generated data. For example, video of a train tunnel or data from a tire pressure sensor isn’t interesting for long. Nevertheless, sensor data is potentially a gigantic source of data, far larger than all other sources combined.
