Data Proliferation

Data Proliferation

Data Proliferation Jonathan Poland

Data proliferation refers to the rapid growth of data, often resulting in a large amount of replicated and low-quality data. This can be costly to manage and may pose compliance and operational risks to an organization. While it may be necessary to analyze this data in order to understand its structure, sources, and uses, it may ultimately have little value to the organization and can be difficult to discard. The following are illustrative examples of data proliferation.

Customer Data

It is common for multiple systems in an organization to maintain customer data. Such data is commonly out of sync between systems with no clear single source of truth. This can cause operational failures such as sending a bill to the wrong address.

Documents

Knowledge workers tend to create a lot of documents that get checked into a document management system. In many cases, such documents become completely unused with time but are retained as a precaution.

Communication

Communications such as emails can gather at the rate of hundreds per employee per day. Most communications lose their value almost immediately but often are retained for an extended period of time.

Backups

Backups of data, documents and communications often need to be retained in case something important was deleted from the source systems. If someone deletes a critical email, the only copy may be in a backup from a particular day last year. As such, backups are commonly stored for long periods of time. This can consume considerable resources despite the fact that backups are rarely used.

Transactional Data

Transactional data such as market trades and website purchases can grow extremely quickly. Transactional data is often viewed as valuable for historical research. For example, it is common to look at patterns in stock trades going back decades.

Social Data

Data that is shared by people on a public or private social network. Often viewed as valuable for purposes such as market research and machine learning.

Sensors & Machines

Machine and sensor generated data. Sensors have become cheap to the extent than they can be embedded in everyday objects in great numbers. Such data may be generally less valuable than human generated data. For example, video of a train tunnel or data from a tire pressure sensor isn’t interesting for long. Nevertheless, sensor data potentially represents a gigantic source of data that is far larger than all other sources combined.

Proof of Concept Jonathan Poland

Proof of Concept

A proof of concept (POC) is a demonstration that a certain idea or solution is feasible and likely to be…

Asset Based Lending Jonathan Poland

Asset Based Lending

Asset-based lending (ABL) is a type of business financing in which a loan or line of credit is secured by…

Capital Jonathan Poland

Capital

Capital is an asset that is expected to produce future economic value. It is a productive resource that is used…

Relationship marketing Jonathan Poland

Relationship marketing

Relationship marketing is a type of marketing that focuses on building long-term, mutually beneficial relationships with customers, rather than just…

Market Saturation Jonathan Poland

Market Saturation

Market saturation refers to a state in which a particular market is filled with a high number of similar products…

Needs Identification Jonathan Poland

Needs Identification

Needs identification is the process of discovering and understanding a customer’s needs, constraints, pain points, and motivations. This is a…

Building Trust Jonathan Poland

Building Trust

To build trust, it is necessary to engage in ongoing behavior that helps people trust you. In general, people tend…

What is a Persona? Jonathan Poland

What is a Persona?

Personas are fictional characters that businesses use to represent and model the characteristics, goals, needs, behaviors, and emotions of their…

Product Risk Jonathan Poland

Product Risk

Product risk refers to the potential for negative consequences that may result from the development, production, or use of a…

Learn More

Sales Goals Jonathan Poland

Sales Goals

Sales goals are targets for the revenue or units sold that a sales team or individual is expected to achieve…

Environmental Challenges Jonathan Poland

Environmental Challenges

Environmental issues are detrimental changes to the Earth’s natural surroundings that negatively impact the current quality of life for individuals…

User Story Jonathan Poland

User Story

A user story is a concise description of a specific expectation or need that a user has for a product,…

Baxter Jonathan Poland

Baxter

Baxter International Inc. is a global healthcare company that develops and manufactures medical products and services for a wide range…

Cross Merchandising Jonathan Poland

Cross Merchandising

Cross merchandising is a retail strategy that involves placing related or complementary products in close proximity to each other in…

Security Controls Jonathan Poland

Security Controls

IT security controls are measures that are implemented in order to reduce security risks. These controls may be identified through…

SWOT Analysis 101 Jonathan Poland

SWOT Analysis 101

SWOT analysis is a tool that is used to evaluate the strengths, weaknesses, opportunities, and threats of a business or…

Brand Engagement Jonathan Poland

Brand Engagement

Brand engagement refers to the interaction between a customer and a brand, and can be used as a way to…

Attention Economics Jonathan Poland

Attention Economics

Attention economics is a field of study that focuses on the value of human attention as a limited and highly…