Data Proliferation

By Jonathan Poland

Data proliferation refers to the rapid growth of data, often resulting in a large amount of replicated and low-quality data. This can be costly to manage and may pose compliance and operational risks to an organization. An organization may need to analyze this data to understand its structure, sources, and uses, yet much of it may ultimately have little value and still be difficult to discard. The following are illustrative examples of data proliferation.

Customer Data

It is common for multiple systems in an organization to maintain customer data. Such data is commonly out of sync between systems with no clear single source of truth. This can cause operational failures such as sending a bill to the wrong address.
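As a rough illustration of how such drift might be detected, the sketch below compares customer records held by two systems and reports fields that disagree. The system names (`crm`, `billing`), the record layout, and the `find_mismatches` helper are all hypothetical, chosen only for this example; real systems would need to agree on a shared customer identifier first.

```python
# Hypothetical customer records keyed by customer ID.
# System names and fields are illustrative assumptions, not from any real system.
crm = {
    "C001": {"name": "Ana Diaz", "address": "12 Oak St"},
    "C002": {"name": "Ben Lee", "address": "4 Elm Ave"},
}
billing = {
    "C001": {"name": "Ana Diaz", "address": "98 Pine Rd"},
    "C003": {"name": "Cy Wong", "address": "7 Birch Ln"},
}


def find_mismatches(a, b):
    """Return {customer_id: {field: (value_in_a, value_in_b)}} for shared IDs that disagree."""
    mismatches = {}
    for cid in a.keys() & b.keys():  # customers present in both systems
        diffs = {
            field: (a[cid].get(field), b[cid].get(field))
            for field in a[cid].keys() | b[cid].keys()
            if a[cid].get(field) != b[cid].get(field)
        }
        if diffs:
            mismatches[cid] = diffs
    return mismatches


mismatches = find_mismatches(crm, billing)
# Customers that exist in only one of the two systems.
only_in_one = sorted(crm.keys() ^ billing.keys())

print(mismatches)
print(only_in_one)
```

A report like this only shows where the systems disagree. Deciding which value is correct still requires designating one system as the source of truth for each field.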

Documents

Knowledge workers tend to create a lot of documents that get checked into a document management system. In many cases, such documents become completely unused with time but are retained as a precaution.

Communication

Communications such as emails can accumulate at a rate of hundreds per employee per day. Most communications lose their value almost immediately but are often retained for an extended period of time.

Backups

Backups of data, documents and communications often need to be retained in case something important was deleted from the source systems. If someone deletes a critical email, the only copy may be in a backup from a particular day last year. As such, backups are commonly stored for long periods of time. This can consume considerable resources despite the fact that backups are rarely used.

Transactional Data

Transactional data such as market trades and website purchases can grow extremely quickly. Transactional data is often viewed as valuable for historical research. For example, it is common to look at patterns in stock trades going back decades.

Social Data

Social data is data that people share on a public or private social network. It is often viewed as valuable for purposes such as market research and machine learning.

Sensors & Machines

Sensors and machines generate data automatically. Sensors have become cheap to the extent that they can be embedded in everyday objects in great numbers. Such data may be generally less valuable than human-generated data. For example, video of a train tunnel or data from a tire pressure sensor isn’t interesting for long. Nevertheless, sensor data potentially represents a gigantic source of data that is far larger than all other sources combined.
