Site icon Tutorial

Data Management and Types

Go back to Tutorial

Data are characteristics or information, usually numerical, that are collected through observation to create information suitable for making decisions. Data is measured, collected and reported, and analyzed, to create information suitable for making decisions.

Types of data

Discrete and Continuous

Attribute or discrete data – It is based on counting like the number of processing errors, the count of customer complaints, etc. Discrete data values can only be non-negative integers such as 1, 2, 3, etc. It includes

Variable or continuous data – They are measured on a continuum or scale. Data values for continuous data can be any real number: 2, 3.4691, -14.21, etc. Continuous data can be recorded at many different points and are typically physical measurements like volume, length, size, width, time, temperature, cost, etc.

Data are said to be discrete when they take on only a finite number of points that can be represented by the non-negative integers. An example of discrete data is the number of defects in a sample.

Data could easily be presented as variables data like 10 scratches could be reported as total scratch length of 8.37 inches. The ultimate goal for the data collection and the type of data are the most significant factors in the decision to collect attribute or variables data.

Cross-sectional and Time series data – Mostly financial analysts are interested in particular types of data such as time-series data or cross-sectional data002E

Population and Sample Data

When it comes to the term “population,” we all usually think of people in our town, region, state or country. And their respective characteristics such as gender, age, marital status, ethnic membership, religion and so forth. While in statistics the term “population” takes on a slightly different meaning. The “population” in statistics comprises all members of a defined group that we are studying or collecting information on for data driven decisions.

A segment of the population is called a sample. It is a proportion of the population, a slice of it, a part of it and all its characteristics.

A population includes all of the elements from a set of data. A sample consists of one or more observations from the population.

Converting Data Types – Continuous data, tend to be more precise due to decimal places but, need to be converted into discrete data. As continuous data contains more information than discrete data hence, during conversion to discrete data there is loss of information.

Discrete data cannot be converted to continuous data as instead of measuring how much deviation from a standard exists, the user may choose to retain the discrete data as it is easier to use. Converting variable data to attribute data may assist in a quicker assessment, but the risk is that information will be lost when the conversion is made.

Data Structuring – It refers to structuring of data elements and is classified as

Data collection methods

Data collection is based on crucial aspects of what to know, from whom to know and what to do with the data. Factors which ensure that data is relevant to the project includes

Few types of data collection methods include:

Data Management

Few important data management related terms are

Techniques for Assuring Data Accuracy and Integrity

Data integrity and accuracy have a crucial in the data collection process as they ensure the usefulness of data being collected. Data integrity determines whether the information being measured truly represents the desired attribute and data accuracy determines the degree to which individual or average measurements agree with an accepted standard or reference value.

Data integrity is doubtful if the data collected does not fulfill the purpose like data collected on finished good departure gathers data from truck departures but if the data is recorded on computing device present in the warehouse then integrity is doubtful. Similarly, data accuracy is doubtful if the measurement device does not conforms to the laid down device standards.

By following few precautions like avoiding emotional bias relative to tolerances, avoiding unnecessary rounding and screening data to detect and remove data entry errors bad data can be avoided.

Digital Data

With change and spread of technology, companies are moving towards digital marketing as consumers are moving towards e-commerce and mobile commerce. Availability of low-cost internet access and devices has also spurned this shift amongst consumers. Digital data like html footprints that consumers leave behind when they visit a website or social media data, have significant value over these traditional tools of analytics in multiple ways.

Big Data

Big data is a circumscribing term for any collection of data sets so large and complex that it becomes difficult to process using on-hand data management tools or traditional data processing applications.

Big data is a large volume unstructured data which can’t be handled by standard database management systems like DBMS, RDBMS or ORDBMS. Big Data is very large, loosely structured data set that defies traditional storage. Few examples are as

In defining big data, it’s also important to understand the mix of unstructured and multi-structured data that comprises the volume of information.

Big Data is usually characterized by following “V” attributes

Big data can come from multiple sources, as

Certified Inventory and Warehouse Analytics Professional

Go back to Tutorial

Exit mobile version