Site icon Tutorial

Data Management and Types

Data is raw fact, which is collected and analyzed to create information suitable for making decisions. Data is measured, collected and reported, and analyzed, to create information suitable for making decisions.

Types of data

Discrete and Continuous

Data are said to be discrete when they take on only a finite number of points that can be represented by the non-negative integers. An example of discrete data is the number of defects in a sample. Data are said to be continuous when they exist on an interval, or on several intervals. An example of continuous data is the measurement of pH. Quality methods exist based on probability functions for both discrete and continuous data.

Data could easily be presented as variables data like 10 scratches could be reported as total scratch length of 8.37 inches. The ultimate purpose for the data collection and the type of data are the most significant factors in the decision to collect attribute or variables data.

Cross-sectional and Time series data – Often financial analysts are interested in particular types of data such as time-series data or cross-sectional data.

Population and Sample Data

When we think of the term “population,” we usually think of people in our town, region, state or country and their respective characteristics such as gender, age, marital status, ethnic membership, religion and so forth. In statistics the term “population” takes on a slightly different meaning. The “population” in statistics includes all members of a defined group that we are studying or collecting information on for data driven decisions.

A part of the population is called a sample. It is a proportion of the population, a slice of it, a part of it and all its characteristics. A sample is a scientifically drawn group that actually possesses the same characteristics as the population – if it is drawn randomly.(This may be hard for you to believe, but it is true!)

A population includes all of the elements from a set of data. A sample consists of one or more observations from the population.

Converting Data Types – Continuous data, tend to be more precise due to decimal places but, need to be converted into discrete data. As continuous data contains more information than discrete data hence, during conversion to discrete data there is loss of information.

Discrete data cannot be converted to continuous data as instead of measuring how much deviation from a standard exists, the user may choose to retain the discrete data as it is easier to use. Converting variable data to attribute data may assist in a quicker assessment, but the risk is that information will be lost when the conversion is made.

Data Structuring – It refers to structuring of data elements and is classified as

Data collection methods

Data collection is based on crucial aspects of what to know, from whom to know and what to do with the data. Factors which ensure that data is relevant to the project includes

Few types of data collection methods includes

Data Management

Few important data management related terms are

Techniques for Assuring Data Accuracy and Integrity

Data integrity and accuracy have a crucial in the data collection process as they ensure the usefulness of data being collected. Data integrity determines whether the information being measured truly represents the desired attribute and data accuracy determines the degree to which individual or average measurements agree with an accepted standard or reference value.

Data integrity is doubtful if the data collected does not fulfill the purpose like data collected on finished good departure gathers data from truck departures but if the data is recorded on computing device present in the warehouse then integrity is doubtful. Similarly data accuracy is doubtful if the measurement device does not conforms to the laid down device standards.

Bad data can be avoided by following few precautions like avoiding emotional bias relative to tolerances, avoiding unnecessary rounding and screening data to detect and remove data entry errors.

Digital Data

With change and spread of technology, companies are moving towards digital marketing as consumers are moving towards e-commerce and mobile commerce. Availability of low cost internet access and devices has also spurned this shift amongst consumers. Digital data like html footprints that consumers leave behind when they visit a website or social media data, have significant value over these traditional tools of analytics in multiple ways. To begin with, by analyzing digital data you are ‘listening in’ to natural, honest conversations that are not limited. It isn’t a forced conversation. Second, the sample size is enormous. If you’re looking at 2000 consumers in a traditional survey, you’re talking about over 200,000 with digital data. Finally, the analysis is less expensive than traditional research, fast and therefore can be conducted multiple times in a year to answer different questions or hypotheses.

Big Data

Big data is an all-encompassing term for any collection of data sets so large and complex that it becomes difficult to process using on-hand data management tools or traditional data processing applications.

Big data is a large volume unstructured data which cannot be handled by standard database management systems like DBMS, RDBMS or ORDBMS. Big Data is very large, loosely structured data set that defies traditional storage. Few examples are as

In defining big data, it’s also important to understand the mix of unstructured and multi-structured data that comprises the volume of information.

Big Data is usually characterized by following “V” attributes

Big data can come from multiple sources, as

Exit mobile version