5.1.1 Advantages and Disadvantages of Representation of Data
In this topic we will learn how to:
- select a suitable way of presenting raw data, and discuss the advantages and/or disadvantages that particular representations may have
We will learn how to represent data using stem-and-leaf diagrams, box-and-whisker plots, histograms and cumulative frequency graphs. We will also learn to calculate measures of central tendency and variation. We will then explore the advantages and disadvantages of particular representations and statistical measures.
\textbf{\large\textcolor{gray}{Stem and Leaf Diagram}}
\textbf{\textcolor{gray}{Advantages}}
- It shows all of the original data
- It shows the shape of the distribution, i.e. skew
- The mode, median and quartiles can be found from the diagram
- It is useful for comparing two sets of data
\textbf{\textcolor{gray}{Disadvantages}}
- It is not suitable for large amounts of data
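To illustrate how a stem-and-leaf diagram keeps every original value, here is a minimal Python sketch. The heights are hypothetical illustrative data, and the helper name `stem_and_leaf` is an assumption for this example, not something taken from the notes. It assumes non-negative whole numbers with the last digit used as the leaf.

```python
from collections import defaultdict

def stem_and_leaf(values):
    """Group non-negative integers by stem (all digits except the last)."""
    rows = defaultdict(list)
    for v in sorted(values):
        stem, leaf = divmod(v, 10)  # e.g. 171 -> stem 17, leaf 1
        rows[stem].append(leaf)
    return dict(rows)

# Hypothetical heights in cm (illustrative only)
heights = [178, 165, 182, 171, 169, 175, 180, 171, 186, 174, 177]

diagram = stem_and_leaf(heights)
for stem in range(min(diagram), max(diagram) + 1):
    leaves = " ".join(str(leaf) for leaf in diagram.get(stem, []))
    print(f"{stem:>3} | {leaves}")
print("Key: 17 | 1 means 171 cm")
```

Because each leaf is one data value, the mode, median and quartiles can be read directly from the rows, but the diagram grows quickly as the data set gets larger.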
\textbf{\large\textcolor{gray}{Box and Whisker Plot}}
\textbf{\textcolor{gray}{Advantages}}
- It is easy to see whether the distribution is symmetrical or whether there is a tail to the left or right
- It can be used to investigate extreme values (outliers)
- It is easy to see the range and interquartile range
- You can compare two or more sets of data by drawing on the same diagram
\textbf{\textcolor{gray}{Disadvantages}}
- It does not show frequencies
- It only shows particular values of the data
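The five values that a box-and-whisker plot displays can be sketched in Python as follows. The data is hypothetical. Two conventions here are assumptions rather than anything stated in the notes: quartiles are taken as the medians of the lower and upper halves of the ranked data, and a value is flagged as an outlier if it lies more than 1.5 times the interquartile range beyond a quartile.

```python
from statistics import median

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) using the median-of-halves rule."""
    data = sorted(values)
    n = len(data)
    half = n // 2
    lower = data[:half]          # values below the median position
    upper = data[half + n % 2:]  # values above the median position
    return data[0], median(lower), median(data), median(upper), data[-1]

# Hypothetical test scores (illustrative only)
scores = [12, 15, 16, 18, 19, 20, 21, 23, 24, 26, 60]

low, q1, med, q3, high = five_number_summary(scores)
iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [x for x in scores if x < lower_fence or x > upper_fence]

print("min, Q1, median, Q3, max:", low, q1, med, q3, high)
print("range:", high - low, "IQR:", iqr)
print("outliers:", outliers)
```

Only these five values (plus any outliers) appear on the plot, which is why it shows neither the frequencies nor the rest of the data.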
\textbf{\large\textcolor{gray}{Histogram}}
\textbf{\textcolor{gray}{Advantages}}
- It can represent groups of different widths
- It shows whether the distribution is symmetrical or skewed
- The mean and standard deviation can be estimated from the histogram
\textbf{\textcolor{gray}{Disadvantages}}
- The visual impact can be altered by using different scales
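As a rough sketch of how a histogram handles groups of different widths, and how the mean and standard deviation can be estimated from it, consider the Python below. The grouped data is hypothetical, and the estimates assume every value in a class sits at the class midpoint. The standard deviation is computed by dividing by n, which is an assumption about the convention in use.

```python
from math import sqrt

# Hypothetical grouped data: (lower boundary, upper boundary, frequency)
classes = [(0, 10, 2), (10, 20, 8), (20, 40, 8), (40, 80, 2)]

# Bar height for unequal class widths: frequency density = frequency / class width
densities = [f / (hi - lo) for lo, hi, f in classes]

# Estimates based on class midpoints
total = sum(f for _, _, f in classes)
midpoints = [(lo + hi) / 2 for lo, hi, _ in classes]
freqs = [f for _, _, f in classes]

est_mean = sum(m * f for m, f in zip(midpoints, freqs)) / total
est_var = sum(f * m ** 2 for m, f in zip(midpoints, freqs)) / total - est_mean ** 2
est_sd = sqrt(est_var)

print("frequency densities:", densities)
print("estimated mean:", est_mean)
print("estimated standard deviation:", round(est_sd, 2))
```

These are estimates only, because the original values inside each class are not recoverable from the histogram.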
\textbf{\large\textcolor{gray}{Cumulative Frequency Graph}}
\textbf{\textcolor{gray}{Advantages}}
- The median and quartiles can be estimated from the graph
- Sets of data can be compared by drawing graphs on the same diagram
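Reading the median and quartiles off a cumulative frequency graph can be imitated in code by linear interpolation between plotted points. This is a minimal sketch under the assumption that the points are joined by straight lines. The table values and the function name `estimate_from_cumulative` are hypothetical.

```python
def estimate_from_cumulative(points, target):
    """Interpolate the data value at which cumulative frequency reaches target.

    points: (upper class boundary, cumulative frequency) pairs in increasing order.
    """
    for (x0, cf0), (x1, cf1) in zip(points, points[1:]):
        if cf0 <= target <= cf1 and cf1 > cf0:
            return x0 + (target - cf0) * (x1 - x0) / (cf1 - cf0)
    raise ValueError("target lies outside the cumulative frequency range")

# Hypothetical cumulative frequency table (illustrative only)
points = [(0, 0), (10, 2), (20, 10), (40, 18), (80, 20)]
n = points[-1][1]  # total frequency

median_est = estimate_from_cumulative(points, n / 2)
q1_est = estimate_from_cumulative(points, n / 4)
q3_est = estimate_from_cumulative(points, 3 * n / 4)

print("median:", median_est)
print("lower quartile:", q1_est, "upper quartile:", q3_est)
print("interquartile range:", q3_est - q1_est)
```

If the graph is drawn as a smooth curve instead, values read from it may differ slightly from these straight-line estimates.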
\textbf{\Large\textcolor{gray}{Measures of Central Tendency}}
\textbf{\large\textcolor{gray}{Mean}}
\textbf{\textcolor{gray}{Advantages}}
- It is calculated using all the data so it represents all the items
- It is calculated using a mathematical formula so calculators can be programmed to find it
- It is extremely useful for further analysis
\textbf{\textcolor{gray}{Disadvantages}}
- It can be unduly affected by one or two extreme values
\textbf{\large\textcolor{gray}{Mode}}
\textbf{\textcolor{gray}{Advantages}}
- Useful when the most popular category is required, e.g. clothes or shoe sizes
\textbf{\textcolor{gray}{Disadvantages}}
- Not very useful for small data sets, or when there are more than two modes
- There may not be a mode
- It may not be representative, e.g. it could be the lowest value
- Modal class depends on the grouping of the data
- It is not useful for further analysis
\textbf{\large\textcolor{gray}{Median}}
\textbf{\textcolor{gray}{Advantages}}
- It is not affected by extreme values
- It can be found as soon as a middle value is known
\textbf{\textcolor{gray}{Disadvantages}}
- It does not use the whole data set
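The different behaviour of the three averages when an extreme value is present can be checked with a short Python sketch. The estimates below are hypothetical illustrative data, with one deliberately extreme value at the end.

```python
from statistics import mean, median, multimode

# Hypothetical estimates in metres; the last value is deliberately extreme
estimates = [4, 5, 5, 6, 6, 6, 7, 8, 52]
without_extreme = estimates[:-1]

# The mean uses every value, so the extreme value pulls it upward
print("mean with / without extreme value:",
      mean(estimates), mean(without_extreme))

# The median depends only on the middle of the ranked data
print("median with / without extreme value:",
      median(estimates), median(without_extreme))

# multimode returns every value that occurs most often
print("mode(s):", multimode(estimates))
```

Here the mean nearly doubles because of a single value, while the median and mode are unchanged.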
\textbf{\Large\textcolor{gray}{Variation}}
\textbf{\large\textcolor{gray}{Range}}
\textbf{\textcolor{gray}{Advantages}}
- It is easy to calculate
- It represents the complete spread of the data
\textbf{\textcolor{gray}{Disadvantages}}
- It is affected by extreme values
\textbf{\large\textcolor{gray}{Interquartile Range}}
\textbf{\textcolor{gray}{Advantages}}
- It is not unduly influenced by extreme values
- It can be used to investigate extreme values
\textbf{\textcolor{gray}{Disadvantages}}
- It depends only on particular values when the data is ranked
\textbf{\large\textcolor{gray}{Standard Deviation}}
\textbf{\textcolor{gray}{Advantages}}
- It is calculated using all the data and so represents every item
- It is very useful for further analysis
- It is useful in comparing two sets of data, for example by showing which is more consistent
\textbf{\textcolor{gray}{Disadvantages}}
- For a single set of data its value is difficult to interpret
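A minimal Python sketch of comparing two data sets for consistency using the range and the standard deviation follows. Both score lists are hypothetical, and `pstdev` (which divides by n) is used on the assumption that the data is treated as a whole population rather than a sample.

```python
from statistics import mean, pstdev

def data_range(values):
    """Range = largest value minus smallest value."""
    return max(values) - min(values)

# Hypothetical scores for two players with the same mean (illustrative only)
player_a = [48, 50, 51, 49, 52]
player_b = [30, 70, 45, 60, 45]

for name, scores in (("A", player_a), ("B", player_b)):
    print(f"Player {name}: mean={mean(scores)}, "
          f"range={data_range(scores)}, sd={pstdev(scores):.2f}")

# The smaller standard deviation indicates the more consistent player
more_consistent = "A" if pstdev(player_a) < pstdev(player_b) else "B"
print("More consistent:", more_consistent)
```

Taken alone, either standard deviation is hard to interpret. It is the comparison between the two that shows which player is more consistent.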
Let’s look at some past paper questions.
1. Twenty children were asked to estimate the height of a particular tree. Their estimates, in metres, were as follows. (9709/53/M/J/22 number 2)
It is given that the mean is 6.17 and the median is 5.45. Give a reason why the median is likely to be more suitable than the mean as a measure of the central tendency for this information.
The data contains a value, 19.4, that appears anomalous (it does not follow the trend), so the mean will be inflated by this value. However, this extreme value has no effect on the median.
2. Twelve tourists were asked to estimate the height, in metres, of a new building. Their estimates were as follows. (9709/62/O/N/19 number 1)
Give a disadvantage of using the mean as a measure of central tendency in this case.
The mean will be unduly affected by the extreme value, 110.
3. The heights, in cm, of the 11 basketball players in each of two clubs, the Amazons and the Giants, are shown below. (9709/52/M/J/21 number 7)
State an advantage of using a stem-and-leaf diagram compared to a box-and-whisker plot to illustrate this information.
The stem-and-leaf diagram includes all the data, whereas the box-and-whisker plot does not.
Advantages and disadvantages of Data Visualization
Data visualization is the transformation of raw data tables into graphical depictions that tell a story. Choosing what data to share, and how to share it, are the two principal decisions in making a visualization.
Data visualization can take many forms. Most visualizations are graphs, charts, plots, and other kinds of graphical explanation, but data visualization does not end there. Maps, images, and bubble charts are further kinds of data visualization. Any time you see a map with countries highlighted for emphasis, you are looking at a data visualization.
The use of interactive tools is regarded as the highest form of data visualization. In most cases, this simply means the use of filters within standard visualizations. For example, imagine you have a bar chart that shows the fertility rates of the three wealthiest countries in North America. An interactive data visualization might include a drop-down menu so the user can switch to another continent. If she selects Europe, we would see the fertility rates in Germany, France, and Italy.
We must not forget that the story is crucial. A data visualization without a message behind it is not a data visualization at all. Data visualization is an essential tool for decision-makers in every business sector and at every size. Whether you are a startup or a global corporation, data visualization is essential for capturing key information, supporting decision-making, completing competitive analyses, planning, and drawing insights.
Advantages of Data Visualization:
- Better understanding – In business, we often need to compare the performance of two components or two scenarios. The traditional approach is to go through the bulky data for both situations and then analyze it, which takes a great deal of time.
- A better method – Data visualization solves this difficulty by putting the data for both scenarios into pictorial form, which gives a better understanding of the situations. For example, Google Trends helps us understand data related to top searches or queries in pictorial or graphical form.
- Easy sharing of information – With visualization of data, companies gain a new mode of communication. Instead of sharing bulky data, sharing visual information engages the audience and conveys the information in a more digestible form.
- Accurate analysis – With the help of data visualization, trends become easier to see and better inferences can be drawn from the data, giving organizations an edge over their competitors.
- Sales analysis – With the help of data visualization, a salesperson can easily understand the sales graph of products. With data visualization tools such as heat maps, they can understand the causes that are pushing sales numbers up as well as the reasons that are dragging them down. Data visualization helps in understanding the trends and also other factors such as the types of customers interested in buying, repeat customers, the influence of geography, and so on.
- Finding relations between events – A business is affected by many factors. Finding a correlation between these factors or events helps decision-makers understand the issues related to their business. For example, the e-commerce market is nothing new today. During certain festive seasons such as Christmas or Thanksgiving, the graphs of online businesses go up. So if an online company is doing an average of $1 million of business in a particular quarter and the business rises in the next, it can quickly find the events corresponding to the rise.
- Modification of data – A strong point of data visualization is that the data on which the visual presentation is based can be changed or modified, giving business people a way to communicate better with their audience.
- Exploring opportunities and trends – With the huge volumes of data available, business leaders can explore the data in depth for the trends and opportunities around them. Using data visualization, analysts can find patterns in the behavior of their customers, paving the way to explore trends and opportunities for the business.
- Geographical visualization – One of the strong points of data visualization is geographical visualization, in which analysts have the advantage of location information providing data for day-to-day analysis.
Disadvantages of Data Visualization:
- It gives an estimate, not accuracy – While the data itself may be accurate in predicting situations, its visualization only gives an estimate. It is certainly easy to convert bulky, lengthy data into a simple pictorial format, but such a representation of information may sometimes lead to speculative conclusions.
- Biased – The basic setup of a data visualization involves a human, which means the data underlying the visualization can be biased. The person gathering the data may consider only the part of the data they see as important, or the part that needs focus, and exclude the rest, which may lead to biased results.
- Lack of assistance – One of the drawbacks of data visualization is that it cannot assist its audience, which means that different groups in the audience may interpret it differently.
- Improper design issue – If data visualization is regarded as a form of communication, then it must be clear in explaining its purpose. If the design is not appropriate, this can lead to confusion in communication.
- Wrongly focused people can skip core messages – One of the problems with data visualization is that, although it may be explanatory, its clarity of explanation depends entirely on the focus of its audience.