Topic: Application of statistics in geography

STATISTICS

Statistics is a branch of knowledge that deals with every aspect of data. Statistical knowledge helps in choosing the proper method of collecting data and in applying the correct analysis to the samples collected, so that results are produced effectively.

Statistics refers to the scientific and systematic methods of collecting, recording, summarizing, analyzing and presenting numerical data in a precise manner.

Or

The study of methods of collecting, recording, summarizing, analyzing and presenting data in a precise manner by using numbers.

Or

A science of observing, collecting, recording, summarizing, analyzing and presenting data in a precise manner by using numbers.

NATURE OF DATA

Statistical data according to their varied nature

Statistical data according to their varied nature include the following:

Discrete data
A form of statistical data for variables whose values are expressed in whole numbers, i.e. data for cases which do not exist in fractions. For instance, the number of people can be given as 102 people, which cannot be divided into decimals or fractions.

Continuous data
Data for variables whose values can be expressed in fractions or decimals. In this type of data, any value within the range can be given. Examples are data for temperature, rainfall, pressure, distance and growth rate.

Individual data
A set of data which provides a specific value for every item in a given sample. For instance, Juma has a weight of 47 kg. Every item is treated as an important entity and presented singly.

Grouped data
A form of data which gives values in ranges or classes. This type of data is not precise, because exact figures are not quoted; the values are given in groups.
The classic example of grouped data is population distribution by age and sex, which may appear as follows:

| AGE | FEMALES | MALES |
|---|---|---|
| 0 – 9 | 14,897 | 14,567 |
| 10 – 19 | 15,432 | 14,329 |
| 20 – 29 | 17,987 | 13,098 |
| 30 – 39 | 16,876 | 17,654 |

Statistical data according to scale of measurements

This aspect concerns how the values of statistical data are given. The scales of measurement include the following.

Nominal data
Data whose values are given according to the names of the items in a given sample, e.g. 10 apples, 5 oranges, 7 mangoes, 5 bananas and 2 cherries.

Ordinal data
Data whose values are given in order of magnitude, so that the numbers indicate the rank order among objects, i.e. the values are commonly given in ascending or descending order, e.g. 91, 82, 79, 74, 68, 67, 58, 54 and 49.

Interval data
Data whose values are grouped into ranges at regular intervals, e.g. the data for population distribution by age and sex expressed on an interval scale.

Ratio data
Data whose values show the number of times one item has relative to another, e.g. 1:3, 2:5, 3:7, etc.

VARIABLES

A variable is an attribute whose values fluctuate under given conditions. For instance, production is a variable because its values change under conditions like policies, climate, technology and marketability.

Variables are classified into dependent and independent variables.

Dependent variable
A dependent variable is one whose values fluctuate due to the force of another variable, i.e. the variable whose values change irregularly as controlled by another variable.
For instance, production is one of the most pronounced dependent variables, as it changes due to the force of other variables like climate, the level of technology applied and the demand for the products produced.

CLASSIFICATION OF STATISTICS

Statistics, being the scientific and systematic methods dealing with numerical facts, is broadly categorized into two depending on how the data are handled: descriptive and inferential statistics.

1. Descriptive statistics
Descriptive statistics deals with the recording, summarizing, analyzing and presentation of numerical facts that have actually been collected, for example population data collected by conducting a census.

2. Inferential statistics
Inferential statistics deals with the recording, summarizing, analyzing and presentation of numerical facts that are handled by quantifying uncertainties through prediction, e.g. the likely harvest output in the next year or season.

STATISTICAL DATA

As already pointed out, statistical data are the exact numerical facts or figures collected systematically and arranged for a certain purpose, or a body of information which is usually treated in numerical values.

Statistical data are extremely varied and are therefore of different types. They are categorized according to their sources, their varied nature and their scale of measurement.

Statistical data according to their varied sources

By source, data are classified into primary and secondary data.

Primary data
These are numerical facts collected from the field or handled for the first time, i.e. first-hand or original information. Such data are not available in existing sources like books.
Primary statistical data are collected by the techniques of interview, questionnaire, observation, counting, measurement and other methods.

Secondary data
These are numerical facts derived from stored sources. The data were compiled by other people who carried out research. The sources of this type of data include textbooks, reference books, magazines, maps, video tapes, audio tapes and similar sources.

Independent variable
An independent variable is one whose values change on their own without being influenced by another variable, i.e. the variable whose values change steadily and regularly, e.g. distance.

SOURCES OF STATISTICAL DATA

The sources of statistical data are simply the techniques employed to gather the numerical facts. These are broadly two: primary and secondary sources. The techniques (sources) providing statistical data include the following, of which the first four are primary and the last is secondary:

1. Interview method
2. Questionnaire method
3. Field observation method
4. Scheduling method
5. Literature review method

1. Interview method
The interview technique involves the collection of data through questions asked verbally by a researcher to a respondent.

Or

It is a verbal interaction between an interviewer and an interviewee designed to elicit the information, news, opinions and feelings the interviewee holds. Generally, an interview is an oral set of questions asked to respondents by a researcher.

2. Questionnaire method
A questionnaire is a set of research questions printed on paper and presented to respondents, who reply to the questions in writing. The questionnaire method is thus a means of gathering statistical details by using questionnaires given to respondents to answer.

3. Field observation method
It is a method of gathering primary research data in which the researcher observes the phenomena directly.
It is of two types: participant and non-participant observation.

4. Scheduling method
This method of data collection is very similar to the questionnaire, with one difference: a schedule is a prepared set of questions filled in by enumerators who are specially appointed for the purpose, carefully selected and trained to perform the job well. This method of data collection is very useful for carrying out a population census.

The secondary sources providing statistical data include:

5. Literature review method
It is a systematic survey of past documentary sources prepared by other researchers and related to the study. The documentary sources include textbooks, statistical abstracts, census reports, research articles, journals, newspapers and official reports.

Other methods of data collection include measurement, counting and carrying out experiments.

Strengths of statistics application in Geography

The application of statistics in geography is significant in the following ways.

1. It summarizes massive information and makes it simpler, enabling geographers to handle large sets of data.
2. It makes data computation techniques possible in geography.
3. It makes data comparison easy, since variables cannot be compared without statistics about them.
4. It facilitates drawing relationships between geographical variables like climate and production, population and time, or rainfall and temperature.
5. It makes data storage easy in the form of numbers, tables, graphs, diagrams and maps.
6. It makes geographical data clearly understood and easy to analyze and interpret.
7. It enhances the testing of the validity of geographical models, theories and concepts against real world situations.

STATISTICAL MEASURES

The numerical values which make up statistics are analyzed to judge their implications by means of statistical measures. Statistical measures are thus computed numerical values used to analyze data in relation to the other values in a given data set.

Statistical measures are numerous, but with regard to their nature and roles they are broadly divided into the following categories.

1. Measures of central tendency
2. Measures of variability

MEASURES OF CENTRAL TENDENCY

These are the measures which show the central values. They include the arithmetic mean, mode and median.

A. ARITHMETIC MEAN

The arithmetic mean is the average of all values in a distribution. It is determined by adding up all the values and dividing by the number of observations. The arithmetic mean is used to assess whether the values in a distribution are high or low.

Computation of the arithmetic mean

Computation of the arithmetic mean depends on the nature of the data given, whether ungrouped or grouped.

For an ungrouped data set, the arithmetic mean is computed by applying the following formula:

$$\bar{X} = \frac{\sum X}{N}$$

Where:
$\sum X$ = the sum of all values
N = the total number of observations

Example:
Find the arithmetic mean for the following set of data: 5, 7, 10, 12, 13, 14, 15, 7 and 2.

Solution
5 + 7 + 10 + 12 + 13 + 14 + 15 + 7 + 2 = 85
N = 9
Thus, the arithmetic mean = 85 ÷ 9 = 9.4

For a grouped data set, the arithmetic mean is calculated by the following formula:

$$\bar{X} = \frac{\sum fX}{\sum f}$$

Where:
X = class mark
f = frequency

Example:
Find the arithmetic mean for the following scores of marks.

| Class interval | f | X | fX |
|---|---|---|---|
| 91 – 95 | 0 | 93 | 0 |
| 86 – 90 | 1 | 88 | 88 |
| 81 – 85 | 6 | 83 | 498 |
| 76 – 80 | 10 | 78 | 780 |
| 71 – 75 | 15 | 73 | 1095 |
| 66 – 70 | 34 | 68 | 2312 |
| 61 – 65 | 22 | 63 | 1386 |
| 56 – 60 | 10 | 58 | 580 |
| 51 – 55 | 2 | 53 | 106 |

Solution
According to the given data:
$\sum fX$ = 6845
$\sum f$ = 100
Thus, the arithmetic mean = 6845 ÷ 100 = 68.45

Advantages of the arithmetic mean
1. It is easy to calculate and most people understand it.
2. It is used to check whether values are high or low.
3. It can be used for further calculation. For instance, the arithmetic mean is used to calculate the standard deviation.

Disadvantages of the arithmetic mean
1. It has the weakness of being pulled towards outliers (extreme scores).
2. Calculating it for a grouped data set needs more mathematical knowledge.

B. MODE

The mode is the value which occurs most frequently in a given data set.

Or

It is the most commonly attained measurement value in a data set.

Or

It is the measurement value that appears most in a particular variable among a sample of subjects.

The mode helps us to know where values are concentrated, which can stimulate scientific investigation.

Calculation of a mode

Determination of a mode depends on the nature of the data set, whether ungrouped or grouped.

For an ungrouped data set, the mode is obtained by taking the value that appears most frequently, i.e. the one with a higher frequency than the rest.

Example:
Determine the mode for the following data set: 2, 4, 2, 2, 5, 6, 4

| Value | Concentration |
|---|---|
| 2 | 3 |
| 4 | 2 |
| 5 | 1 |
| 6 | 1 |

Thus, the mode for the given data set = 2

Note
Sometimes a given data set may have more than one mode or no mode at all. A distribution with a single mode is known as unimodal or monomodal.
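The arithmetic mean and mode calculations above can be checked with a short Python sketch. The function names are illustrative and not from the source, and treating a data set in which no value repeats as having no mode is an assumption about the convention being used.

```python
from collections import Counter


def arithmetic_mean(values):
    """Ungrouped mean: sum of the values divided by the number of values."""
    return sum(values) / len(values)


def grouped_mean(class_marks, frequencies):
    """Grouped mean: sum of f*X divided by the sum of f."""
    total_fx = sum(f * x for x, f in zip(class_marks, frequencies))
    return total_fx / sum(frequencies)


def modes(values):
    """Return every most frequent value, sorted.

    Assumption: if no value occurs more than once, the data set is
    treated as having no mode and an empty list is returned.
    """
    counts = Counter(values)
    highest = max(counts.values())
    if highest == 1:
        return []
    return sorted(v for v, c in counts.items() if c == highest)


# Worked data from the examples above
ungrouped = [5, 7, 10, 12, 13, 14, 15, 7, 2]
marks = [93, 88, 83, 78, 73, 68, 63, 58, 53]   # class marks X
freqs = [0, 1, 6, 10, 15, 34, 22, 10, 2]       # frequencies f

mean_ungrouped = arithmetic_mean(ungrouped)    # 85 / 9
mean_grouped = grouped_mean(marks, freqs)      # 6845 / 100
mode_example = modes([2, 4, 2, 2, 5, 6, 4])
```

With the worked data, `mean_ungrouped` is about 9.4, `mean_grouped` is 68.45 and `mode_example` is `[2]`. Returning a list rather than a single number lets the same function report one mode, several modes, or none.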
A distribution with two modes is described as bimodal.

Example:
(1) 2, 5, 4, 3, 5, 6, 6, 8, 5, 6
The modes for the data set are 5 and 6.
(2) 4, 9, 8, 5, 6, 7
The given data set has no mode.

For grouped data, the mode is calculated by the following formula:

$$\text{Mode} = L + \left(\frac{t_1}{t_1 + t_2}\right) \times i$$

Where:
L = the lower limit of the modal class
$t_1$ = the excess of the modal frequency over the frequency of the next lower class
$t_2$ = the excess of the modal frequency over the frequency of the next higher class
i = the class interval

Example:
The table below shows the scores of marks in a geography test for Form V students.

| Class interval | Frequency |
|---|---|
| 40 – 44 | 7 |
| 45 – 49 | 8 |
| 50 – 54 | 11 |
| 55 – 59 | 10 |
| 60 – 64 | 4 |

Solution
According to the given data set:
L = 49.5
$t_1$ = 11 - 8 = 3
$t_2$ = 11 - 10 = 1
i = 5
Then:
Mode = 49.5 + (3 ÷ 4) × 5
= 49.5 + (0.75 × 5)
= 49.5 + 3.75 = 53.25
Thus, the mode = 53.25

Advantages of a mode
1. It helps to determine the predominance of a certain geographical feature in a place.
2. It helps to know the number of occurrences of the values in a data set.

Disadvantages of a mode
1. Calculating the mode for a grouped data set needs more mathematical knowledge.
2. It is an unreliable measure of central tendency, as a data set may have more than one mode or no mode at all.

C. MEDIAN

The median is the point value that divides the values in a distribution into two equal parts after they have been arranged in ascending or descending order.

Computation of the median

The computation of the median depends chiefly on the nature of the data set given, whether ungrouped or grouped.

For an ungrouped data set, the calculation also takes into account whether the number of values is odd or even.

If the number of values is odd, the median is simply the middle value once the values have been arranged in ascending or descending order.

E.g. 1, 2, 1, 4, 6, 5, 3

Solution
The ascending order of the values is: 1, 1, 2, 3, 4, 5, 6
Thus, the median = 3

If the number of values is even, the median is the average of the two middle values once the values have been arranged in ascending or descending order.

E.g. 1, 4, 5, 2, 7, 8, 3, 2

Solution
The ascending order of the values is: 1, 2, 2, 3, 4, 5, 7, 8
Thus, the median = (3 + 4) ÷ 2 = 3.5

Median determination for the grouped data

For grouped data, the median is determined by applying the following formula:

$$\text{Median} = L + \left(\frac{\frac{N}{2} - n_b}{n_w}\right) \times i$$

Where:
L = the lower limit of the median class
N = the total number of observations
$n_b$ = the number of elements in the classes below the median class
$n_w$ = the number of elements in the median class
i = the class interval

Example:
The table below shows the scores of marks in geography for Form V students.

| Class interval | Frequency |
|---|---|
| 40 – 44 | 7 |
| 45 – 49 | 8 |
| 50 – 54 | 11 |
| 55 – 59 | 10 |
| 60 – 64 | 4 |

Solution
According to the given data:
L = 49.5
N = 40
$n_b$ = 15
$n_w$ = 11
i = 5
Then:
Median = 49.5 + ((20 - 15) ÷ 11) × 5
= 49.5 + (0.4545 × 5)
= 49.5 + 2.27 = 51.77
Thus, the median ≈ 51.77 (51.75 if the quotient is first rounded to 0.45).

Advantages of the median
1. It helps to understand the middle value among the numerous values in a data set.
2. It is easy to determine, particularly for a simple data set.

Disadvantages of the median
1. If the values are numerous, it becomes cumbersome to arrange them in ascending or descending order to get the median.
2. Determining the median for a grouped data set needs more skill.

MEASURES OF VARIABILITY

These are the measures which assess the variation of values in a data set. The common measures of variability include the following:

1. Range
2. Standard deviation
3. Variance
4. Mean deviation

1. RANGE

The range is the difference between the highest and lowest values in a given distribution. It is used to assess the variation between the highest score and the lowest score.

Calculation of the range

Calculation of the range also considers the nature of the data set given, whether ungrouped or grouped.

For an ungrouped data set, the range is calculated by subtracting the lowest value from the highest value in the data set.

Example:
Determine the range for the following data set: 4, 2, 3, 5, 6, 4, 8

Solution
According to the given data set:
Highest value = 8
Lowest value = 2
8 - 2 = 6
Thus, the range = 6

Interpreting the result: a high range implies greater variation.
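The grouped mode, median and range examples above can be reproduced with a short Python sketch. The function and parameter names are illustrative and not from the source; `statistics.median` comes from the Python standard library. Carrying full precision, the grouped median comes out at about 51.77, whereas rounding the quotient to 0.45 first gives the 51.75 quoted in the worked solution.

```python
import statistics


def grouped_mode(lower_boundary, t1, t2, class_interval):
    """Mode = L + (t1 / (t1 + t2)) * i for the modal class."""
    return lower_boundary + (t1 / (t1 + t2)) * class_interval


def grouped_median(lower_boundary, n_total, n_below, n_in_class, class_interval):
    """Median = L + ((N/2 - nb) / nw) * i for the median class."""
    return lower_boundary + ((n_total / 2 - n_below) / n_in_class) * class_interval


def value_range(values):
    """Range of ungrouped data: highest value minus lowest value."""
    return max(values) - min(values)


# Ungrouped medians (odd and even number of values)
median_odd = statistics.median([1, 2, 1, 4, 6, 5, 3])
median_even = statistics.median([1, 4, 5, 2, 7, 8, 3, 2])

# Form V marks table: frequencies 7, 8, 11, 10, 4; modal and median class 50 - 54
mode_marks = grouped_mode(49.5, t1=11 - 8, t2=11 - 10, class_interval=5)
median_marks = grouped_median(49.5, n_total=40, n_below=15,
                              n_in_class=11, class_interval=5)

range_example = value_range([4, 2, 3, 5, 6, 4, 8])
```

Here `median_odd` is 3, `median_even` is 3.5, `mode_marks` is 53.25 and `range_example` is 6.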
A small range implies little variation.

For grouped data, the range is calculated by subtracting the lowest class mark from the highest class mark, by subtracting the lowest lower boundary from the highest lower boundary, or by subtracting the lowest upper boundary from the highest upper boundary.

Example:
Determine the range for the following data set.

Solution
Determination of the class marks:

| Class interval | Class mark |
|---|---|
| 10 – 14 | 12 |
| 15 – 19 | 17 |
| 20 – 24 | 22 |
| 25 – 29 | 27 |
| 30 – 34 | 32 |
| 35 – 39 | 37 |

According to the computed class marks:
Highest class mark = 37
Lowest class mark = 12
37 - 12 = 25
Thus, the range = 25

Advantages of a range
1. It gives a quick, rough estimate of variability.
2. It is simple to calculate and most people are familiar with it.

Disadvantages of a range
1. It considers only two values, the highest and the lowest, and is thus not sensitive to the total distribution.
2. It is affected by extreme values.

2. STANDARD DEVIATION

A deviation is the difference between a value and the mean. It is computed by subtracting the mean from the value:

$$d = X - \bar{X}$$

Where:
X = a value given in a distribution
$\bar{X}$ = the average of all values

Standard deviation refers to the typical difference of all values from the mean. It is the root mean square deviation from the mean, and it measures how far the values are scattered from the mean.

Standard deviation is represented by the sigma symbol, $\sigma$.

Computation of a standard deviation

Calculation of a standard deviation also depends on the nature of the data set given, whether ungrouped or grouped.

For ungrouped data, the standard deviation is calculated by the following formula:

$$\sigma = \sqrt{\frac{\sum (X - \bar{X})^2}{N}}$$

Where:
X = a value in a distribution
$\bar{X}$ = the mean
N = the total number of observations

Example:
Calculate the standard deviation for the following data set: 3, 2, 1, 4, 6

Solution
Mean determination: (3 + 2 + 1 + 4 + 6) ÷ 5 = 16 ÷ 5 = 3.2

| X | 3 | 2 | 1 | 4 | 6 |
|---|---|---|---|---|---|
| $X - \bar{X}$ | -0.2 | -1.2 | -2.2 | 0.8 | 2.8 |
| $(X - \bar{X})^2$ | 0.04 | 1.44 | 4.84 | 0.64 | 7.84 |

$\sum (X - \bar{X})^2$ = 0.04 + 1.44 + 4.84 + 0.64 + 7.84 = 14.8
Then:
$\sigma = \sqrt{14.8 \div 5} = \sqrt{2.96}$
Hence, the SD ≈ 1.720

For a grouped data set, the standard deviation is computed by the following formula:

$$\sigma = \sqrt{\frac{\sum f(X - \bar{X})^2}{\sum f}}$$

Where X is the class mark and f the frequency.

Example:
Calculate the SD for the following set of grouped data.

| Class interval | Frequency |
|---|---|
| 40 – 44 | 7 |
| 45 – 49 | 8 |
| 50 – 54 | 11 |
| 55 – 59 | 10 |
| 60 – 64 | 4 |

Procedure:
Determination of the mean

| Class interval | f | X | fX |
|---|---|---|---|
| 40 – 44 | 7 | 42 | 294 |
| 45 – 49 | 8 | 47 | 376 |
| 50 – 54 | 11 | 52 | 572 |
| 55 – 59 | 10 | 57 | 570 |
| 60 – 64 | 4 | 62 | 248 |

$\sum fX$ = 2060 and $\sum f$ = 40. Hence, the mean = 2060 ÷ 40 = 51.5

Then:

| X | 42 | 47 | 52 | 57 | 62 |
|---|---|---|---|---|---|
| $X - \bar{X}$ | -9.5 | -4.5 | 0.5 | 5.5 | 10.5 |
| $(X - \bar{X})^2$ | 90.25 | 20.25 | 0.25 | 30.25 | 110.25 |
| $f(X - \bar{X})^2$ | 631.75 | 162 | 2.75 | 302.5 | 441 |

$\sum f(X - \bar{X})^2$ = 1540
$\sum f$ = 40
$\sigma = \sqrt{1540 \div 40} = \sqrt{38.5}$
Thus, the SD ≈ 6.205

3. VARIANCE

Note: The square of the SD is known as the variance. Its computation also considers the nature of the data set, whether ungrouped or grouped.

For ungrouped data, the variance is computed by the following formula:

$$\sigma^2 = \frac{\sum (X - \bar{X})^2}{N}$$

4. MEAN DEVIATION

The mean deviation is the average of all deviation values, i.e. the amount by which the individual values deviate from the mean irrespective of sign. It is computed by dividing the sum of all deviations, irrespective of their signs, by the number of observations.

Calculation of mean deviation

Calculation of a mean deviation also depends on the nature of the data set given, whether ungrouped or grouped.

For an ungrouped data set, the mean deviation is calculated by the following formula:

$$MD = \frac{\sum |X - \bar{X}|}{N}$$

Example:
Determine the mean deviation for the following data set: 4, 7, 8, 2, 9, 6

Solution
Mean determination
4 + 7 + 8 + 2 + 9 + 6 = 36
Hence, the mean = 36 ÷ 6 = 6

Deviations determination

| X | $X - \bar{X}$ | d (sign ignored) |
|---|---|---|
| 4 | 4 - 6 | 2 |
| 7 | 7 - 6 | 1 |
| 8 | 8 - 6 | 2 |
| 2 | 2 - 6 | 4 |
| 9 | 9 - 6 | 3 |
| 6 | 6 - 6 | 0 |

The sum of deviations:
2 + 1 + 2 + 4 + 3 + 0 = 12
Then:
MD = 12 ÷ 6
Thus, the mean deviation = 2

For a grouped data set, the mean deviation is computed by the following formula:

$$MD = \frac{\sum f|X - \bar{X}|}{\sum f}$$

Example:

| Class interval | Frequency |
|---|---|
| 40 – 44 | 7 |
| 45 – 49 | 8 |
| 50 – 54 | 11 |
| 55 – 59 | 10 |
| 60 – 64 | 4 |

Determination of the mean

| Class interval | f | X | fX |
|---|---|---|---|
| 40 – 44 | 7 | 42 | 294 |
| 45 – 49 | 8 | 47 | 376 |
| 50 – 54 | 11 | 52 | 572 |
| 55 – 59 | 10 | 57 | 570 |
| 60 – 64 | 4 | 62 | 248 |

Hence, the mean = 2060 ÷ 40 = 51.5

Determination of the deviations, where X = class mark:

| X | $X - \bar{X}$ | d (sign ignored) | f | fd |
|---|---|---|---|---|
| 42 | 42 - 51.5 | 9.5 | 7 | 66.5 |
| 47 | 47 - 51.5 | 4.5 | 8 | 36 |
| 52 | 52 - 51.5 | 0.5 | 11 | 5.5 |
| 57 | 57 - 51.5 | 5.5 | 10 | 55 |
| 62 | 62 - 51.5 | 10.5 | 4 | 42 |

The sum of fd:
66.5 + 36 + 5.5 + 55 + 42 = 205
Then:
MD = 205 ÷ 40
Thus, the mean deviation = 5.125

METHODS OF PRESENTING DATA

As introduced in chapter one, numerical data, after being collected, summarized and analyzed, are presented to provide a pictorial view (visual idea). One of the useful ways of presenting numerical facts is by diagrams. Statistical diagrams are thus designed to illustrate the values of geographical items and in turn allow quantitative analysis.

The most useful statistical diagrams for the illustration of quantitative data include the following.

1. Pie chart
2. Proportional semi divided circles
3. Divided rectangle
4. Proportional circles
5. Scatter diagram
6. Wind rose
7. Polar chart

1. PIE CHART

The pie chart is also known as the divided circle or pie graph. It is a circle of any convenient size divided proportionally into a number of segments to show the values of items in percentages. The number of segments the circle is divided into depends on the number of items which have to appear in the circle. The proportional size of each segment is determined by the degree value of its percentage.

Construction of pie chart

Consider the data below for world production of cocoa by countries in 1968.

| Country | Production |
|---|---|
| Brazil | 1,936,297 |
| Ghana | 4,042,988 |
| Nigeria | 2,308,066 |
| Ecuador | 805,499 |
| Cameroun | 1,270,211 |
| Ivory Coast | 1,796,883 |
| Others | 3,330,430 |

Procedure:

a) Total value determination.
1,936,297 + 4,042,988 + 2,308,066 + 805,499 + 1,270,211 + 1,796,883 + 3,330,430 = 15,490,374

b) Percentage values determination.

$$X\% = \frac{\text{value of the item}}{\text{total value}} \times 100$$

c) Degrees of the percentage values determination.

$$X^\circ = \frac{X\%}{100} \times 360^\circ$$

| Country | Production | X% | X° |
|---|---|---|---|
| Brazil | 1,936,297 | 12.5% | 45.0° |
| Ghana | 4,042,988 | 26.1% | 94.0° |
| Nigeria | 2,308,066 | 14.9% | 53.6° |
| Ecuador | 805,499 | 5.2% | 18.7° |
| Cameroun | 1,270,211 | 8.2% | 29.5° |
| Ivory Coast | 1,796,883 | 11.6% | 41.8° |
| Others | 3,330,430 | 21.5% | 77.4° |

d) A circle of any convenient size should be drawn and divided into proportional segments according to the computed degree values. Too small a circle should be avoided.

Thus, the pie chart for the data appears as follows.

Strengths of the pie chart
1. The method is pleasing to the eye and is among the most popular methods of data representation in statistics.
2. The values given by the method are simplified, as they appear in percentages.
3. It allows easy quantitative analysis.

Setbacks of the pie chart
1. It does not give the absolute values of the items represented.
2. It takes much time to prepare and is therefore tedious.
3. It needs high skill to prepare.
4. A problem may arise in selecting the varied shades or textures.

2. PROPORTIONAL SEMI DIVIDED CIRCLE

This pictorial method involves drawing two semi circles linked to one another, each proportional to the total quantity it represents. Each semi circle is proportionally divided into segments, and the number of segments in a semi circle depends on the number of items.
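Segment angles for a divided circle and for a semi circle follow the same proportion rule and differ only in the total angle shared out. A minimal Python sketch, reusing the cocoa production figures from the pie chart example; the function name `segment_angles` is hypothetical, and "Others" is taken as 3,330,430 on the assumption that the items must sum to the stated total of 15,490,374.

```python
def segment_angles(values, total_degrees=360.0):
    """Share total_degrees among the items in proportion to their values."""
    total = sum(values.values())
    return {name: value / total * total_degrees
            for name, value in values.items()}


# World cocoa production, 1968
cocoa = {
    "Brazil": 1_936_297,
    "Ghana": 4_042_988,
    "Nigeria": 2_308_066,
    "Ecuador": 805_499,
    "Cameroun": 1_270_211,
    "Ivory Coast": 1_796_883,
    "Others": 3_330_430,  # assumed so that the items sum to 15,490,374
}

pie = segment_angles(cocoa)                        # full circle, 360 degrees
semi = segment_angles(cocoa, total_degrees=180.0)  # semi circle, 180 degrees

for country, angle in pie.items():
    print(f"{country}: {angle:.1f} degrees")
```

Because each angle is a share of the same total, the angles always add up to the full 360° (or 180° for a semi circle), which is a quick check on hand calculations.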
In making the segments in a semi circle, 180° is used as the total angle for each semi circle.

The method is very useful for comparing items for two major cases, like dates or places.

Construction of the proportional semi divided circles

Consider the following tabled data showing motor vehicle production, passenger and commercial, in the industrialized nations.

Procedure:

a) Find the total value for each variable.
Commercial: 1896 + 241 + 2052 + 409 + 242 + 750 + 940 = 6530
Passengers: 8222 + 2862 + 2055 + 1816 + 1832 + 280 + 4653 = 21720

b) Angle determination of the segments.
Commercial: angle = (value ÷ 6530) × 180°
Passengers: angle = (value ÷ 21720) × 180°

c) Estimation of the diameters of the two proportional semi circles. This depends on a specific scale, which is developed by proposing a value to be represented by 1 cm. Let us say 1 cm should represent 2000 motor vehicles.

Thus, the proportional semi divided circles for the given data appear as follows:

WORLD PRODUCTION OF MOTOR VEHICLES BY COUNTRIES (000)
Diameter scale: 1 cm represents 2000 motor vehicles.

Merits of the semi divided proportional circles
1. It is a useful technique for comparing item values for two major cases.
2. It provides a visual idea.
3. It allows quantitative analysis.

Setbacks of the proportional semi divided circles
1. It needs high skill to extract actual values from the diagram.
2. It takes much time to prepare.
3. It needs high skill to prepare.
4. Selecting shades or textures can be a problem.

3. DIVIDED RECTANGLE

It is one of the most useful and versatile methods of statistical presentation of data, although it is not frequently used. By this method, the total quantity is represented by a rectangle which is then subdivided to represent the constituent parts.

Depending on the function, the divided rectangle is of two types:

A. Simple divided rectangle
B. Compound divided rectangle

A. SIMPLE DIVIDED RECTANGLE

It is a rectangle drawn with a length proportional to the total quantity represented, then divided into proportional segments to show the values of the cases.

Construction of the simple divided rectangle

Consider the following data of coffee production in Tanzania in '000' tons in 1980.

| REGION | PRODUCTION |
|---|---|
| Arusha | 9 |
| Kilimanjaro | 10 |
| Bukoba | 18 |
| Ruvuma | 11 |
| Mbeya | 7 |
| Tanga | 5 |

a) Determine the scale to be used in drawing the rectangle.
The total production is 9 + 10 + 18 + 11 + 7 + 5 = 60 ('000 tons). Taking, for instance, a rectangle 10 cm long, 60 ÷ 10 = 6.
Hence, 1 cm represents 6 ('000) tons.

b) Determine the length of each value along the rectangle (length = production ÷ 6).

| REGION | PRODUCTION | LENGTH (cm) |
|---|---|---|
| Arusha | 9 | 1.5 |
| Kilimanjaro | 10 | 1.67 |
| Bukoba | 18 | 3.0 |
| Ruvuma | 11 | 1.83 |
| Mbeya | 7 | 1.17 |
| Tanga | 5 | 0.83 |

Thus, the simple divided rectangle for the given data appears as follows:

Simple divided rectangle coffee production in '000' tons from 1980 to 1985

B. COMPOUND DIVIDED RECTANGLE

In the compound divided rectangle, each proportional strip in the rectangle is itself proportionally divided to show further information about the cases represented. It is drawn with two scales: one for the horizontal dimension, designated the horizontal scale, and one for the vertical dimension, designated the vertical scale. It is better for the two scales to be graduated in separate values, with the horizontal scale in absolute values and the vertical scale in percentages.

Example:
Consider the data below showing land use patterns for six villages. Land use is given as the size of the total area in '000' km².

| VILLAGE | Westland | Pasture | Arable | Forestry |
|---|---|---|---|---|
| Ruvu Darajani | 166.5 | 27 | 31.5 | 225 |
| Vigwaza | 94.4 | – | 40.4 | 202.2 |
| Buyuni | 226.8 | – | 32.4 | 64.8 |
| Kidogozero | 7.7 | 5.2 | 21.5 | 8.6 |
| Visezi | 8.5 | 13.3 | 6.8 | 5.4 |
| Kitonga | 8.8 | 8.2 | 10 | 7 |

Procedure:

a) Cumulative values determination.
Ruvu Darajani: 166.5 + 27 + 31.5 + 225 = 450
Vigwaza: 94.4 + 40.4 + 202.2 = 337
Buyuni: 226.8 + 32.4 + 64.8 = 324
Kidogozero: 7.7 + 5.2 + 21.5 + 8.6 = 43
Visezi: 8.5 + 13.3 + 6.8 + 5.4 = 34
Kitonga: 8.8 + 8.2 + 10 + 7 = 34

b) Percentage values determination. For each village, percentage = (land use area ÷ village total) × 100.

| VILLAGE | Westland % | Pasture % | Arable % | Forestry % |
|---|---|---|---|---|
| Ruvu Darajani | 37.0 | 6.0 | 7.0 | 50.0 |
| Vigwaza | 28.0 | – | 12.0 | 60.0 |
| Buyuni | 70.0 | – | 10.0 | 20.0 |
| Kidogozero | 17.9 | 12.1 | 50.0 | 20.0 |
| Visezi | 25.0 | 39.1 | 20.0 | 15.9 |
| Kitonga | 25.9 | 24.1 | 29.4 | 20.6 |

c) Scale determination.
Horizontal scale: 1 cm represents 125%
Vertical scale

Merits of the divided rectangle
i. It is a useful method for showing cumulative values.
ii. It is illustrative, as it provides a visual idea to the users of statistics.
iii. It allows easy quantitative analysis.
iv. The data represented by a compound divided rectangle can also be represented by a percentage bar graph.

Setbacks of the divided rectangle
i. It is not very pleasing to the eye.
ii. It takes much time to prepare, especially the compound divided rectangle.
iii. It needs high skill to prepare the compound divided rectangle.
iv. It is little used for statistical data representation.
v. Selecting varied textures can be a problem when the items are numerous.

4. PROPORTIONAL CIRCLES

This is a diagram with circles whose sizes are proportional to the quantities represented. The area of a circle is calculated by the following formula:

$$A = \pi r^2$$

In our case, however, $\pi$ is ignored. The radius varies with the quantity to be represented. Hence, proportional circles are drawn with radii proportional to the square root of the quantity represented.

Construction of proportional circles

Consider the data below: hydroelectricity production for some stations in country X.

| HEP Station | Production in MW |
|---|---|
| A | 100 |
| B | 144 |
| C | 225 |
| D | 400 |
| E | 625 |

Procedure:

a) Arrange the values in ascending or descending order, i.e. 100, 144, 225, 400, 625.

b) Find the square roots of the values.
√100 = 10
√144 = 12
√225 = 15
√400 = 20
√625 = 25

c) Estimate the radius scale to be used for all the proportional circles. In the estimation, propose the highest radius to be used.
Then divide the highest square root by the proposed highest radius. For example, with a proposed highest radius of 5 cm, 25 ÷ 5 = 5.
Thus, 1 cm to 5 square root units.

d) Draw the proportional circles accordingly.

In drawing the proportional circles, the following procedure should be followed.
1. The circles are drawn proportionally to the quantities represented, according to the scale that has been decided.
2. Two perpendicular lines should be drawn to follow the arrangement of the circles.
3. The central line should be drawn through all the circles.

PROPORTIONAL CIRCLES SHOWING HEP PRODUCTION FOR THE STATIONS

Note
Proportional circles can be drawn on a map to show the values of the places which appear on it. Proportional circles on a map may sometimes overlap. This is not a problem, but where possible the size of the circles should be kept small. One way of doing so is to reduce the scale.

Consider the map with proportional circles on the next page.

Advantages of proportional circles
1. It is a good method of comparing absolute values.
2. Proportional circles give a good visual impression.

Disadvantages of the proportional circles
1. They are tedious to construct.
2. It is difficult to determine the exact values from the circles.

5. SCATTER DIAGRAM

This method is also known as the scatter graph. It is a statistical diagram designed to show the correlation between two types of data. The diagram has two axes.
The vertical axis is used to show the values of the dependent variable, while the horizontal axis is used to show the values of the independent variable.

On the diagram, a straight line is drawn to follow the distribution of the dots.
1. If the plotted dots lie close to the straight line, this indicates greater correlation.
2. If the plotted dots are widely scattered from the line, this indicates low or no correlation.

Construction of the scatter diagram

Consider the data below showing the amount of rainfall at varied altitudes.

| Altitude (m) | Rainfall (mm) |
|---|---|
| 500 | 600 |
| 600 | 800 |
| 700 | 1200 |
| 800 | 1500 |
| 900 | 1700 |
| 1000 | 2000 |
| 1100 | 2400 |

Procedure:

a) Identify the variables.
Dependent variable: rainfall
Independent variable: altitude

b) Estimate both the vertical and horizontal scales.
Hence, vertical scale (VS): 1 cm represents 500 mm
Hence, horizontal scale (HS): 1 cm represents 250 m

According to the scatter diagram, the plotted dots lie very close to the line. This shows a strong positive correlation between rainfall and altitude, i.e. rainfall is greatly influenced by altitude.

6. WIND ROSE

It is a statistical diagram designed to show the frequencies with which wind blew from varied directions, and at varied speeds, in a given month as recorded at a certain weather station.

The wind rose is of two types: simple and compound.

A. SIMPLE WIND ROSE

The simple wind rose shows only the number of times wind blew from each direction. It is drawn on an octagon or on a circle of any convenient size. If an octagon is used, a rectangle is drawn on each side to represent the direction from which the winds were blowing. The rectangles may be of equal or varied length. If the rectangles are of equal length, small lines are drawn in each to represent the number of wind blow frequencies. If they are of unequal length, the length of each is made proportional to the number of wind blow frequencies.
The number of days which did not experience wind blow (calm days) is written in a circle inside the octagon.

Example:
Construct the simple wind rose to represent the following data: wind blow frequencies at X weather station for the month of June.

DIRECTION: N, NE, E, SE, S, SW, W, NW
WIND Fq: 5441343

WIND ROSE FOR X WEATHER STATION

B. COMPOUND WIND ROSE
A compound wind rose is used to show the average wind blow frequencies by direction and speed, commonly in percentages, for a given month at a given weather station.

Example:
Construct the compound wind rose to present the following data: wind blow frequencies at X weather station for the month of June, in percentages.

Wind speed / Direction   N    NE   E    SE   S    SW   W    NW
Less than 4 kph          2    2    3    3    4    2    5    4
4 - 12 kph               3    4    2    5    2    3    4    2
13 - 22 kph              2    2    1    1    3    2    2    3
Total                    11   10   9    11   11   10   15   11

Calm days = 18%

The compound wind rose for the given data is constructed as follows.
Scale value determination
Hence; 1 cm represents 3% frequency
Thus; the wind rose appears as follows:-

Advantages of wind rose:
i. It gives a visual impression of wind frequencies.
ii. It is relatively easy to construct and takes a short time, provided the scale is well assessed.
iii. The information represented is easy to understand.

Disadvantages of a wind rose
i. Numerical values are not easily extracted, as this needs measuring and calculating using the given scale.
ii. One cannot know the exact time or day when the wind blew from a particular direction, since the wind rose is a summary of the conditions over a period of time.
iii. The pattern of wind blow over a given period cannot easily be seen from the diagram.

6. POLAR CHART
This graph is also known as the circular graph or clock graph. It is a graph in circular form designed to have bars and a circular line to show two attributes whose values appear in varied units. It is basically employed to illustrate the amount of temperature and rainfall together in a year.
However, the polar chart can also be used for other distributions recorded over a year. For climatic records, the polar chart uses bars to illustrate rainfall values and a line to illustrate temperature values. The circle is divided by twelve equiangular radii.

Construction of the circular graph
The following data show the climatic condition for a certain weather station in Jerusalem.

Month       J     F     M     A     M     J     J     A     S     O     N     D
Temp (°C)   8     9.1   12.2  17    21    22    23    24    23    21    17    12
Rain (mm)   150   160   70    30    18    0     0     0     0     22    80    90

Steps
Estimation of the value scales to be used.
Thus, the value scale for rainfall is 1 cm to 40 mm.
Hence; the temperature vertical scale: 1 cm represents 5 °C.
The polar chart has to be drawn as follows:-

Strengths of the circular graph
i. It is a useful graphical method for showing the distribution values of climate.
ii. It is more illustrative, as it gives users a visual idea of the statistics.
iii. It allows quantitative analysis to be made easily.

Setbacks of the circular graph
i. It needs high skill to make quantitative analysis from the graph.
ii. It is a time consuming graphical method to construct.
iii. It needs high skill to construct the graph.

STATISTICAL GRAPHS
These are the graphs designed to illustrate values of geographical items by means of lines or bars, and in turn allow quantitative analysis. The most useful statistical graphs for the illustration of values include the following:
a) Line graphs
b) Bar graphs
c) Combined bar and line graph

A. LINE GRAPHS
These are the graphs which use line(s) to illustrate the values of items to give quantitative analysis. Any line graph has the following two axes:
X-axis: This is also known as the base or horizontal axis. It is used principally to show the values of the independent variable, such as dates or places.
Y-axis: This is also known as the vertical axis. It is used to show the values of the dependent variable, such as output of crops, minerals etc.

TYPES OF LINE GRAPHS
Line graphs are extremely varied.
They are differently designed to meet varied functions (roles). On this basis, line graphs are recognized to be of the following forms:
1. Simple line graph
2. Cumulative line graph
3. Divergent line graph
4. Group line graph
5. Compound line graph

1. Simple line graph
It is a form of line graph designed to have one line to illustrate the values of one item in relation to dependent and independent variables, i.e. it is designed to show the values of one item per varied date or place.

CONSTRUCTION OF THE SIMPLE LINE GRAPH
Consider the hypothetical data below showing maize production for country X in 0,000 metric tons (1990 - 1995).

YEAR    PRODUCTION
1990    100
1991    250
1992    300
1993    150
1994    500
1995    400

Procedure
a) Variables identification
Dependent variable: production values
Independent variable: date (years)
Y-axis: production values
X-axis: years
b) Vertical and horizontal scales estimation
Hence; VS is 1 cm to 50000 tons.
The horizontal scale is a matter of decision.
Hence; 1 cm represents 1 year.

MAIZE PRODUCTION FOR COUNTRY X IN (0,000) Metric tons
Source: Hypothetical data

Strengths of the simple line graph
i. It is much easier to prepare, as it involves no complicated mathematical work and a single line makes up the graph.
ii. The absolute values can be extracted from the graph.
iii. It is comparatively easy to read and interpret the values.
iv. It can be perfectly replaced by the simple bar graph.

Setbacks of the simple line graph
i. It is a limited graphical method, as it is only suited to represent the values of one item.
ii. It is sometimes difficult to assess the vertical scale if the variation between the highest and lowest values is very wide.

2. Cumulative line graph
It is a form of line graph designed to show the accumulated total values at various dates, or possibly places, for a single item.
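Accumulating totals is a running sum. A minimal sketch, using the hypothetical maize production figures from this subsection's worked example and its vertical scale of 1 cm to 50 tons (the variable names are illustrative):

```python
from itertools import accumulate

# Hypothetical maize production for country X, from the worked example.
years = [1990, 1991, 1992, 1993, 1994, 1995]
production = [50, 40, 90, 100, 90, 130]  # tons

# Running total: each entry is the sum of all production up to that year.
cumulative = list(accumulate(production))

# Height of each plotted point at a vertical scale of 1 cm to 50 tons.
TONS_PER_CM = 50
heights_cm = [total / TONS_PER_CM for total in cumulative]

for year, value, total, height in zip(years, production, cumulative, heights_cm):
    print(year, value, total, height)
```

The last cumulative value equals the grand total for the whole period, which is a quick check that no year has been missed.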
Unlike the other line graph methods, this method has no alternative bar graph method.

Construction of the cumulative line graph
Consider the hypothetical data below showing maize production for country X.

YEAR    PRODUCTION
1990    50
1991    40
1992    90
1993    100
1994    90
1995    130

Procedure
a) Variables identification
Dependent variable: production values
Independent variable: date (years)
Y-axis: production values
X-axis: years
b) Vertical and horizontal scales estimation
c) Determination of the cumulative values

YEAR    PRODUCTION    CUM VALUES
1990    50            50
1991    40            90
1992    90            180
1993    100           280
1994    90            370
1995    130           500

Hence; VS: 1 cm represents 50 tons
Thus; the cumulative line graph appears as follows.

Cumulative line graph: Maize production for country X.
SCALE:-
VS: 1 cm represents 50 tons
HS: 1 cm represents 1 year
Source: Hypothetical data.

Merits of the cumulative line graph
i. The graphical method shows cumulative values.
ii. The values can be read from the graph and quantitatively analyzed.

Setbacks of the cumulative line graph
i. The graphical method is not suited to show cumulative values for more than one item; it is limited to showing the values of a single item.
ii. It needs high skill to read the actual values of the item represented.
iii. It has no alternative bar graph method.

3. Divergent line graph
It is a form of line graph designed to illustrate the increase and decrease of the distribution values in relation to the mean. The graph is designed to have upper and lower sections showing positive and negative values respectively. The two portions are separated by the steady line, graduated with the zero value along the vertical line.
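The values plotted on a divergent graph are each observation minus the arithmetic mean. A minimal sketch, using the coffee export figures from this subsection's worked example (with the 1955 value taken as 300, the figure used in the worked computation) and rounding the mean to a whole number as the example does:

```python
# Coffee export values for country X in millions of dollars (worked example).
exports = {1952: 345, 1953: 256.5, 1954: 283, 1955: 300, 1956: 335, 1957: 330.5}

# Arithmetic mean: 1850 / 6, about 308.3.
mean = sum(exports.values()) / len(exports)

# The worked example rounds the mean to 308 and uses it as the steady (zero) line.
steady_line = round(mean)

# Positive deviations plot above the steady line, negative ones below it.
deviations = {year: value - steady_line for year, value in exports.items()}

for year, deviation in deviations.items():
    print(year, deviation)
```

Values below the mean, such as the 1953 figure, give negative deviations and are plotted in the lower section of the graph.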
The steady line also shows the average of all values.

Construction of the divergent line graph
Consider the following data showing export values of coffee for country X in millions of dollars.

YEAR    EXPORT VALUES (000,000 dollars)
1952    345
1953    256.5
1954    283
1955    300
1956    335
1957    330.5

a) Variables identification
Dependent variable: export values
Independent variable: date (years)
Y-axis: export values
X-axis: years
b) Computation of the arithmetic mean
345 + 256.5 + 283 + 300 + 335 + 330.5 = 1850
Then; 1850 ÷ 6 = 308.3, taken as 308.
Computation of the deviation values
1952    345 - 308 = 37
1953    256.5 - 308 = -51.5
1954    283 - 308 = -25
1955    300 - 308 = -8
1956    335 - 308 = 27
1957    330.5 - 308 = 22.5
c) Estimation of the vertical scale
Thus; the vertical scale: 1 cm represents 15 or -15 million dollars
d) The graph has to be drawn accordingly as follows:-

Source: Hypothetical data
Scales:-
Vertical scale: 1 cm represents 15 or -15 million dollars
Horizontal scale: 1 cm represents 1 year

Merits of the divergent line graph
i. The graphical method is useful for showing increase and decrease of the values.
ii. The graphical method shows the average of all values.
iii. It can be perfectly replaced by the divergent bar graph.

Setbacks of the divergent line graph
i. The graphical method is not suited to show the increase and decrease of values for more than one item; it is limited to a single item.
ii. It needs high skill to read the actual values of the item represented.
iii. It is a time consuming graphical method, as its preparation involves a lot of mathematical work.
iv. It requires high skill to construct the divergent line graph.

4. Group line graph
It is a form of statistical line graph designed to have more than one line, of varied textures, to illustrate the values of more than one item.
The group line graph is alternatively known as the composite, comparative, or multiple line graph.

Construction of the group line graph
Consider the data below showing values of export crops from Kenya (Ksh million).

Crop/Year       1997     1998     1999     2000     2001
Tea             24,126   32,971   33,065   35,150   34,448
Coffee          16,856   12,817   12,029   11,707   7,460
Horticulture    13,752   14,938   17,641   21,216   19,846
Tobacco         1,725    1,607    1,554    2,167    2,887

a) Variables identification
Dependent variable: export values
Independent variable: date (years)
Y-axis: export values
X-axis: years
b) Vertical scale estimation
Hence; VS: 1 cm represents 5,000 (Ksh million) of export value
Thus; the group line graph appears as follows:-

KENYA: CROPS EXPORT VALUES
Scales:- Vertical scale: 1 cm to 5,000 export values
Source: Kenya Economic Survey 1969

Strengths of the group line graph
i. It is much easier to prepare, as it involves no complicated mathematical work.
ii. It is a useful graphical method for showing the values of more than one case.
iii. The absolute values can be extracted from the graph, as the values are directly shown.
iv. It is comparatively easy to read and interpret the values.
v. It can be perfectly replaced by the group bar graph.

Setbacks of the group line graph
i. It is sometimes difficult to assess the vertical scale if the variation between the highest and lowest values is very wide.
ii. Crossing of the lines on the graph may confuse the interpreter.
iii. A problem may arise in the selection of the varied line textures.

5. Compound line graph
It is a line graph designed to have more than one line, compounded on one another and separated by varied shade textures, to show the cumulative values of more than one item.

Construction of the compound line graph
Consider the data below showing cocoa production for the Ghana provinces in 000 tons.

YEAR/PROV    TV Togoland    E. province    W. province    Ashanti
1947/48      40             40             30             35
1948/49      50             60             45             100
1949/50      45             46             89             110
1950/51      45             47             44             124
1951/52      47             23             50             100
1952/53      51             14             57             118

Procedure
a) Variables identification
Dependent variable: production values
Independent variable: date (years)
Y-axis: production values
X-axis: years
b) Cumulative values determination for the dates
1947/48    40 + 40 + 30 + 35 = 145
1948/49    50 + 60 + 45 + 100 = 255
1949/50    45 + 46 + 89 + 110 = 290
1950/51    45 + 47 + 44 + 124 = 260
1951/52    47 + 23 + 50 + 100 = 220
1952/53    51 + 14 + 57 + 118 = 240
c) Vertical and horizontal scales determination
Hence; the vertical scale: 1 cm represents 50 (000 tons)
Thus the graph appears as follows:-

Strengths of the compound line graph
i. It is a useful graphical method for showing the cumulative values of more than one case.
ii. Depending on the skill of the interpreter, the absolute values can be extracted from the graph, as the values are directly shown.
iii. It can be perfectly replaced by the compound bar graph.
iv. It is comparatively easy to assess the vertical scale to be used.

Setbacks of the compound line graph
i. It needs high skill to interpret the graph.
ii. It needs high skill to construct the graph.
iii. A problem may arise in the selection of the varied line textures.

B. BAR GRAPHS
These are the graphs which use bars to illustrate the values of items to give quantitative analysis. Any bar graph has two axes:
X-axis: This is also known as the base or horizontal axis. It is used principally to show the values of the independent variable, such as dates or places.
Y-axis: This is also known as the vertical axis. It is used to show the values of the dependent variable, such as output of crops, minerals etc.

TYPES OF BAR GRAPHS
Like line graphs, bar graphs are also extremely varied, as they are differently designed to meet varied functions.
On this basis, bar graphs are categorized into the following:-
i. Simple bar graph
ii. Divergent bar graph
iii. Group bar graph
iv. Compound bar graph
v. Percentage bar graph
vi. Population pyramid

Simple bar graph
It is a form of bar graph designed to have bars of similar texture to illustrate the values of one item in relation to dependent and independent variables, i.e. it is designed to show the values of one item per varied date or place.

Construction of the simple bar graph
Consider the data below showing cocoa purchase by areas, in 000 metric tons (1953).

Province        Purchase
Ashanti         104
W-Province      39
E-Province      45
TV Togoland     22

Procedure
a) Variables identification
Dependent variable: purchase
Independent variable: provinces
Y-axis: purchase values
X-axis: provinces
b) Vertical scale estimation
Thus; the vertical scale: 1 cm represents 20,000 tons
Bar width = 1 cm
Bar space = 0.5 cm
The graph has to be constructed accordingly.

COCOA PURCHASE BY PROVINCES (1953/54)
Vertical scale: 1 cm represents 20,000 tons

Strengths of the simple bar graph
i. It is much easier to prepare, as it involves no complicated mathematical work and bars of similar texture make up the graph.
ii. The absolute values can be extracted from the graph.
iii. It is comparatively easy to read and interpret the values.
iv. It can be perfectly replaced by the simple line graph.

Setbacks of the simple bar graph
i. It is a limited graphical method, as it is only suited to represent the values of one item.
ii. It is sometimes difficult to assess the vertical scale if the variation between the highest and lowest values is very wide.

Divergent bar graph
It is a form of bar graph designed to illustrate the increase and decrease of the distribution values in relation to the mean. The graph is designed to have upper and lower sections showing positive and negative values respectively.
The two portions are separated by the steady line, graduated with the zero value along the vertical line. The steady line also shows the average of all values.

Construction of the divergent bar graph
Consider the following data showing export values of coffee for country X in millions of dollars.

YEAR    EXPORT VALUES (000,000 dollars)
1952    345
1953    256.5
1954    283
1955    300
1956    335
1957    330.5

a) Variables identification
Dependent variable: export values
Independent variable: date (years)
Y-axis: export values
X-axis: years
b) Computation of the arithmetic mean
345 + 256.5 + 283 + 300 + 335 + 330.5 = 1850
1850 ÷ 6 = 308.3, taken as 308.
c) Computation of the deviation values
1952    345 - 308 = 37
1953    256.5 - 308 = -51.5
1954    283 - 308 = -25
1955    300 - 308 = -8
1956    335 - 308 = 27
1957    330.5 - 308 = 22.5
d) Estimation of the vertical scale
Thus; the vertical scale: 1 cm represents 15 or -15 million dollars
Bar width = 1 cm
Bar space = 1 cm
The graph has to be drawn accordingly as follows.

COFFEE EXPORT VALUES FOR COUNTRY X
In million dollars
Scales:-
Vertical scale: 1 cm represents 15 or -15 million dollars
Horizontal scale: 1 cm represents 1 year
Source: Hypothetical data

Merits of the divergent bar graph
i. The graphical method is useful for showing increase and decrease of the values.
ii. The graphical method shows the average of all values.
iii. It can be perfectly replaced by the divergent line graph.

Setbacks of the divergent bar graph
i. The graphical method is not suited to show the increase and decrease of values for more than one item; it is limited to a single item.
ii. It needs high skill to read the actual values of the item represented.
iii. It is a time consuming graphical method, as its preparation involves a lot of mathematical work.
iv. It requires high skill to construct the divergent bar graph.

Grouped bar graph
It is a form of statistical bar graph designed to have more than one bar, of varied textures, to illustrate the values of more than one item. The grouped bar graph is alternatively known as the composite, comparative, or multiple bar graph.

Construction of the group bar graph
Consider the data below for cocoa purchase by provinces in Ghana (1947/48 - 1950/51).

YEAR/PROV    TV Togoland    E. province    W. province    Ashanti
1947/48      20             54             28             106
1948/49      26             80             46             126
1949/50      24             67             40             116
1950/51      22             72             45             123

a) Variables identification
Dependent variable: purchase values
Independent variable: date
Y-axis: purchase values
X-axis: date
b) Vertical scale estimation
Hence; VS: 1 cm to 20,000 tons
Bar width = 1 cm
Bar space = 1 cm
c) The graph should be drawn accordingly.

COCOA PURCHASE BY PROVINCES (1947/48 - 1950/51)

Strengths of the grouped bar graph
i. It is much easier to prepare, as it involves no complicated mathematical work.
ii. It is a useful graphical method for showing the values of more than one case.
iii. The absolute values can be extracted from the graph, as the values are directly shown.
iv. It is comparatively easy to read and interpret the values.
v. It can be perfectly replaced by the group line graph.

Setbacks of the grouped bar graph
i. It is sometimes difficult to assess the vertical scale if the variation between the highest and lowest values is very wide.
ii. A problem may arise in the selection of the varied bar textures.

Compound bar graph
It is a bar graph designed to have bars divided proportionally, showing the cumulative values of more than one item per varied date or place. The compound bar graph is alternatively known as the divided bar graph or superimposed bar graph.

Construction of the compound bar graph
Consider the data below showing cocoa purchase (tons) by provinces in Ghana (1947/48 to 1950/51).

REGION/YEAR    1947/48    1948/49    1949/50    1950/51
Ashanti        106,000    126,000    116,000    123,000
W. Province    28,000     46,000     40,000     45,000
E. Province    54,000     80,000     67,000     72,000
T. Volta       20,000     26,000     24,000     22,000

Procedure
a) Variables identification
Dependent variable: purchase values
Independent variable: date (years)
Y-axis: purchase values
X-axis: years
b) Cumulative values determination for the dates
1947/48    106,000 + 28,000 + 54,000 + 20,000 = 208,000
1948/49    126,000 + 46,000 + 80,000 + 26,000 = 278,000
1949/50    116,000 + 40,000 + 67,000 + 24,000 = 247,000
1950/51    123,000 + 45,000 + 72,000 + 22,000 = 262,000
c) Vertical scale determination
Thus; the VS: 1 cm represents 50,000 tons
The graph should be drawn accordingly.

COCOA PURCHASE BY PROVINCE (1947/48 - 1950/51)

Strengths of the compound bar graph
i. It is a useful graphical method for showing the cumulative values of more than one case.
ii. Depending on the skill of the interpreter, the absolute values can be extracted from the graph, as the values are directly shown.
iii. It can be perfectly replaced by the compound line graph.
iv. It is comparatively easy to assess the vertical scale to be used.

Setbacks of the compound bar graph
i. It needs high skill to interpret the graph.
ii. It needs high skill to construct the graph.
iii. A problem may arise in the selection of the varied textures of the proportional segments.
iv. It is very tedious, as it involves mathematical calculation.
v. It is time consuming to prepare.

Percentage bar graph
In a percentage bar graph, all bars must be drawn to the same height, representing 100%. A suitable scale interval is chosen, such as 5, 10 or 20, and marked along the sides. The percentage of the total that each area stands for must start from the zero line. It is also advisable to write the actual percentages on the face of the bars.

Construction of the percentage bar graph
Consider the data below showing cocoa purchase (tons) by provinces (1947/48 to 1950/51).

REGION/YEAR    1947/48    1948/49    1949/50    1950/51
Ashanti        106,000    126,000    116,000    123,000
W. Province    28,000     46,000     40,000     45,000
E. Province    54,000     80,000     67,000     72,000
T. Volta       20,000     26,000     24,000     22,000

Procedure
a) Variables identification
Dependent variable: purchase values
Independent variable: date (years)
Y-axis: purchase values
X-axis: years
b) Cumulative values determination for the dates
1947/48    106,000 + 28,000 + 54,000 + 20,000 = 208,000
1948/49    126,000 + 46,000 + 80,000 + 26,000 = 278,000
1949/50    116,000 + 40,000 + 67,000 + 24,000 = 247,000
1950/51    123,000 + 45,000 + 72,000 + 22,000 = 262,000
c) Determination of the percentages by province in each year
Each purchase value is divided by the total for its year and multiplied by 100.

REGION/YEAR    1947/48    1948/49    1949/50    1950/51
Ashanti        51.0%      45.3%      47.0%      46.9%
W. Province    13.5%      16.5%      16.2%      17.2%
E. Province    26.0%      28.8%      27.1%      27.5%
T. Volta       9.6%       9.4%       9.7%       8.4%

Hence; VS: 1 cm represents 20%
The percentage bar graph should be drawn accordingly as follows:-

COCOA PURCHASE BY PROVINCES (1947/48 - 1950/51)
Vertical scale: 1 cm represents 20%

Strengths of the percentage bar graph
i. It is a useful graphical method for showing the values of more than one case.
ii. The data represented appear in a more simplified form, as they are given in percentages.
iii. It is comparatively easy to assess the vertical scale to be used.

Setbacks of the percentage bar graph
i. It does not give the absolute values.
ii. It needs high skill to interpret the graph.
iii. It needs high skill to construct the graph.
iv. A problem may arise in the selection of the varied textures of the proportional segments.
v. It takes much time to prepare.

Population pyramid graph
It is a form of bar graph designed to show population distribution by age and sex. It is a double bar chart showing the age-sex structure of the population.
It consists of two sets of horizontal bars, one for each sex, showing either percentages or absolute numbers.

Rules for drawing the population pyramid graph
i. As a principle in drawing a population pyramid, the male population is illustrated by the left set of bars, while the female population is illustrated by the right set of bars.
ii. The young population is always at the bottom, while the old population is at the top.
iii. Usually the last age group should be left open-ended, because some people may survive beyond 100 years and their number would otherwise be omitted.
iv. The bottom scale can be graduated in percentages or absolute numbers.
v. If percentages are used, the total population of both sexes combined should be used to compute the percentages.
vi. After all the bars have been drawn, they can be shaded in one colour or in separate colours for each sex.

CONSTRUCTION OF THE POPULATION PYRAMID
There are two techniques of drawing the horizontal bars of an age-sex pyramid. In the first technique, the bars are drawn proportionally to the actual population numbers (absolute values). In the second technique, the bars are drawn to represent percentages.

Age group         Male          Female        Total
0-4               2,291,936     2,242,966     4,534,902
5-9               2,000,580     1,962,556     3,963,136
10-14             2,034,980     2,003,655     4,038,635
15-19             1,681,984     1,721,194     3,403,178
20-24             1,328,529     1,504,389     2,832,918
25-29             1,094,909     1,164,594     2,259,503
30-34             840,692       845,230       1,685,922
35-39             695,263       723,749       1,419,012
40-44             516,502       516,989       1,033,491
45-49             419,841       418,987       838,828
50-54             344,639       340,167       684,806
55-59             223,691       236,325       460,016
60-64             194,513       214,715       409,228
65-69             140,969       160,364       301,333
70-74             118,601       135,524       254,125
75-79             79,166        81,620        160,786
80+               95,300        121,038       216,338
Age not stated    103,487       86,956        190,443
All ages          14,205,589    14,481,018    28,686,607

The absolute value technique
The following steps are followed when constructing a population pyramid using absolute values.
1. Decide on a suitable scale for the horizontal axis (baseline) by considering the values of the biggest and smallest age groups, as well as the size of the paper on which the pyramid is to be drawn.
The horizontal scale is determined as follows. Considering the data in the table, a scale of 1 cm to represent 400,000 people would be suitable.
2. Choose a suitable scale for the vertical axis. This scale will determine how wide the bars will be and also the interval between the age groups. The width of the bars should not exceed 6 mm, otherwise the pyramid will look untidy.
3. Take a clean sheet of graph paper and draw the horizontal axis at least 3 cm from the bottom of the page. Draw two vertical axes 1 cm apart and about 10 cm long, touching the horizontal axis.
4. Where the vertical axes touch the horizontal axis, mark zero. On the horizontal axis, at intervals of 1 cm from the zero mark on both sides, mark off the values representing the female and male populations.
5. In the middle column, fill in the age groups, starting with the youngest at the bottom. The age groups should fit within the width of the horizontal bars.
6. Using the horizontal scale, and starting with the first age group for females, draw a bar from the vertical axis on the right hand side of the central column towards the right, to represent the female population of that group. The scale chosen in step 1 above will determine the length of the bar.
7. From the vertical axis on the left hand side, draw a bar representing the male population of the same age group. Steps 6 and 7 should be repeated for all the subsequent age groups until the last one has been represented.

The percentages technique
In this technique, the values for population distribution by age and sex are given in percentages. The percentage of each female or male age group over the total population is calculated from the absolute values in our example, and a new set of data is derived from the data in the table. This new data is used to draw the graph.

An example of how to calculate the percentage values is shown below.
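The calculation can be sketched in a few lines. This is an illustration only, using the first three age groups and the all-ages total from the population table; the helper name percent_of_total is hypothetical, and rounding to one decimal place is an assumption chosen to match the precision of the percentage table.

```python
# All-ages total for both sexes combined, from the population table.
TOTAL_POPULATION = 28_686_607

# (male, female) absolute values for the first three age groups.
age_groups = {
    "0-4": (2_291_936, 2_242_966),
    "5-9": (2_000_580, 1_962_556),
    "10-14": (2_034_980, 2_003_655),
}


def percent_of_total(count, total=TOTAL_POPULATION):
    """Share of the combined total population, as a percentage to 1 decimal place."""
    return round(count / total * 100, 1)


for group, (male, female) in age_groups.items():
    print(group, percent_of_total(male), percent_of_total(female))
```

Note that each sex is divided by the total for both sexes combined, as the rules for drawing the pyramid require, not by the total for that sex alone.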
The percentage is calculated as follows.
For instance: the absolute value for females aged 0-4 years from the table is 2,242,966, while that for males is 2,291,936. The total population according to the 1999 census was 28,686,607.
Therefore the percentage of females is: (2,242,966 ÷ 28,686,607) × 100 = 7.8%
The percentage of males is: (2,291,936 ÷ 28,686,607) × 100 = 8.0%
The percentage values worked out from the figures in the table are given in the table below.

Age group    % Male    % Female    Total
0-4          8.0       7.8         15.8
5-9          7.0       6.8         13.8
10-14        7.1       7.0         14.1
15-19        5.9       6.0         11.9
20-24        4.6       5.2         9.8
25-29        3.8       4.1         7.9
30-34        2.9       2.9         5.8
35-39        2.4       2.5         4.9
40-44        1.8       1.8         3.6
45-49        1.5       1.5         3.0
50-54        1.2       1.2         2.4
55-59        0.8       0.8         1.6
60-64        0.7       0.7         1.4
65-69        0.5       0.6         1.1
70-74        0.4       0.5         0.9
75-79        0.3       0.3         0.6
80+          0.3       0.4         0.7

After the calculation of the percentages, the following steps should be taken to come up with the age-sex pyramid.
1. Choose a suitable scale for the horizontal axis by considering the highest and the lowest percentages in the table. According to the values in the table, a scale of 1 cm representing 1% would be suitable.
2. Follow steps 2 and 3 as outlined under the absolute value technique discussed earlier.
3. Where the vertical axes touch the horizontal axis, mark zero and, at intervals of 1 cm, mark off the percentage values towards the right for females and towards the left for males.
4. The age groups should be indicated in the middle column, just as was done when constructing the age-sex pyramid using absolute values.
5. Using the horizontal scale, and starting with age group 0-4, draw a bar on the right hand side to represent the percentage value of the female population in this age group. In our example, the percentage is 7.8. Draw a similar bar on the left hand side to represent the value of the male population, which in our case is 8.0.
6. Draw bars to represent all the remaining age groups in the same way, then complete the pyramid as under the absolute value technique.

Kenya population by age:

Note
A pyramid may also be drawn for the purpose of making comparisons, either in terms of time or location.
This can be done by means of a double combined population pyramid. The double combined population pyramid looks as follows.

Advantages of the age-sex pyramid
i. It is a visually attractive method of presenting data.
ii. A variety of information is shown on the same graph. The details include age, sex and number of people.
iii. It can be used to compare the age-sex structure of a number of countries.
iv. It gives a clear picture and summary of the population composition of a country.

Disadvantages of the age-sex pyramid
i. It is tedious to construct because it involves many values.
ii. It is difficult to tell the exact values represented because of the small scale of the horizontal axis.
iii. Reasons for the differences in population numbers cannot be obtained from the graph directly. Additional information therefore has to be sought from elsewhere.

COMBINED BAR AND LINE GRAPH
It is a form of statistical graph designed to have both bars and a line to show two attributes whose values appear in varied units. It is basically employed to show the values of rainfall and temperature together in a year.

In the graph, the bars are used to illustrate the amount of rainfall in mm or inches, while the line is used to illustrate temperature in °C or °F.
This graph is also known as the climograph.

Construction of the bar and line graph
Consider the following climatic data for Darwin weather station, Australia.

Month       J      F      M      A      M      J      J      A      S      O      N      D
Temp (°C)   28.9   27.8   28.9   29     26.7   26     25.1   26.4   28.1   29.7   29.8   29
Rain (mm)   388    330    246    114    17.8   5      2.5    2.5    12.7   53.3   132    261

Procedure
a) Identification of the variables
Dependent variable: rain and temperature values
Independent variable: date (months)
Y-axis: rain and temperature values
X-axis: months
b) Estimation of the vertical scales to be used
Thus; the vertical scale for rainfall is 1 cm to 50 mm.
Thus; the vertical scale for temperature is 1 cm to 10 °C.
The graph has to be drawn as follows:

CLIMATIC CONDITION FOR DARWIN AUSTRALIA

Strengths of the combined bar and line graph
i. It is a useful graphical method for showing the distribution values of climate.
ii. It is more illustrative, as it gives users a visual idea of the statistics.
iii. It allows quantitative analysis to be made easily.

Setbacks of the combined bar and line graph
i. It needs high skill to make quantitative analysis from the graph.
ii. It is a time consuming graphical method to construct.
iii. It needs high skill to construct the graph.
iv. It is tedious, as it involves mathematical calculation.

STATISTICAL MAPS
As introduced in chapter one, numerical data, after being collected, summarized and analyzed, are presented to provide a pictorial view (visual idea). One of the useful ways of representing numerical facts is by maps. Maps are used with an emphasis on showing the distribution values of phenomena for places over the earth's surface.

Moreover, the places whose values are to be shown should lie adjacent to one another, so that they can all appear on the same map.
Statistical maps are thus the maps designed to show the values of the spatial distribution of geographical events (phenomena).
Or
The maps designed to show the spatial distribution of certain geographical events in a quantitative manner, and in turn allow quantitative analysis.

The main statistical maps which allow quantitative analysis include the following:
i. Choropleth maps
ii. Dot maps
iii. Flow line maps
iv. Isopleth maps

Choropleth maps
These are the statistical maps which use a system of varied shade textures to illustrate the density of the spatial distribution of a certain phenomenon for places. These maps are mostly designed to show the population density of places. On the map, places with a similar shade texture have almost the same distribution density.

Construction of the choropleth map
i. Obtain the base map with a suitable scale. The map should have the boundaries of the administrative areas. The scale is used to assess the size of the administrative areas, which are then related to the amount of distribution.
ii. Obtain the data and summarize them in a table. The table should show clearly the names of the administrative areas, their area size and their amount of distribution.
iii. Determine the density values of distribution for the areas.
iv. Group the worked densities using a regular interval. More than one class should be selected, and every class should contain some of the worked densities.
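The density and grouping steps can be sketched as follows. This is an illustration under stated assumptions: it uses the Kenya province figures from the example in this section, a class interval of 100 with an open top class from 500, and a hypothetical helper named density_class.

```python
# (population, land area in km2) by province, from the example in this section.
provinces = {
    "Nairobi": (2_143_254, 696),
    "Central": (3_724_159, 13_220),
    "Coast": (2_487_264, 82_816),
    "Eastern": (4_631_779, 153_473),
    "North Eastern": (962_143, 128_124),
    "Nyanza": (4_392_196, 12_547),
    "Rift valley": (6_987_036, 182_539),
    "Western": (3_358_776, 8_264),
}


def density_class(density, interval=100, top=500):
    """Return the label of the regular class interval that a density falls in."""
    if density >= top:
        return f"{top} and above"
    lower = int(density // interval) * interval
    return f"{lower}-{lower + interval - 1}"


# Density = population divided by land area (people per km2).
densities = {name: pop / area for name, (pop, area) in provinces.items()}
classes = {name: density_class(d) for name, d in densities.items()}

for name in provinces:
    print(name, round(densities[name], 1), classes[name])
```

Each class label then corresponds to one shade texture on the map, so provinces that share a label are shaded alike.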
It is also important that the classes should not be numerous.

Example:-
Use the following data to prepare the choropleth map:-

Province         Population    Land area (km2)
Nairobi          2,143,254     696
Central          3,724,159     13,220
Coast            2,487,264     82,816
Eastern          4,631,779     153,473
North Eastern    962,143       128,124
Nyanza           4,392,196     12,547
Rift valley      6,987,036     182,539
Western          3,358,776     8,264

Calculation of the population densities for the regions.
The population density of each province is its population divided by its land area (people per km2).
The suggested interval is 100, and thus the groups are: 0-99, 100-199, 200-299, 300-399, 400-499, and 500 and above.
With regard to the groups, the choropleth map appears as follows:

KENYA: POPULATION DENSITY BY PROVINCE 1999

Advantages of the choropleth map
i. It is the most suited to show the distribution values of a certain geographical phenomenon in relation to area size, i.e. it is the most suited to show densities over space.
ii. The data can be analyzed quantitatively from the map.
iii. It gives people a visual idea (impression) of the varied densities of distribution.

Disadvantages of the choropleth map
i. The shades on the map may hide the political boundaries.
ii. It is tedious to construct, as it involves many values, i.e. preparation takes much time.
iii. If no topographical map is provided, the map may give a wrong picture of the distribution of the phenomenon.
iv. A problem may occur in deciding the varied shade textures to be used on the map.
v. The map may suggest abrupt changes of distribution from one area to the next, which is not realistic.
vi. It is not possible to obtain absolute values or exact densities from the map, because the shades represent categories of densities.
vii. It is not possible to insert additional details on the map.

Dot maps
A dot map is a form of statistical map which uses dots of fixed size to show the spatial distribution of a certain geographical phenomenon, such as people or cattle.
Or
A map which shows the spatial distribution of numerical quantities using dots.
A dot is the simplest symbol used in representing quantities on maps.
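Because every dot stands for the same fixed amount, the number of dots for an area is its quantity divided by the dot value. A minimal sketch, using the Kenya province populations from this section's dot map example and its dot value of 100,000 people; rounding to the nearest whole dot is an assumption, since a fraction of a dot cannot be drawn.

```python
DOT_VALUE = 100_000  # people represented by one dot

# Population by province, from the dot map example in this section.
population = {
    "Nairobi": 2_143_254,
    "Central": 3_724_159,
    "Coast": 2_487_264,
    "Eastern": 4_631_779,
    "North Eastern": 962_143,
    "Nyanza": 4_392_196,
    "Rift valley": 6_987_036,
    "Western": 3_358_776,
}

# Number of dots = amount of distribution divided by the dot value,
# rounded to the nearest whole dot (assumed rounding rule).
dots = {name: round(people / DOT_VALUE) for name, people in population.items()}

for name, count in dots.items():
    print(name, count)
```

Trying a much larger DOT_VALUE here shows the problem described in the construction steps: small areas drop to very few dots or none at all.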
Each dot represents the same fixed amount.

Construction of a dot map
1. Consider the base map given. It should show the boundaries of the administrative areas clearly.
2. Obtain the data and summarize them in a table. The table should show the names of the administrative areas and their amount of distribution for the phenomenon.
3. Decide on the dot value. The dot value should be neither too small nor too large. If it is too large, regions with a small amount of distribution may receive no dots, giving the impression that those areas are less occupied than they are. If it is too small, the dots may overlap. The dot value should therefore be reasonable.
4. Determine the number of dots to be allocated to each administrative area on the map by dividing the amount of distribution by the dot value.
5. Insert the dots on the map accordingly. All dots should be the same size and be evenly distributed.

Example:

Province         Population
Nairobi          2,143,254
Central          3,724,159
Coast            2,487,264
Eastern          4,631,779
North Eastern    962,143
Nyanza           4,392,196
Rift Valley      6,987,036
Western          3,358,776

Procedure:
Dot value determination
According to the given data, 1 dot represents 100,000 people.
Number of dots determination
Number of dots = amount of distribution ÷ dot value. For example, Nairobi: 2,143,254 ÷ 100,000 ≈ 21 dots.

Fig. 1.3 Kenya: Population by provinces, 1999 (1 dot represents 100,000 people)

Advantages of dot maps
- The data can be analyzed quantitatively from the map.
- It is easy to find the amount of distribution in each area from the number of dots present and the dot value.
- Preparation of the map is fairly easy.
- The map provides a visual impression.
- They are the most widely used statistical maps for showing distribution.

Disadvantages of dot maps
- Dot maps are prone to double counting during quantitative analysis.
This gives a wrong quantitative picture.
- If no topographical map is provided, the map may give a wrong picture of the distribution of the phenomenon.
- With dot values that are too large or too small, problems may occur in representing the distribution on the map.
- Fractional values cannot be represented on the map.
- Drawing many dots of uniform size is difficult. Special pens may be needed for this purpose.

FLOW LINE MAPS
These are maps that illustrate the volume of goods or the number of vehicles, people, cattle, etc. moving between points or areas along established routes such as roads, railways, canals, or air and sea routes.
Or
A statistical map designed to show the movement of a geographical phenomenon from one place to another along an established route such as a road, railway, waterway or airway.

On a flow line map, a line shows the direction of the movement, while the amount of movement is shown by the width of the line. The character of the movement can be shown by varied shade textures or colours.
Note that the direction of the movement and the distance involved have no bearing on the quantities shown.

Construction of the flow line map
1. Draw the base map of the routes.
2. Assess the data given. The data should give the names of the check points (stations) along the route and the amount of movement between them.
3. Decide the width scale value, taking into consideration the highest and lowest values. Avoid scale values that are too large or too small: too large a scale value produces very fine flow lines, and too small a scale value produces very wide ones.
4. Using the chosen scale, draw the flow lines along the routes on the map.

Example:
Use the data and map given to show the amount of movement of passengers between the check points along the routes.
Check points    Passengers
A – B           10,000
B – C           8,000
B – D           7,000
B – E           7,000
E – F           3,000
E – G           2,000

Procedure
Scale value determination: along the flow line, 1 mm of width represents 1,000 passengers. For example, the A – B line (10,000 passengers) is drawn 10 mm wide.
The flow line map for the data given appears as follows.

Advantages of the flow line map
- The map is most useful for showing the amount (volume) of movement between check points along the routes.
- The data on the map can be analyzed quantitatively from the width of the flow lines and the scale value.
- It provides a visual impression.
- Calculating and drawing it is fairly easy once the scale value has been decided.

Disadvantages of the flow line map
- A wide variation between the highest and lowest values makes it difficult to choose the scale value.
- The volume (amount) of movement cannot be read exactly from the map.
- Difficulty may arise in drawing double flow lines.
- Very small values are often not represented accurately on the map.

ISOPLETH MAPS
An isopleth map is a form of statistical map that uses a system of lines to show the amount of distribution of a phenomenon. The lines on the map are drawn to connect points of equal value and are called isolines.
Isopleth maps are also called isoline maps, isarithm maps and isometric maps.
Examples of isopleth maps include relief maps with contours; meteorological maps showing atmospheric pressure, rainfall, temperature, etc.; and maps that show the depth of water bodies.
The isolines drawn on isopleth maps have special names according to what they show:
- Isotherms – temperature
- Isobars – atmospheric pressure
- Isohyets – rainfall
- Isonephs – cloudiness
- Isobaths – ocean depth
- Isohalines – salinity

Construction of isopleth maps
1. Obtain the outline base map and the appropriate data, and mark the points and their values on the map in pencil.
2. Decide the interval to be used.
3. Select the critical values.
These are the values that correspond to (match) the chosen interval.
4. Join the critical values with smooth lines according to the chosen interval.

Advantages of isopleth maps
- It provides a good visual impression if it is well presented.
- It is useful for showing the distribution of phenomena, particularly climate.
- The map preparation is fairly easy.
- It can be analysed qualitatively.

Disadvantages of isopleth maps
- It is time consuming to prepare, especially the drawing.
- It is difficult to quantify the data presented.
- It needs a high level of skill to interpret the data.
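
Worked calculations for the map examples
The arithmetic behind the choropleth, dot map and flow line examples in these notes can be sketched in Python. This is only a minimal sketch and not part of the original notes: the function names are hypothetical, and rounding the number of dots to the nearest whole dot is an assumption (the notes only say that fractional values cannot be shown). The data are the Kenya 1999 figures and the passenger counts given in the examples.

```python
# Worked arithmetic for the three map examples in these notes.
# Function names and the rounding rule are illustrative assumptions.

# Kenya 1999 data from the choropleth example:
# province -> (population, land area in km2)
PROVINCES = {
    "Nairobi": (2_143_254, 696),
    "Central": (3_724_159, 13_220),
    "Coast": (2_487_264, 82_816),
    "Eastern": (4_631_779, 153_473),
    "North Eastern": (962_143, 128_124),
    "Nyanza": (4_392_196, 12_547),
    "Rift Valley": (6_987_036, 182_539),
    "Western": (3_358_776, 8_264),
}

# Passenger counts between check points from the flow line example
FLOWS = {"A-B": 10_000, "B-C": 8_000, "B-D": 7_000,
         "B-E": 7_000, "E-F": 3_000, "E-G": 2_000}


def density(population, area_km2):
    """Population density in people per square kilometre."""
    return population / area_km2


def density_class(value, interval=100, top=500):
    """Label of the class a density falls in: 0-99, 100-199, ..., 500 and above."""
    if value >= top:
        return f"{top} and above"
    lower = int(value // interval) * interval
    return f"{lower}-{lower + interval - 1}"


def dot_count(amount, dot_value=100_000):
    """Number of dots for an area, rounded to the nearest whole dot (assumed rule)."""
    return round(amount / dot_value)


def line_width_mm(amount, scale_value=1_000):
    """Flow line width in millimetres when 1 mm represents `scale_value` units."""
    return amount / scale_value


if __name__ == "__main__":
    for name, (pop, area) in PROVINCES.items():
        d = density(pop, area)
        print(f"{name:14s} {d:8.1f} per km2  class {density_class(d):14s} dots {dot_count(pop)}")
    for route, passengers in FLOWS.items():
        print(f"{route}: {line_width_mm(passengers):.0f} mm wide")
```

Changing `interval`, `dot_value` or `scale_value` shows how the choice of class interval, dot value or width scale changes what ends up on the map, which is the decision each construction procedure asks for.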