STATISTICS
– DESCRIPTION AND SPREAD
It
is good valuation practice to provide a descriptive analysis of sale or
rental data for inclusion in the valuation report. Basic statistical
analysis used as a descriptive tool is particularly useful in this
regard. Statistics 1 covers the use of measures of central tendency and
spread. This will be done by using an example:
EXAMPLE
The following are the
residential sales analyzed in a suburb in which the subject comparable
property is being valued:
SUBURB: Summerton
Number of sales: 19
$'000
250 300 350 350 400
400 400 450 450 450 500 500 600 650 700 750 1000 1200 1500
How should such data be included in the valuation report? A very
effective method is to summarize it as a frequency distribution. The
following commands shows how this is done using Excel spreadsheet
commands:
PREPARING A FREQUENCY HISTOGRAM IN EXCEL
On spreadsheet:
1. Enter data to be
analyzed
2. Enter bin values: 100 300 500 700 900 1100 1300 1500 1700
<Options><Analysis tools><Histogram>
Enter input range
Enter bin range
Enter output range
<Enter>
Excel will construct the following table showing the frequency values,
cumulative totals and a histogram as a chart. If the pareto
chart option is chosen an additional table will be constructed on the
right of the frequency table:
TABLE 1
Bin Frequency
Cumulative %
Bin Frequency Cumulative % 100 1 5.26% 300 9 47.37%
300 9 52.63% 500 4
68.42% 500
4 73.68% 700 2 78.95% 700 2 84.21% 100 1 84.21% 900 1 89.47% 900 1 89.47%
1100 1 94.74% 1100 1
94.74% 1300
0 94.74% 1500 1 100.00% 1500 1 100.00% 1300 0
100.00% 1700 0 100.00% 1700 0 100.00%
The subsequent pareto chart is shown in the diagram below. It
is
useful in that it shows the progressive accumulation of sale data from
the highest column at the left to the lowest column at the right. The
right hand table the is the pareto table in which the
frequencies are ranked in ascending order, according to frequency.
EXCEL
- TO SEE HISTOGRAM
<Window><Chart 1>
DIAGRAM 2
The distribution in
diagram 2 is "non symmetrical" being "skewed" to the right. Histograms
should be analyzed according to their deviation from the centre and
shape.
RULES FOR HISTOGRAMS
- Classes must
be even in width ie, the bins must have a constant range. In table 2
this is 200.
- Class
size should be the correct size to show the overall trend. If it is too
large or too small, the trend will be lost. A class which is too small
will hide the information.
- Do NOT
use 3D histograms and graphs. The 3D perspective makes the reading of
information and visual interpretation much more difficult compared to
the normal 2 dimensional histograms.
COMPARING
WITH ANOTHER SUBURB: Williamville
The valuer considers that another suburb Williamville, is comparable
with Summerton for valuation purposes. Sales from the two suburbs can
be compared to see whether or not there is comparability between the 2
samples.
SALES FOR WILLIAMVILLE
$'000
100 200 250 300 300
350 400 450 450 450 500 550 550 600 600 650 700 700 750
If above data results
in the frequency histogram shown in the diagram below:
Although the diagrams
for the two suburbs have a similar centre line, the sales for
Williamville are more concentrated around the centre, that is, there is
less spread. Prima facie, the mean value of the sales in
Wikkiamville are more reliable than for Summerton. However, the use of
stemplots is a better way to compare two sales data.
See stemplots
DESCRIBING DATA
Diagrammatic analysis
is very useful and should be an integral part of the valuation report
however, for further analysis, quantitative measures are required. The
most common quantitative measures are the measures of centrality; MEAN,
MEDIAN and MODE.
See mean
See median
See mode
MEASURES OF CENTRALITY FOR SUMMERTON AND WILLIAMVILLE
MEAN MEDIAN MODE
SUMMERTON: 589.4 450
300-500
WILLIAMVILLE: 465.7 450 300-500
The 3 measures together provide a good description or analysis of the
sale data. Although the means are substantially different the medians
and modes are the same. Therefore, some other measure of central
tendency would be necessary to further analyse the sets of data.
Although the above measures are useful, measures of centrality do not
measure spread or range. The following useful statistics add a great
deal to the analysis by measuring the spread of the sale data around
the mean.
See quartiles
MAXIMUM
AND MINIMUM VALUES - THE RANGE
Adding MAXIMUM and MINIMUM values provides a very good picture of the
spread of the data. For unsorted data these values can be readily found
with the following spreadsheet commands:
Excel: =MIN(cell
addresses) =MAX(cell addresses)
The range provides a
good measure of spread and the valuer would feel more confident about
using data with a small range than data with a high range.
For the sales in
Summerton:
RANGE = 1500 - 250 = 1250(000)
See box plot
See standard deviation
See Z scores
6