Unit 1: Introduction to Statistics and Data Presentation

Table of Contents

1.1 Definition, Scope, and Limitations of Statistics

Definition of Statistics

Statistics is the science of collecting, organizing, analyzing, interpreting, and presenting data. It provides tools and methods to find patterns, make decisions, and deal with uncertainty.

Scope of Statistics

Statistics is used in almost every field:

Limitations of Statistics

1.2 Population and Sample

1.3 Types of Data

By Source

By Nature

By Time

1.4 Presentation of Data (Tables and Diagrams)

Tabulation

Organizing data into rows and columns in a table. A good table has a clear title, labeled columns and rows, and (if needed) a source note.

Diagrams

Visual representations of data. The choice of diagram depends on the type of data.

1.5 Frequency Distributions

A table that organizes raw data by summarizing the number of times (frequency) each value or group of values (class interval) occurs.

For Discrete Data

Simply list the values and their frequencies.

Number of Siblings (x)Frequency (f)
015
130
225
3+10

For Continuous Data

Data is grouped into class intervals.

1.6 Graphical Representation of Frequency Distributions

Histogram

The most common graph for a continuous frequency distribution. It uses bars to represent frequency, but unlike a bar chart, the bars are adjacent (touching) to represent the continuous nature of the data. The x-axis is marked with class boundaries.

Unequal Class Intervals: If class widths are unequal, you must plot Frequency Density on the y-axis, not frequency.
Frequency Density = Frequency / Class Width
In this case, the area of the bar represents the frequency.

Frequency Polygon

A line graph formed by connecting the midpoints of the tops of the bars in a histogram. It gives a clearer picture of the shape of the distribution.

Cumulative Frequency Curves (Ogive)

An Ogive (or Ogive curve) is a graph of a cumulative frequency distribution. It is very useful for finding the median and other partition values (like quartiles).

Finding the Median: The median is the x-value (on the horizontal axis) corresponding to the intersection point of the "Less Than" and "More Than" Ogives.