Unit 4: Introduction to Data Processing and Data Analysis

Table of Contents

1. Introduction to Data Processing

Data processing is the crucial "middle step" between collecting data (Unit 3) and analyzing it. It involves converting raw, messy data into a clean, organized, and machine-readable format.

Key Steps in Data Processing:

  1. Editing:
    • Checking the raw data (questionnaires, schedules) for errors, inconsistencies, or missing answers.
    • Example: Finding a respondent who ticked both "Yes" and "No," or who listed their age as "150," and deciding how to handle it (e.g., discard, or mark as "missing").
  2. Coding:
    • The process of assigning numerical codes to non-numerical answers. This is essential for statistical analysis.
    • Example:
      • Question: "What is your gender?" (Male = 1, Female = 2)
      • Question: "How do you feel about the economy?" (Very Good = 1, Good = 2, Bad = 3, Very Bad = 4, Don't Know = 99)
  3. Tabulation:
    • Organizing the coded data into tables to see the patterns.
    • Simple Tabulation: A simple frequency count (e.g., "How many people said 'Good'?").
    • Cross-Tabulation: A more complex table that compares two variables (e.g., "What was the opinion of *men* vs. *women* on the economy?").

2. Introduction to Data Analysis

Data analysis is the process of *interpreting* the processed data to find patterns, answer research questions, and discover insights.

There are two broad types of analysis: **Quantitative** (deals with numbers) and **Qualitative** (deals with words, meanings, and interpretations).

3. Content Analysis

Definition: Content analysis is a research method used to analyze the content of communications in a systematic and quantitative way. It turns qualitative text data into quantitative, countable data.

4. Discourse Analysis

Definition: Discourse analysis is a research method used to study language *beyond* the level of the sentence. It is a qualitative method that analyzes *how* language is used in social contexts to create meaning and power.

5. Comparison: Content vs. Discourse Analysis

This is the key distinction for this unit.

The simplest way to think about it: If you analyze a news article...

Feature Content Analysis Discourse Analysis
Method Quantitative (counting) Qualitative (interpreting)
Focus Frequency of words/themes (Manifest content) Underlying meaning, assumptions, power (Latent content)
Research Question "How many?" (e.g., How many women are in ads?) "How?" and "Why?" (e.g., *How* are women portrayed in ads?)
Goal To be objective and systematic. To be interpretive and critical.

6. Exam Corner: Key Distinctions

Common Exam Questions:

How to Answer the "Difference" Question:

Use the table above. The key is **Quantitative vs. Qualitative**. Content analysis *counts*, while Discourse analysis *interprets*.

Example to use:

"If analyzing a political manifesto, content analysis would create a table counting how many times 'farmers' or 'industry' are mentioned. Discourse analysis would study *how* the manifesto talks *about* farmers (e.g., as 'hard-working patriots' or as a 'problem to be solved') and what political purpose that language serves."