Henry's EdTech: Day 34: Exploratory Factor Analysis (EFA) in SPSS

Day 34: Exploratory Factor Analysis (EFA) in SPSS – Identifying Underlying Constructs

Welcome to Day 34 of your 50-day SPSS learning journey! Today, we’ll explore Exploratory Factor Analysis (EFA), a technique used to identify latent constructs (hidden variables) that explain patterns in your data. EFA is commonly used in survey research, psychology, social sciences, and marketing to group related items into meaningful factors.

What is Exploratory Factor Analysis (EFA)?

Exploratory Factor Analysis (EFA) is a statistical method used to:

Reduce many observed variables into a smaller number of underlying factors.
Identify relationships among variables and detect latent constructs.
Determine which variables group together, forming meaningful dimensions.

For example:

In a customer satisfaction survey, EFA can group items into Service Quality, Product Value, and Customer Support.
In psychological research, EFA can reveal factors like Extraversion, Conscientiousness, and Neuroticism from personality traits.

When to Use EFA?

Use Exploratory Factor Analysis when:

You suspect underlying factors exist but don’t know their exact structure.
You need to reduce the number of variables for further analysis.
Your dataset contains correlated variables measuring related concepts.

Key Assumptions of EFA

Adequate Sample Size: At least 5-10 cases per variable for reliable results.
Multicollinearity: Variables should be correlated but not perfectly correlated.
Kaiser-Meyer-Olkin (KMO) Test: Measures sampling adequacy (values above 0.6 are acceptable).
Bartlett’s Test of Sphericity: Tests whether the correlation matrix is suitable for factor analysis (p < 0.05 indicates suitability).

How to Perform Exploratory Factor Analysis in SPSS

Step 1: Open Your Dataset

For this example, use the following survey data measuring employee job satisfaction:

ID	Work_Env	Salary	Growth	Benefits	Workload	Job_Security	Manager_Support
1	8	7	6	8	5	7	9
2	6	6	5	7	4	6	8
3	9	8	7	9	6	8	9
4	7	6	5	7	5	6	8
5	8	7	6	8	5	7	9

The goal: Identify key job satisfaction factors (e.g., Work Conditions, Compensation, Career Growth).

Step 2: Access the Factor Analysis Tool

Go to Analyze > Dimension Reduction > Factor.
A dialog box will appear.

Step 3: Define Variables

Move all seven job satisfaction variables (Work_Env, Salary, Growth, Benefits, Workload, Job_Security, Manager_Support) to the Variables box.
Click Descriptives, then check:
- KMO and Bartlett’s Test to assess suitability.
- Anti-Image Correlation to check multicollinearity.

Step 4: Select Extraction Method

Click Extraction:
- Choose Principal Axis Factoring (better for non-normally distributed data) or Principal Components (default).
- Set Eigenvalues > 1 to retain meaningful factors.
- Check Scree Plot to visualize the number of factors.
Click Continue.

Step 5: Select Rotation Method

Click Rotation:
- Choose Varimax (orthogonal rotation for independent factors) or Oblimin (correlated factors).
Click Continue, then OK.

Interpreting the Output

1. KMO and Bartlett’s Test

KMO > 0.6: Indicates sufficient sample adequacy for factor analysis.
Bartlett’s Test p < 0.05: Confirms correlations among variables.

2. Total Variance Explained Table

Eigenvalues > 1: These components explain significant variance.
Example: If two factors explain 75% of the variance, these two dimensions capture most of the information.

3. Scree Plot

Identifies the ideal number of factors (look for the "elbow" where eigenvalues level off).

4. Rotated Factor Matrix

Shows which variables belong to which factors (loadings above 0.4 are considered strong).
Example output:

Variable	Factor 1 (Compensation)	Factor 2 (Work Conditions)
Salary	0.85	0.20
Benefits	0.82	0.25
Job_Security	0.78	0.30
Work_Env	0.10	0.88
Manager_Support	0.15	0.85
Workload	0.30	0.75
Growth	0.50	0.60

Interpretation:

Factor 1 (Compensation): High loadings for Salary, Benefits, Job_Security.
Factor 2 (Work Conditions): High loadings for Work_Env, Manager_Support, Workload.

This suggests job satisfaction has two primary factors: Compensation and Work Environment.

Practice Example: Perform EFA

Use the following dataset of customer experience survey responses:

ID	Service_Quality	Response_Time	Ease_of_Use	Price_Fairness	Reliability	Trust
1	8	7	6	9	8	7
2	6	6	5	7	7	6
3	9	8	7	9	9	8
4	7	6	5	8	7	7
5	8	7	6	9	8	7

Perform EFA to identify key factors in customer experience.
Use KMO and Bartlett’s Test to check suitability.
Interpret the Rotated Factor Matrix to define meaningful customer experience dimensions.

Common Mistakes to Avoid

Using Too Many Factors: Retain factors with eigenvalues >1 or use the scree plot to determine the ideal number.
Ignoring Rotation: Varimax improves interpretation by making factor loadings clearer.
Forgetting KMO and Bartlett’s Test: Ensure your data is suitable for factor analysis before proceeding.

Key Takeaways

Exploratory Factor Analysis (EFA) identifies hidden constructs that explain variable relationships.
Scree plots and eigenvalues help determine the optimal number of factors.
Rotated Factor Matrix reveals which variables group together into meaningful factors.

What’s Next?

In Day 35 of your 50-day SPSS learning journey, we’ll explore Confirmatory Factor Analysis (CFA) in SPSS. You’ll learn how to test whether a predefined factor structure fits your data. Stay tuned for more advanced statistical techniques!