The Essential Guide to Stata

The comprehensive guide to Stata! Continuously updated.

Make sure to check out my twitter feed for promo codes and other updates (easystats3).

What you’ll learn

  • An essential introduction to Stata.
  • Data manipulation in Stata.
  • Data visualisation in Stata.
  • Data analysis in Stata.
  • Regression modelling in Stata.
  • Simulation in Stata.
  • Count data modelling.
  • Categorical data modelling.
  • Survival analysis.
  • Panel Data Analysis.
  • Epidemiology.
  • Instrumental Variables.
  • Power Analysis.
  • Difference-in-Differences.

Course Content

  • Introduction –> 1 lecture • 2min.
  • Getting Started –> 6 lectures • 39min.
  • Exploring Data –> 6 lectures • 39min.
  • Manipulating Data –> 14 lectures • 1hr 32min.
  • Visualising Data –> 15 lectures • 1hr 52min.
  • Testing Means, Correlations and ANOVA –> 4 lectures • 26min.
  • Linear Regression –> 11 lectures • 1hr 50min.
  • Categorical Choice Models –> 3 lectures • 31min.
  • Fractional/Proportional Variable Models –> 1 lecture • 15min.
  • Random Numbers and Simulation –> 4 lectures • 24min.
  • Count Data Models –> 5 lectures • 42min.
  • Survival Analysis –> 7 lectures • 52min.
  • Panel Data Analysis –> 6 lectures • 56min.
  • Difference-in-Differences Analysis –> 3 lectures • 46min.
  • Instrumental Variable Regression –> 4 lectures • 59min.
  • Epidemiological Tables –> 5 lectures • 41min.
  • Power Analysis –> 3 lectures • 31min.
  • Basic Matrix Operations –> 4 lectures • 32min.
  • The Big Stata Practice Test –> 0 lectures • 0min.

The Essential Guide to Stata


  • There are no requirements.

Make sure to check out my twitter feed for promo codes and other updates (easystats3).

The Essential Guide to Data Analytics with Stata

Learning and applying new statistical techniques can be daunting experience.

This is especially true once one engages with “real life” data sets that do not allow for easy “click-and-go” analysis, but require a deeper level of understanding of programme coding, data manipulation, output interpretation, output formatting and selecting the right kind of analytical methodology.

In this course you will receive a comprehensive introduction to Stata and its various uses in modern data analysis. You will learn to understand the many options that Stata gives you in manipulating, exploring, visualizing and modelling complex types of data. By the end of the course you will feel confident in your ability to engage with Stata and handle complex data analytics. The focus of each session will consistently be on creating a “good practice” and emphasising the practical application – and interpretation – of commonly used statistical techniques without resorting to deep statistical theory or equations.

This course will focus on providing an overview of data analytics using Stata.

No prior engagement with is Stata needed. Some prior statistics knowledge will help but is not necessary.

The course is aimed at anyone interested in data analytics using Stata.

Like for other professional statistical packages the course focuses on the proper application – and interpretation – of code.

Some basic quantitative/statistical knowledge will be required; this is not an introduction to statistics course but rather the application and interpretation of such using Stata.

Topics covered include:

  1. Getting started with Stata
  2. Viewing and exploring data
  3. Manipulating data
  4. Visualising data
  5. Correlation and ANOVA
  6. Regression including diagnostics (Ordinary Least Squares)
  7. Regression model building
  8. Hypothesis testing
  9. Binary outcome models (Logit and Probit)
  10. Fractional response models (Fractional Logit and Beta Regression)
  11. Categorical choice models (Ordered Logit and Multinomial Logit)
  12. Simulation techniques (Random Numbers and Simulation)
  13. Count data models (Poisson and Negative Binomial Regression)
  14. Survival data analysis (Parametric, Cox-Proportional Hazard and Parametric Survival Regression)
  15. Panel data analysis (Long Form Data, Lags and Leads, Random and Fixed Effects, Hausman Test and Non-Linear Panel Regression)
  16. Difference-in-differences analysis (Difference-in-Difference and Parallel Trends)
  17. Instrumental variable regression (Endogenous Variables, Sample Selection, Non-Linear Endogenous Models)
  18. Epidemiological tables (Cohort Studies, Case-Control Studies and Matched Case-Control Studies)
  19. Power analysis (Sample Size, Power Size and Effect Size)
  20. Matrix operations (Matrix operators, Matrix functions, Matrix subscripting)
Get Tutorial