Spring 2025 - ECON 233 D100
Introduction to Economic Data and Statistics (3)
Class Number: 1823
Delivery Method: In Person
Overview
-
Course Times + Location:
Jan 6 – Apr 9, 2025: Fri, 12:30–2:20 p.m.
Burnaby
-
Instructor:
Brian Krauth
bkrauth@sfu.ca
-
Prerequisites:
MATH 150, MATH 151, MATH 154, or MATH 157, with a minimum grade of C-; 15 units. MATH 150, MATH 151, MATH 154, or MATH 157 may be taken concurrently with ECON 233.
Description
CALENDAR DESCRIPTION:
Introduces statistical methods, concepts and their application to economic data using both spreadsheets (e.g., Excel) and a specialized statistical programming language such as R. Students who have taken ECON 333 first may not then take this course for further credit. STAT 270 or BUS 232 will be accepted in lieu of this course.
COURSE DETAILS:
This course introduces the basics of probability and statistics for economics students. Like BUS 232 or STAT 270, it satisfies the lower-division statistics requirements for the economics major. It differs from these courses in two important ways:
- We will emphasize the use of economic data to answer economic questions.
- We will learn to use both Excel (the standard analysis tool in the business world) and R (a more powerful statistical analysis package that is increasingly used in the business and academic world, and is the main tool used in ECON 333).
Extensive support will be available to help students complete the computer assignments and build their computer skills, so do not shy away from this course just because you lack confidence in your current skills.
Topics:
- Data cleaning and management with Excel
The first (and often the longest) step in any data analysis project is to obtain the necessary data and “clean” it by correcting errors or clarifying ambiguities in the variables, adding new variables, and linking cases (for example, parents with children). We will learn the basic features of Excel and will use Excel to clean an economic data set. In the process, we will also learn some principles and practices for data management and reproducible data analysis. - Probability and random events
To use data to learn about the world, our analysis needs to be statistical in nature: it needs to be situated in a mathematical model of how the data was generated. To do that, we need to learn some theory. We will start by learning the language of probability and tools for thinking rigorously about random events. - Random variables
Roughly speaking, a random variable is a number describing a random outcome: the unemployment rate last month, the score in a baseball game, or a number in our data set. We will apply the concepts of probability to develop terminology and tools for working with random variables. - Basic data analysis with Excel
We will take a break from theory to learn some more applied skills: using Excel to calculate basic statistics and create simple graphs. - Statistics
A statistic is any number calculated from data. If we think of our data set as a set of random variables, then any statistic based on that data is also a random variable. We will learn how to apply the tools of probability, random events, and random variables to statistics that are calculated from data. - Statistical models, parameters, and estimation
We typically use statistics to make guesses (estimate) about some unknown quantity in the real world (a parameter). For example, the slope of a supply or demand curve, the effect of a treatment on a medical condition, or the effect of a policy on an economic outcome. We will use the theory of probability and statistics to think about how our estimate relates to the unknown true value of the parameter of interest, and to construct the most accurate estimate available for a given data set. - Statistical inference
Statistical estimates are always imprecise. Statistical inference is a set of ideas and procedures that we use to account for this imprecision, and to make clear statements about what can and cannot be concluded from the available data. - Advanced Excel
We have learned to use Excel for basic data cleaning and analysis, but Excel also has many tools and techniques that are important in more complex environments. We will learn how to construct lookup tables, complex formulas, and pivot tables, as well as some tools for protecting and managing large or important data sets. - Introduction to R, R Markdown, and the Tidyverse
R is a powerful statistical package that can handle complex data analysis tasks that are impossible or difficult in Excel. We will learn how to write simple R programs and create data-based documents using R Markdown. As in most courses these days that use R, we will use an add-on package for R called the Tidyverse. The Tidyverse simplifies and modernizes the R programming language and its capabilities. - Data management and analysis with R and ggplot
Having learned the basics of R, we next use it to clean data, calculate statistics, and create graphs. The package we will use to create graphs is ggplot, which produces dramatically better-looking and more flexible graphs than Excel. - Prediction and visualization
Sophisticated technologies like machine learning, automatic captioning and tagging, and large language models (e.g. GPT/ChatGPT) use very large data sets to form statistical models and make predictions. While these specific techniques are out of scope for this introductory course, the principles of using data to construct prediction models are not. We will finish the semester by learning some introductory concepts and tools for prediction models, including linear regression which is used extensively in ECON 333.
Grading
NOTES:
- Weekly Canvas quizzes (10%)
These usually have 5 questions and can be completed in a few minutes. Each quiz has a “second chance” version you can take to increase your grade. - Weekly computer assignments (20%).
These are designed to be (mostly) completed in tutorial. - Two midterms (20% each)
- Final exam (30%)
Bonus points can be earned on quizzes and assignments by consistently attending lecture.
Materials
REQUIRED READING:
Brian Krauth, Introductory Statistics for Economics. Free e-book.
REQUIRED READING NOTES:
Your personalized Course Material list, including digital and physical textbooks, are available through the SFU Bookstore website by simply entering your Computing ID at: shop.sfu.ca/course-materials/my-personalized-course-materials.
Department Undergraduate Notes:
Please note that, as per Policy T20.01, the course requirements (and grading scheme) outlined here are subject to change up until the end of the first week of classes.
Final exam schedules will be released during the second month of classes. If your course has a final exam, please ensure that you are available during the entire final exam period until you receive confirmation of your exam dates.
Students requiring accommodations as a result of a disability must contact the Centre for Accessible Learning (CAL) at 778-782-3112 or caladmin@sfu.ca.***NO TUTORIALS DURING THE FIRST WEEK OF CLASSES***
Registrar Notes:
ACADEMIC INTEGRITY: YOUR WORK, YOUR SUCCESS
SFU’s Academic Integrity website http://www.sfu.ca/students/academicintegrity.html is filled with information on what is meant by academic dishonesty, where you can find resources to help with your studies and the consequences of cheating. Check out the site for more information and videos that help explain the issues in plain English.
Each student is responsible for his or her conduct as it affects the university community. Academic dishonesty, in whatever form, is ultimately destructive of the values of the university. Furthermore, it is unfair and discouraging to the majority of students who pursue their studies honestly. Scholarly integrity is required of all members of the university. http://www.sfu.ca/policies/gazette/student/s10-01.html
RELIGIOUS ACCOMMODATION
Students with a faith background who may need accommodations during the term are encouraged to assess their needs as soon as possible and review the Multifaith religious accommodations website. The page outlines ways they begin working toward an accommodation and ensure solutions can be reached in a timely fashion.