Spring 2023 - MBB 110 D100
Data Analysis for Molecular Biology and Biochemistry (3)
Class Number: 5961
Delivery Method: In Person
Introductory data analysis focusing on molecular biology data sets and examples and including basic programming skills using Python and basic statistics skills using R. Students with credit for MBB 243 may not take this course for further credit. CMPT 120 will be accepted in lieu of MBB 110.
The purpose of this introductory data analysis course is to teach students in molecular biology or any students who will analyze molecular data, basic knowledge of molecular biology data types, data analysis methods including basic programming skills using Python, and basic statistics skills using R.
Lecture 1 Molecular biology data: flavours and common file formats
Lecture 2 Molecular biology data: genomes and annotation
Lecture 3 Fundamentals of R and Python
Lecture 4 Getting/parsing/manipulating sequence data using shell, R and Python
Lecture 5 Regular expressions and patterns
Lecture 6 Quantitative DNA/RNA sequence analysis
Lecture 7 Genome-scale data analysis
Lecture 8 Plotting with ggplot2
Lecture 9 Techniques for tidying, merging and manipulating tabular data
Lecture 10 Basic modeling
Lecture 11 Visualization of genomic annotations
Lecture 12 Advanced data visualization methods
Lab 1 Shell competency 1: manipulating and searching plain text formats
Lab 2 Shell competency 2: installing, managing and running command-line utilities
Lab 3 Basic data types and operations in Python and R
Lab 4 Conditionals and control flow in Python and R
Lab 5 Application of regular expressions in Python and R
Lab 6 Quantitative analysis of DNA and protein sequences
Lab 7 Tidy data, data frames and data tables
Lab 8 Plotting genomic and other flavours of data
Lab 9 Cleaning and combining data sets
Lab 10 Creating and visualizing models
Lab 11 Fundamentals of Bioconductor
Lab 12 Selected topics
- In class lab tasks: In each lab, there is a list of tasks that should be accomplished in class. Results are submitted by the end of each lab. 20%
- Lab assignments: Short assignments will be handed out in lab sessions and will be due at the start of your lab one week later, unless indicated otherwise. There is a 10% per day late penalty for assignments received after the due date time. 35%
- Midterm and final exams - a mixture of multiple choice, short answer and written questions. (10% for midterm exam and 25% for final exam) 35%
- Participation 10%
Students with credit for CMPT 102, 120, 125, 126, 128 or 130 may not take this course for further credit.
Python for Biologists. Martin Jones. 2013. CreateSpace Independent Publishing Platform.
REQUIRED READING NOTES:
Your personalized Course Material list, including digital and physical textbooks, are available through the SFU Bookstore website by simply entering your Computing ID at: shop.sfu.ca/course-materials/my-personalized-course-materials.
Department Undergraduate Notes:
- For help with writing, learning and study strategies please contact the Student Learning Commons at
- Students requiring accommodations as a result of a disability, must contact the Centre for Accessible Learning (778-782-3112 or e-mail: firstname.lastname@example.org)
ACADEMIC INTEGRITY: YOUR WORK, YOUR SUCCESS
SFU’s Academic Integrity website http://www.sfu.ca/students/academicintegrity.html is filled with information on what is meant by academic dishonesty, where you can find resources to help with your studies and the consequences of cheating. Check out the site for more information and videos that help explain the issues in plain English.
Each student is responsible for his or her conduct as it affects the university community. Academic dishonesty, in whatever form, is ultimately destructive of the values of the university. Furthermore, it is unfair and discouraging to the majority of students who pursue their studies honestly. Scholarly integrity is required of all members of the university. http://www.sfu.ca/policies/gazette/student/s10-01.html