- Undergraduate
- Prospective Students
- Current Students
- Research Awards & Scholarships
- Undergraduate Research Presentation Award
- CSC Silver Medal Award
- E. J. Wells Chemistry Book Award
- Melanie O'Neill Chemistry Undergraduate Award
- SCI Canada Student Merit Award
- Tony Parsad Award in Chemistry
- Chemistry Undergraduate Scholarship
- TransCanada Pipelines Research Scholarship
- Evelyn and Leigh Palmer Scholarship
- Undergraduate Research
- Graduate
- Research
- Department
- News & Events
- Contact Us
- EDI
Dr. David Wishart
University of Alberta
Chemistry, Big Data, and Machine Learning
Wednesday, September 22, 2021
Room C9002 @ 3:30 p.m.
Host: Dr. Roger Linington
Abstract
Computing has been an integral part of chemistry for more than 60 years. Indeed, the use of computers to draw structures, model molecular dynamics or calculate molecular orbitals is now ubiquitous. More recently, computers have been impacting chemistry in a different way. In particular, chemists are now starting to make greater use of "big" data (i.e. chemistry databases), the internet and techniques such as artificial intelligence and machine learning to make important advances. In this presentation I will describe some of the work that my lab has been doing in the field of big data, machine learning, and chemistry. I will first describe a number of the open-access, web-enabled databases that we have been developing to "house" chemical data. These include DrugBank, the Human Metabolome Database (HMDB), and the Natural Products Magnetic Resonance Database (NP-MRD). I will briefly describe the purpose of each of these databases and what kind of data they house. Next, I will describe how we have developed software to help classify and describe the data in these chemical databases so that they are more machine (i.e. computer) readable and also more understandable by regular people. Then I will describe how we (along with others) have been combining these datasets with machine learning techniques to develop a variety of very fast and accurate tools for predicting chemical properties, chemical reactions, chemical structures, and various molecular spectral features. Finally, I will show how some of these databases and software tools are having significant impacts in the fields of medicinal chemistry, drug discovery, biochemistry, and organic chemistry.