The raw data consisted of 690,000 randomly collected geotagged tweets across the City of Vancouver generated by TwitterAPI. There is no organization or any pattern in this giant dataset. The aim for us was to use these tweets to explore the eating habits and potential obesity risks lying in the people of the City of Vancouver. LEXICON Tool Used: Wordnet, Twitter, Google Knowledge Graph During this large process of analyzing Big Data, the first and most important step in this project was to create a keyword searching lexicon. With this keyword lexicon, we could then extract or “massage” the raw data to a refined categorized dataset. Synonyms were our first consideration. We listed a range of words associated with fat, emotion and skinny and used the online tool “WordNet” to look for any relative terms. WordNet is a large lexical database of English language that groups words together based on their meanings into sets of cognitive synonyms (WordNet, 2015). It considers both formal and informal terms which serve to be particularly effective when analyzing Twitter data. These sets of synonyms are called synsets and are able to link other synsets by means of a small number of “conceptual relations” (WordNet, 2015) thereby producing an even larger relevant lexicon in a short period of time. Figure 1. WordNet search example In addition to the emotion and body image based synsets, the next step was to look for food-related words. We used the Google Knowledge Graph to search for unhealthy foods (e.g. Coke, ice-cream, food with high sugar), healthy foods (e.g. kale, cabbage, types of vegetables) and names of fast food restaurants (e.g. KFC, MacDonald, etc.). This was effective in starting the lexicon with key words which were then searched using Twitter to see if tweets containing those words were indicative of consumption patterns. Any word that was not qualified was eliminated from the database. More word terms could also be found during this validation process that were not captured using the previous methods. Figure 2. Search keyword lexicon created by Excel (Part 1) Figure 3. Search keyword lexicon created by Excel (Part 2) WORD EXTRACTION Tool Used: Excel After creating a complete lexicon database, word extraction was the next big stage. We divided our data into three categories: