About the Project

As a senior student majoring in Data Science at Brigham Young University-Idaho, it is expected that I show and apply the skills I’ve learned in a capstone project. Because I’ve not yet been able to work with business datasets per se (and rather my projects have been with other datasets such as house pricing, weather temperature, heights, etc), I want to use this senior project as a way to show what I can do for a business. This senior project will encompass three of the most common and helpful techniques of data science: exploratory data analysis (commonly known as EDA), creating a regression model, and creating a classification model.

The data source can be found online (https://github.com/amitbasuri/Adventure-Works-Cycle or on Kaggle).

The three sections will incorporate parts of Pythons because I feel I lack the knowledge of how to apply Python in data science problems, and it’s a language that is highly demanded for data science jobs.

About the Company

In 1998, the Adventure Works Cycles company collected a large volume of data about their existing customers, including demographic features and information about purchases they have made. The company is particularly interested in analyzing customer data to determine any apparent relationships between demographic features known about the customers and the likelihood of a customer purchasing a bike. Additionally, the analysis should endeavor to determine whether a customer’s average monthly spend with the company can be predicted from known customer characteristics.