In this project, we designed Machine Learning models such as Linear models, K-Nearest Neighbors (KNN), Support Vector Machine(SVM), XGBoost, Gradient Boosted tree, and Neural Network. Performed Regression and Classification Analysis on a de-identified dataset from a manufacturing company using R programming.
- Performed Exploratory Data Analysis (EDA) on data generated from computer simulations by a manufacturing company.
- Trained, evaluated, and tuned 15 different Machine Learning models of various complexity from a simple linear model to XGboost, Neural Network, SVM, and KNN for both Regression and Classification analysis.
- Evaluated and chose the best model based on their performance using metrics such as RMSE and ROC and identified the important variables that influence the output.
- The files include rmd, HTML, and one ppt files.
- The rmd files are run in R studio to generate HTML files.
- Since the data set is not included, you won't be able to run the rmd files.
- You review the result in the HTML files for each rmd file.