Data Science Projects
Performed Classification for Breast Cancer Diagnostics Prediction (Kaggle)| Nov‘20
Using seaborn, matplotlib , scikit learn implemented SVM and predicted accuracy 97% to find better fit model.
Performed Regression Analysis for House Sales Price Prediction (Kaggle)| Oct‘20
Using seaborn, matplotlib , scikit learn, statsmodels, implemented linear regression and predicted lower RMSE to find better fit model.
Performed Exploratory Data Analysis on Students Performance in Exams (Kaggle)| Sep ‘19
Using seaborn and matplotlib , implemented EDA on collected marks secured by the students in various subjects.
Predict the onset of diabetes based on diagnostic measures (Kaggle)| Oct ‘18
Implemented models using Random Forest and Decision Tree algorithms and predicted the accuracy of 77.83%.
Predict behaviour to retain customers for Telecom Churn problem (Kaggle)| Sep ‘18
Generated models with Random Forest and Decision Tree algorithms and predicted the accuracy of 79.57%.
Performed Exploratory Data Analysis on NYC Police Parking Tickets |July ‘18
Using Big Data Spark, implemented EDA on collected data for parking tickets of NYC Police Department.
Data Ingestion and Processing using trip data of TLC |June ‘18
Using HIVE, performed detail Trip level data analysis on New York City Taxi & Limousine Commission (TLC) data.
Forecasting the sales and demand for an online Retail company |May ‘18
Deployed Time Series algorithm using Classical Decomposition and ARIMA and identified most profitable segment & market
Classifying the handwritten digits based on the pixel values given as features.| Apr ‘18
Deployed Support Vector Machine algorithms and performed with 95.73% Accuracy
Identifying key factors to reduce the employee attrition rate | Jan ‘18
Employed logistics regression algorithm to predict employee attrition accurately by 74%
List of projects are available in the following tableau public url
Power BI Project
Blogs are related to Power BI project