Implemented a full-fledged, end-to-end big data pipeline using Apache Sqoop, HDFS, Apache Spark,
MySQL, and Jupyter Notebook as the base tools. The complete project is implemented in PySpark for performance
and scalability in real-world use. The top products for each customer and the top customers for each
product are identified, which can help a company increase its revenue.
Predicting Future Avocado Prices
Facebook's Prophet is used to forecast avocado prices, with Las Vegas as the example in this project.
The same model can be fitted to any of the other locations present in the dataset.
It is a time-series project, and the same approach can be applied to forecasting the price of other products that have historical price data.
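The forecasting workflow could be sketched roughly as follows. The column names (`Date`, `AveragePrice`, `region`) and the weekly frequency are assumptions about the dataset's layout, and the helper names are hypothetical; only `Prophet()`, `fit`, `make_future_dataframe`, and `predict` are Prophet's own API.

```python
import pandas as pd


def prepare_region_series(prices: pd.DataFrame, region: str) -> pd.DataFrame:
    """Filter one region and rename columns to the ds/y names Prophet expects."""
    subset = prices.loc[prices["region"] == region, ["Date", "AveragePrice"]]
    series = subset.rename(columns={"Date": "ds", "AveragePrice": "y"})
    series = series.assign(ds=pd.to_datetime(series["ds"]))
    # Assumption: there may be several rows per date, so average them
    # into a single price per date (groupby also sorts by date).
    return series.groupby("ds", as_index=False)["y"].mean()


def forecast_prices(series: pd.DataFrame, periods: int = 52) -> pd.DataFrame:
    """Fit Prophet on a ds/y frame and forecast `periods` weeks ahead."""
    # Imported lazily so the preparation helper works without Prophet installed.
    from prophet import Prophet

    model = Prophet()
    model.fit(series)
    # Assumption: the data is weekly, so extend the timeline in weekly steps.
    future = model.make_future_dataframe(periods=periods, freq="W")
    forecast = model.predict(future)
    return forecast[["ds", "yhat", "yhat_lower", "yhat_upper"]]
```

Passing a different region name to `prepare_region_series` is all that is needed to forecast another location from the same data.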
Classification of cancer cells
Cancer cells are classified as `Benign` or `Malignant` based on their
features using a Support Vector Machine (supervised machine learning).
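A minimal sketch of this kind of classifier, assuming scikit-learn is available. The project's dataset is not specified here, so the example uses scikit-learn's bundled Wisconsin breast cancer dataset as a stand-in, and the kernel, `C` value, and feature scaling are illustrative choices rather than the project's actual settings.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Stand-in dataset: 30 numeric cell features, labels malignant (0) / benign (1).
data = load_breast_cancer()
X_train, X_test, y_train, y_test = train_test_split(
    data.data, data.target, test_size=0.2, stratify=data.target, random_state=42
)

# SVMs are sensitive to feature scale, so standardize before fitting.
classifier = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0))
classifier.fit(X_train, y_train)

# Fraction of held-out samples classified correctly.
accuracy = classifier.score(X_test, y_test)

# Map numeric predictions back to their class names.
predicted_labels = data.target_names[classifier.predict(X_test[:5])]
```

Holding out a stratified test split keeps the benign/malignant ratio the same in both sets, which makes the reported accuracy a fairer estimate.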