Portfolio

AWS - SparkML Tip or Not

NYC Taxi

Methodologies & Tools

Programming Language | PySpark
Methodologies | Big Data, AWS(EMR), SQL, SparkML

Introduction
merge-data.ipynb: in this notebook a merged and cleansed datasetand is created and stored it in your S3 bucket.

model-data.ipynb: in this notebook the merged dataset is trained with models to predict wether a trip will receive a tip or not.

Live Preview or GitHub Repo

Get in Touch!

HAVE A GREAT OPPORTUNITY FOR ME? |
Feel free to drop me a line

Contact me!