Portfolio

AWS - Reddit Comment Score Prediction

Reddit Logo ROC for Logistic Regression

Methodologies & Tools

Programming Language | PySpark
Methodologies | Big Data, AWS(EMR), SQL, pysparML

Introduction
The Reddit Comments archive contains various information about the Reddit posts, the data collected includes the files/comments information for Oct/Nov/Dec 2018 and Jan 2019. For this mini project, I will mainly focus on predicting how different factors can affect score on public posts.

Live Preview or GitHub Repo

Get in Touch!

HAVE A GREAT OPPORTUNITY FOR ME? |
Feel free to drop me a line

Contact me!