pyspark

Uses Spark to analyze user churn behaviour data from a music app company as users move between paid and free tiers or cancel their subscription altogether. The dataset contains two months of user activity logs.

python pyspark churn-prediction

Updated Jan 8, 2020
HTML

hcvazquez / handling-time-data-engineering

The data engineering process for the handling time problem

python data-science machine-learning scikit-learn jupyter-notebook pyspark data-engineering

Updated Jun 25, 2019

N3ll / PySpark-logistic-regression

bigdata pyspark logistic-regression

Updated Aug 2, 2019
Jupyter Notebook

gtatiya / Twitter-Hashtags-Polarity

This project fetches live Twitter data in a stream and computes the polarity of tweets that contain hashtags

sentiment-analysis pyspark tweepy

Updated Dec 28, 2019
Python

sanogotech / pyspark-examples

PySpark RDD, DataFrame, and Dataset examples in Python

python spark pyspark join pyspark-notebook

Updated Dec 1, 2022
Python

pyspark

Here are 3,387 public repositories matching this topic...

basel-ay / Hands-on-Apache-Spark

zuliani99 / All-Pairs-Docs-Similarity

phricardorj / pyspark-study

JonathanPollyn / Spark

data-miner00 / spark

furkancets / PrescreiberPipelineSpark

simonediluna / Distributed-Data-Analysis-and-Mining

mdh266 / Spark-Practice

lalet / Big-Data-Spark

SmartDataInnovationLab / dirhash

edvardvb / tdt4305-phase2

DouglasFletcher / sparkpython

nagilla-venkatesh / Data-Wrangling-with-MongoDB

jasuncion2 / Learning-Jupyter

HerveMignot / CloudProvisioning

devindatt / Spark-Churn-Analysis

hcvazquez / handling-time-data-engineering

N3ll / PySpark-logistic-regression

gtatiya / Twitter-Hashtags-Polarity

sanogotech / pyspark-examples
