Skip to content
#

mapreduce

Here are 263 public repositories matching this topic...

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated Mar 20, 2024
  • Python

《大数据挖掘技术》@复旦 课程项目,试图从搜狗实验室用户查询日志数据(2008)中找出搜索记录中有较高支持度关键词的频繁二项集。在实现层面上,我搭建了一个由五台服务器组成的微型 Hadoop 集群,并且用 Python 实现了 Parallel FP-Growth 算法中的三个 MapReduce 过程。

  • Updated Mar 29, 2021
  • Python

Improve this page

Add a description, image, and links to the mapreduce topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mapreduce topic, visit your repo's landing page and select "manage topics."

Learn more