Approx-SMOTE: fast SMOTE for Big Data on Apache Spark
Updated Apr 27, 2022 - Scala
Implementation of SMOTE (Synthetic Minority Over-sampling Technique) in Spark ML / MLlib
SMOTE-MR: a distributed Synthetic Minority Oversampling Technique (SMOTE) for Big Data that applies a MapReduce-based approach. SMOTE-MR is categorized as an `approximate / non-exact` solution. There is also an `exact` solution, SMOTE-BD, written by the same author (see: https://github.com/majobasgall/smote-bd)
Utility for common use cases and bug workarounds in Apache Spark 2