Healthcare Dataset with #PySpark
Back to some #BigData stuff: article about quick-and-dirty development of prediction algorithm using #Spark engine.
Link: https://towardsdatascience.com/healthcare-dataset-with-spark-6bf48019892b
Back to some #BigData stuff: article about quick-and-dirty development of prediction algorithm using #Spark engine.
Link: https://towardsdatascience.com/healthcare-dataset-with-spark-6bf48019892b
Medium
Healthcare Dataset with Spark
Spark is an open source project from Apache. It is also the most commonly used analytics engine for big data and machine learning.
Data Science by ODS.ai 🦜
Hitchhiker’s guide to Exploratory Data Analysis Exploratory Data Analysis — stage of finding out distribution of the data, volume, number of missing values and all the other characteristics of the available dataset. Part 1: https://towardsdatascience.com/hitchhikers…
3 articles on practical #ExploratoryDA in Spark
These articles might be useful for those, who just starting their Hadoop Path as long with those, who want to learn how to create fast-and-dirty dashboards with #Zeppelin
Links:
http://blog.madhukaraphatak.com/statistical-data-exploration-spark-part-1
http://blog.madhukaraphatak.com/statistical-data-exploration-spark-part-2
http://blog.madhukaraphatak.com/statistical-data-exploration-spark-part-3
#Spark #Hadoop #production #BigData
These articles might be useful for those, who just starting their Hadoop Path as long with those, who want to learn how to create fast-and-dirty dashboards with #Zeppelin
Links:
http://blog.madhukaraphatak.com/statistical-data-exploration-spark-part-1
http://blog.madhukaraphatak.com/statistical-data-exploration-spark-part-2
http://blog.madhukaraphatak.com/statistical-data-exploration-spark-part-3
#Spark #Hadoop #production #BigData
Madhukaraphatak
Statistical Data Exploration using Spark 2.0 - Part 2 : Shape of Data with Histograms
Thoughts on technology, life and everything else.