Databricks Community Edition: Apache Spark for All (Ep. 1)

Learn to use the cloud-based Spark platform on Databricks Community Edition, upload data to FileStore, and run your first PySpark/Scala/SQL script

Korkrid Kyle Akepanidtaworn
4 min read · Nov 24, 2018

Disclaimer: This is my personal blog, so anything I post, share, or comment on does not reflect the views of my employer. This article is part of my Databricks series.

Databricks Made Big Data Simple!

Founded in 2013 by the creators of Apache Spark, Databricks helps clients with cloud-based big data processing using Spark. Databricks grew out of the AMPLab project at the University of California, Berkeley, which created Apache Spark, a distributed computing framework written in Scala. It’s a super cool project that has extensive support for Python, R, Scala, and SQL, and it is a good place to start learning Apache Spark.

Image Source: https://databricks.com/blog/2015/11/30/building-a-just-in-time-data-warehouse-platform-with-databricks.html

Introducing Databricks Community Edition

Before 2016, if you were keen on using Apache Spark, there was a free 14-day full-feature trial for customers or businesses to get a feel for how the platform works. However, you do not have to let that feeling hang anymore, because Databricks launched the Community Edition designed for “developers, data scientists, data engineers and anyone who want to learn Spark”. Put simply, anyone can host and share their notebooks free of charge with Databricks. “For those who are already using IPython…

