About | Features | Technologies | Requirements | Starting | Author
This repository contains a collection of Spark projects and exercises aimed at refreshing your knowledge of Apache Spark. The projects are designed to cover various use cases and scenarios, allowing you to apply your Spark skills to real-world problems.
To get started, simply clone this repository to your local machine and follow the instructions in the README files for each project. Each project directory contains a separate README file with detailed instructions on how to run the project and what itβs designed to accomplish.
βοΈ Programming in Python;
βοΈ Programming in Scala;
βοΈ Case-Study & Databricks Implementations;
The following tools were used in this project:
Before starting :checkered_flag:, you need to have Git, Spark, Scala and Python installed.
# Clone this project
$ git clone https://github.com/Vivek-Murali/spark-refresher-projects
# Access
$ cd spark-refresher-projects
# Install dependencies for python
$ cd Python
$ pip install -r requirements.txt
# Run the project
Β Β Made with β€οΈ by Vivek Murali
Β