- Download spark from https://spark.apache.org/downloads.html (say to home/ajay/Desktop/BDA )
- Change to the directory from terminal cd /home/ajay/Desktop/BDA
- Unzip the file
tar -xvf spark-1.4.1-bin-hadoop2.6.tgz
- Change to the directory created ( say you unizpped spark file above and renamed it spark) ajay@Inspiron-3542:~/Desktop/BDA$ cd spark
- Run the command ./bin/pyspark
- To look at local jobs see http://192.168.1.18:4040/jobs/ (or based on what you get from your terminal after running command in step 4)
to do this with R just use .bin/sparkR
Sources- http://stackoverflow.com/questions/30483409/installing-apache-spark-on-ubuntu-14-04