Monday, August 15, 2016

Install iPython (Jupyter) Notebook on Amazon EMR


  1. Use the bootstrap script on this link to install iPython Notebook: https://github.com/awslabs/emr-bootstrap-actions/tree/master/ipython-notebook
  2. Although the iPython server is running, it's not integrated with Spark. Follow the instructions according to this blog post: https://districtdatalabs.silvrback.com/getting-started-with-spark-in-python
  3. Create the initial SparkContext and SQL context as follows:

from pyspark import  SparkContext
sc = SparkContext( 'local', 'pyspark')
from pyspark.sql import SQLContext
sqlContext = SQLContext(sc)

2 comments:

  1. as many as i visit this blog i can find some fantastic information . Thanks for sharing knowledge . best wishes to be guided
    free ads
    advertise for free
    real estate

    ReplyDelete