Documentation

Documentation

    ›Getting started

    Overview

    • About Lentiq
    • Lentiq introduction
    • What is a Data Pool?
    • What is a Project?
    • Lentiq Architecture

    Getting started

    • Deploying on GCP
    • Deploying on AWS
    • Creating a data pool
    • Upload data to Lentiq
    • Deploy applications and clusters
    • How to connect a notebook to Spark?
    • Publishing notebooks

    User Guide

    • Data Management
    • Applications
    • Sharing data between data pools
    • Glossary

    How to connect a notebook to Spark?

    In Lentiq applications can be interconnected seamlessly. In this guide we are going to explore how a notebook can be connected with a Spark cluster when it is needed to scale data science tasks.

    Prerequisites

    There are some prerequisites for this:

    1. An up and running data pool
    2. An up and running project
    3. An up and running Jupyter Notebook instance
    4. An up and running Spark cluster

    How to connect a notebook to a Spark cluster

    Once all the prerequisites are in place, follow the next steps.

    1. Connect to the Jupyter Notebook instance. Use the URL and password provided in the interface.

    Connect to jupyter

    1. Create a new notebook or enter the Getting Started Guide notebook. Enter the Getting Started Guide notebook

    2. Add a new cell where you will configure connection to a Spark cluster or identify the Spark connection cell in the Getting Started Guide notebook. Connection to Spark cluster

    3. Now copy the Spark Master connection URL. You can find it in the Application Management view in the Spark cluster application card. Spark Master

    4. Enter the Spark Master connection URL in the newly created cell or in the Spark connection cell in the Getting Started Guide notebook.

    5. Run the cell by hitting Shift+Enter.

    6. Wait for the cell to be run and check the Spark Master Web UI to see that the application is registered. Spark Master UI

    Spark Application

    ← Deploy applications and clustersPublishing notebooks →
    • Prerequisites
    • How to connect a notebook to a Spark cluster
    Copyright © 2019 Lentiq