Documentation

Documentation

  • Home
  • Blog
  • API
  • Contact

›Overview

Overview

  • Lentiq introduction
  • Lentiq architecture
  • What is a data pool?
  • What is a project?
  • Migrating from Hadoop

Getting started

  • Deploying applications and processing clusters
  • Connecting to Spark from a notebook
  • Uploading data to Lentiq
  • Creating a data pool
  • Deploying on GCP
  • Deploying on AWS

User Guide

    Managing applications

    • Working with applications
    • Managing compute resources

    Managing data

    • Working with data and metadata
    • Sharing data between data pools
    • Querying data with SQL (DataGrip)
    • Connecting Tableau to Lentiq

    Managing models

    • Working with models
    • Publishing notebooks
    • Training and serializing a model
    • Managing model servers

    Managing workflows

    • Working with workflows
    • Creating a reusable code block from a notebook
    • Creating a docker image based reusable code block
  • Glossary
  • API

Tutorials

  • End-to-end Machine Learning Tutorial

What is a project within a data pool?

A data pool represents a centralized pool of compute resources that can be distributed among multiple teams, departments, business units and use cases.

lentiq_project

In the data pool, a data pool manager can create one or multiple projects that have several characteristics:

  • a project consumes all or a quota of the compute resources of the data pool
  • a project has an object storage bucket associated with it that is visible only to users with access to the project
  • in a project the user can provision applications from the application catalogue
  • a project's quota can be changed at any point in time by the project manager, as long as there are free, unallocated resources on the data pool
  • multiple users can be added either with "manager" or "access" rights to ensure a level of control on the project's compute resources and data
  • connection to the project is secured via a firewall rule that is generally applicable to all provisioned applications
  • a workflow can be defined to run in a particular project, to have access to local data and to consume resources from this dedicated pool of resources.
← What is a data pool?Migrating from Hadoop →
Copyright © 2019 Lentiq