Azure Data Lake Storage Gen1

Article 01/26/2022

Note: Microsoft has released its next-generation data lake store, Azure Data Lake Storage Gen2.

Azure Data Lake Storage Gen1 (formerly Azure Data Lake Store, also known as ADLS) is an enterprise-wide hyper-scale repository for big data analytic workloads. It enables you to capture data of any size, type, and ingestion speed in a single place for operational and exploratory analytics. It is specifically designed to enable analytics on the stored data and is tuned for performance in data analytics scenarios.

Note: Azure Databricks also supports the following Azure data sources: Azure Blob storage, Azure Cosmos DB, and Azure Synapse Analytics.

There are three ways of accessing Azure Data Lake Storage Gen1:

- Pass your Azure Active Directory credentials, also known as credential passthrough.
- Mount an Azure Data Lake Storage Gen1 filesystem to DBFS using a service principal and OAuth 2.0.
- Use a service principal directly.

Access automatically with your Azure Active Directory credentials

You can authenticate automatically to Azure Data Lake Storage Gen1 from Azure Databricks clusters using the same Azure Active Directory (Azure AD) identity that you use to log in to Azure Databricks. When you enable your cluster for Azure AD credential passthrough, commands that you run on that cluster can read and write your data in Azure Data Lake Storage Gen1 without requiring you to configure service principal credentials for access to storage. For complete setup and usage instructions, see Access Azure Data Lake Storage using Azure Active Directory credential passthrough.
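As a rough illustration of what reading data looks like once credential passthrough is enabled on a cluster, the sketch below builds an `adl://` URI and passes it to Spark's CSV reader. The helper names `adl_uri` and `read_csv_with_passthrough`, the account name, and the file path are hypothetical and chosen only for this example. The sketch assumes it runs in an Azure Databricks notebook where a `spark` session already exists and passthrough is configured. It is not the official setup procedure.

```python
def adl_uri(account_name: str, path: str = "") -> str:
    """Build an adl:// URI for an Azure Data Lake Storage Gen1 account.

    `account_name` is the short account name (hypothetical here), without
    the `.azuredatalakestore.net` suffix.
    """
    if not account_name or "." in account_name or "/" in account_name:
        raise ValueError("account_name must be the short account name")
    # Strip leading slashes so the URI has exactly one separator after the host.
    return f"adl://{account_name}.azuredatalakestore.net/{path.lstrip('/')}"


def read_csv_with_passthrough(spark, account_name: str, path: str):
    """Read a CSV file from ADLS Gen1 using the caller's own Azure AD identity.

    Assumes the cluster has Azure AD credential passthrough enabled, so no
    service principal settings are configured here.
    """
    return spark.read.csv(adl_uri(account_name, path), header=True)


# Example usage in a Databricks notebook (placeholder account and path):
# df = read_csv_with_passthrough(spark, "myaccount", "raw/events.csv")
# df.show(5)
```

Keeping the URI construction in its own function makes the path format easy to check without a cluster, while the read itself succeeds only if the signed-in user has permission on the underlying data.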


