
Azure Data Lake Services Gen1 (ADLS Gen1) Connector

⚠️ The DataGalaxy ADLS Gen1 connector will be removed from the list of supported connectors at the end of June 2026. It will no longer be available in your platform.
If you already use the ADLS Gen1 plug-in of the Desktop connector, you can continue using it at your own risk. However, it will no longer be updated (including for security vulnerabilities), and the support team will not be able to help you if you encounter issues.

This article describes how to use the DataGalaxy Azure Data Lake Services Gen1 (ADLS Gen1) connector.

This connector is available in the following modes:

  • Desktop mode: ✅
  • SaaS Online mode: ❌

Connector scope

The extracted objects and their correspondences are detailed in the following table: 

ADLS Gen1 Object | DataGalaxy Object | Comments
Directory | Directory (Container) |
File | File (Structure) |
Field | Field | The column definitions are imported if the processed file is a CSV file (separator ";")
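
To make the CSV rule in the table concrete, here is a minimal sketch of how column names can be read from a semicolon-separated file. It is an illustration only, not the connector's code: the function name `read_csv_columns` and the assumption that the first row holds the column names are hypothetical.

```python
import csv
import io


def read_csv_columns(text, delimiter=";"):
    """Return the column names found on the first row of CSV text.

    Assumption (not stated in the article): the first row is a header row.
    """
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    header = next(reader, [])  # an empty file yields no columns
    return [name.strip() for name in header]


# Hypothetical sample content of a file stored in the data lake.
sample = "customer_id;first_name;signup_date\n42;Ada;2024-08-06\n"
print(read_csv_columns(sample))
```

A file that uses a different separator, such as a comma, would be read as a single column by this sketch, which is why the separator matters.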

Step 1: Installation

  • Download the DataGalaxy connector from the portal (see here)
  • Extract the connector archive into the directory of your choice
  • Download the Azure Data Lake Gen1 plug-in from the portal and copy it into the /lib directory of the connector

Step 2: Run the Azure Data Lake Services Gen1 connector

  • After starting the connector, go to Dictionary connectors
  • If it has been installed correctly, the ADLS Gen1 plug-in appears in the list
  • The following information is requested:
Parameter | Mandatory | Description
Cluster Name | Yes |
Tenant Id | Yes | Azure tenant identifier
Client Id | Yes | Azure client service account ID
Client Secret | Yes | Client secret
Path | No | Root path to navigate
Quick mode | No | Preferred for large data lakes. In this mode, the connector does not compute a summary before import, and the columns of the CSV files in the data lake are not imported into DataGalaxy.
Only include directories (filter) | No | The connector does not list the files contained in the directories.
Depth (filter) | No | Limits the depth of the directories to be browsed to N levels of subdirectories. All subdirectories are browsed when this parameter is equal to 0.
Patterns | No | Masks let you define strategies for grouping and filtering folders and files according to naming patterns. Example: /datasource/{YYYYMMDD}/file_{YYYYMM}_{zz}.csv

Masks must be absolute paths from the root and each character is important, so it may be necessary to define multiple masks to cover all your cases.

More information about this setting is available when running the connector.
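
As an illustration of why every character of a mask matters, the sketch below converts a mask into a regular expression and tests paths against it. The placeholder semantics are assumptions made for this example, not the connector's documented behavior: placeholders made only of Y, M and D are treated as one digit per letter, and any other placeholder (such as {zz}) is treated as any run of characters within one path segment. The function name `mask_to_regex` is hypothetical.

```python
import re

# Matches one {placeholder} inside a mask.
PLACEHOLDER = re.compile(r"\{([^{}]+)\}")


def mask_to_regex(mask):
    """Build a compiled regex from a mask (assumed placeholder semantics)."""
    parts = []
    pos = 0
    for match in PLACEHOLDER.finditer(mask):
        # Literal text between placeholders must match character for character.
        parts.append(re.escape(mask[pos:match.start()]))
        token = match.group(1)
        if set(token) <= set("YMD"):
            # Assumption: date placeholders stand for one digit per letter.
            parts.append(r"\d{%d}" % len(token))
        else:
            # Assumption: other placeholders match anything except "/".
            parts.append(r"[^/]+")
        pos = match.end()
    parts.append(re.escape(mask[pos:]))
    return re.compile("".join(parts))


pattern = mask_to_regex("/datasource/{YYYYMMDD}/file_{YYYYMM}_{zz}.csv")
for path in ("/datasource/20240823/file_202408_fr.csv",
             "/datasource/2024/file_202408_fr.csv"):
    print(path, bool(pattern.fullmatch(path)))
```

Under these assumptions the second path is rejected because its directory name has four digits instead of eight, which shows why several masks may be needed to cover differently named folders.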

Once this information is filled in, you can test the connection and start the import.
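
The Depth and Only include directories filters from the parameter list can be pictured with a small traversal sketch. It walks a local directory tree as a stand-in for the data lake, and it assumes one reading of the Depth parameter: with depth N, only subdirectories up to N levels below the root are listed and browsed, and 0 means no limit. The function `list_entries` and its arguments are hypothetical and are not part of the connector.

```python
import os


def list_entries(root, depth=0, only_directories=False):
    """List paths under root, relative to root, directories ending in "/".

    depth=0 browses every subdirectory; depth=N stops after N levels of
    subdirectories (assumed interpretation of the Depth filter).
    """
    results = []

    def visit(directory, level):
        for entry in sorted(os.scandir(directory), key=lambda e: e.name):
            rel = os.path.relpath(entry.path, root).replace(os.sep, "/")
            if entry.is_dir():
                # This entry sits level + 1 levels below the root.
                if depth == 0 or level + 1 <= depth:
                    results.append(rel + "/")
                    visit(entry.path, level + 1)
            elif not only_directories:
                results.append(rel)

    visit(root, 0)
    return results
```

For example, `list_entries("/tmp/lake", depth=1)` would list the root's files and its direct subdirectories with their files, but nothing deeper, while `only_directories=True` would skip every file.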

Technical information

The service account used must be configured to have the following rights:

  • Authorized APIs (to be defined when the application is registered): Azure Data Lake 
  • Role assignment (to be defined at the storage account level): Reader

You can optionally set additional restrictions using ACLs to limit the resources that the service account can access.

The ADLS Gen1 connector uses the Azure Data Lake Store Gen1 REST APIs (https://docs.microsoft.com/fr-fr/rest/api/datalakestore/).
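
To check a service account outside the connector, the following sketch shows the general shape of a call to the Azure Data Lake Store Gen1 REST API: an OAuth 2.0 client-credentials token request, followed by a WebHDFS-style LISTSTATUS request. It does not reproduce the connector's internal calls. It also assumes that the Cluster Name parameter corresponds to the account name in `<name>.azuredatalakestore.net`; the function names are hypothetical.

```python
import json
import urllib.parse
import urllib.request

TOKEN_URL = "https://login.microsoftonline.com/{tenant}/oauth2/token"
RESOURCE = "https://datalake.azure.net/"


def build_token_request(tenant_id, client_id, client_secret):
    """Return (url, form body) for a client-credentials token request."""
    url = TOKEN_URL.format(tenant=urllib.parse.quote(tenant_id, safe=""))
    body = urllib.parse.urlencode({
        "grant_type": "client_credentials",
        "client_id": client_id,
        "client_secret": client_secret,
        "resource": RESOURCE,
    }).encode("ascii")
    return url, body


def build_list_url(account_name, path="/"):
    """Return the LISTSTATUS URL for a directory of the given account."""
    quoted = urllib.parse.quote("/" + path.strip("/"))
    return (f"https://{account_name}.azuredatalakestore.net"
            f"/webhdfs/v1{quoted}?op=LISTSTATUS")


def list_directory(account_name, path, tenant_id, client_id, client_secret):
    """Fetch a token, then return (name, type) pairs for one directory."""
    token_url, body = build_token_request(tenant_id, client_id, client_secret)
    with urllib.request.urlopen(urllib.request.Request(token_url, data=body)) as resp:
        token = json.load(resp)["access_token"]
    request = urllib.request.Request(
        build_list_url(account_name, path),
        headers={"Authorization": "Bearer " + token},
    )
    with urllib.request.urlopen(request) as resp:
        statuses = json.load(resp)["FileStatuses"]["FileStatus"]
    return [(status["pathSuffix"], status["type"]) for status in statuses]
```

If `list_directory` fails with an HTTP 401 or 403 error, the rights described above (API permission, role assignment, ACLs) are the first things to check.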

Releases

Date | Plugin version | DataGalaxy release | Desktop Connector version (minimum) | Description
23/08/2024 | 4.0.1 | v3.69.0 | 5.2.3 | Updated the logger to show more information when using verbose mode
06/08/2024 | 4.0.0 | v3.65.0 | 5.0.5 | Migrated from Java 11 to Java 17 + CVE fixes
