Submit a ticket My tickets
Welcome
Login  Sign up

Presentation of attributes

The attributes, to define the different facets of the context, are the basic elements of a data governance strategy.

The purpose of this article is to explain the notion of attribute and to describe how it works in the DataGalaxy platform.

To know how to create and delete attributes, you can refer to the article Attribute Management.

What is an attribute?

One of DataGalaxy's missions is to make the different Data profiles (Analysts, Dev, architect, business, producers, consumers...) collaborate around a shared vision of the different facets of the data. This involves managing different types of heterogeneous information. Attribute management addresses this need. They correspond to fields which, once grouped into categories, making it possible to establish an object identity card or the object card.

The objective of attributes is to be able to contextualize as much as possible the different facets of an object and thus to enrich its knowledge. 

In order to support the data governance approach (corporate vision), attributes are created at the Client Space level. This means that they are common to all workspaces depending on this customer space.

It is important to ask yourself a few questions before you start creating dozens of attributes: 

  • What is the proportion of users concerned with the information carried by the attribute? 
  • What are the critical attributes for your use case, which need to be cured? 
  • Is the meaning of the attribute explicit? for example, what does the attribute "frequency" mean? is it the update frequency? the data deletion frequency? 
  • Does this attribute not already exist or does it largely cover another attribute? It is advisable to converge the terms as much as possible so as not to end up with several interchangeable fields, which would be a source of confusion. 
  • Completeness of the values proposed in the attribute list? some types of attributes contain lists of values: this one must be rich enough for everyone to find their way around but not too rich to avoid getting lost.  
  • Which language should we use? The platform is available in French and English, but the values positioned on the attributes can be in any language, which one should prevail? Should we create the attribute in both languages?
  • Are there any attributes specific to a domain, pole or project? Should management be delegated?
  • .....

Tips

There is no limit to the number of attributes that can be created. However, it should be kept in mind that 20% of the attributes carry 80% of the knowledge expected by data users.

The multiplication of attributes can be counterproductive and discourage users (poor ergonomics, lack of understanding of differences, the complexity of maintenance...).

To maintain an optimal level of ergonomics, we recommend evaluating the ROI of creating and displaying each attribute.

How does it work?

The diagram below summarizes the attribute structuring in DataGalaxy:

The attributes are structured into two main families:

  • System attributes

In order to ensure a quick start, DataGalaxy offers and maintains a number of attributes, known as system attributes.

These are attributes available "by default", which cannot be modified by users. They can be identified by the cloud icon next to their name.

Some system attributes allow you to view data from source tools, for example the number of records in a table or the source query of a data set. The information returned depends on the source tools, you will find the details in the articles of the concerned technologies

  • Custom attributes

These are the attributes created and managed by the administrators of the clientspace, they are usable/available for all workspaces of the clientspace.

Consequently, any modification or deletion of a custom attribute will impact all workspaces that use this attribute.

Two types of attributes can be proposed in each family:

  • Common attributes

They correspond to the attributes that are usable/available for all objects on the platform, regardless of the module to which the object belongs.

Example: The description of the object, the DCP classification, the responsible person,...

  • Module attributes

They correspond to specific attributes that are not exclusively usable/available for a subset of platform objects.

  • Specific attributes for Glossary objects,
  • Specific attributes for Data Sources (DBD, FileStore,...),
  • Specific attributes for Structures (Table, File, ...),
  • Specific attributes for Fields (Column,...),
  • Specific attributes for Uses,
  • Specific attributes for processing objects.

The structure of an attribute

All attributes, regardless of their type or their format, share a common definition base:

  1. Attribute label,
  2. Description of the attribute,
  3. Attribute format (text, date, label...),
  4. The Recommended option which allows you to calculate the completion rate of the object,
  5. The default value (optional).

Supported attribute formats

Formats

Description

Comment

Boolean    

The attribute is in the form of a Yes/No field

Example of Boolean type fields:

  • Contains personal data: yes or no 
  • Is personal data: yes or no 
DateThe date attribute is in dd/mm/yyyyy format
All information in date format. 

Formatted text

Rich text attribute with formatting tools
Formatted text is to be used for attributes that require a long description or advanced formatting of content.
Hyperlink
Allows you to insert a link to external references

For example, in the context of a GDPR project, the hyperlink field can be used to link to the CNIL's Privacy Impact Assessment tool 

Number
The field in format Number
It can be an integer or a decimal number
Value list
Single choice-value list

Text
Plain text field
Attributes of this type are limited to 512 characters.
Tag

Classification attribute

Each tag created has a color that allows you to easily identify the group to which the object tagged with this tag belongs.

Example of classification: Marketing, Sales, Finance, HR

Multi-value list
Multiple choice field
Multi-value list works like Tags.
Users
Multiple choice field with a list of users in the customer area as a value list    


Person
Multiple choice field with a list of users in the customer area as a value listThe type used is the attribute definition to identify stakeholders and external actors  
Hierarchy
Fields with a list of multiple-choice hierarchical values

Time seriesFields allowing to follow the evolution of a value in time
Example : a quality indicator

Did you find it helpful? Yes No

Send feedback
Sorry we couldn't be helpful. Help us improve this article with your feedback.