Discover your Data with Data Catalog in Purview
Updated: Sept. 30, 2022
Browse and search assets

When key users have a set of target tables in mind but don't know the full data model, they can use Browse assets to find the data. Assets can be browsed by collection or by source type. Browsing by collection lists all the resources together with the collection hierarchy. When looking for a particular table in a collection, you can filter the assets by classification, glossary term, label, and so on, which is a fast and simple way to narrow down a collection that holds a large number of assets. Results are sorted by relevance. Once you have found the table, click on it to see more detailed information such as schema, classifications, and lineage.

[Image: DataCatalog_BrowseAssets, https://hubsters.de/wp-content/uploads/2021/12/DataCatalog_BrowseAssets.png]

If we know which kind of source the table comes from, we can also find it by source type. In this view, all assets are listed in a hierarchy. After clicking a source type, we see a list of all the databases of that type. For example, if we choose a storage account, all its containers are shown in the left panel, and the child assets of a container are shown on the right.

[Image: DataCatalog_SourceType, https://hubsters.de/wp-content/uploads/2021/12/DataCatalog_SourceType.png]

To find a particular table faster, we can also use the search bar in the data catalog directly. Purview shows relevant results based on the keywords the user enters. A keyword can be a classification, a glossary term, or the data type of the asset.
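To make the filter-then-rank idea concrete, here is a minimal Python sketch of a keyword search over a small in-memory list of assets. It is a toy model, not Purview's search engine or API: the `Asset` class, the `search_assets` function, the sample assets, and the scoring rule are all assumptions made up for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Asset:
    """Toy stand-in for a catalog asset; not a Purview type."""
    name: str
    collection: str
    classifications: set = field(default_factory=set)
    glossary_terms: set = field(default_factory=set)


def search_assets(assets, keyword, collection=None, classification=None):
    """Filter by collection/classification, then rank by a naive keyword score."""
    keyword = keyword.lower()
    hits = []
    for asset in assets:
        if collection is not None and asset.collection != collection:
            continue
        if classification is not None and classification not in asset.classifications:
            continue
        # Naive relevance (an assumption): a name match counts double,
        # a match in classifications or glossary terms counts once each.
        score = 2 * (keyword in asset.name.lower())
        score += any(keyword in c.lower() for c in asset.classifications)
        score += any(keyword in t.lower() for t in asset.glossary_terms)
        if score > 0:
            hits.append((score, asset.name))
    # Highest score first; ties are broken alphabetically for stable output.
    hits.sort(key=lambda hit: (-hit[0], hit[1]))
    return [name for _, name in hits]


# Hypothetical sample data.
catalog = [
    Asset("customer_master.csv", "Sales", {"Email Address"}, {"Customer"}),
    Asset("orders.parquet", "Sales", set(), {"Customer order"}),
    Asset("customer_scores", "Marketing", {"Email Address"}, {"Customer"}),
]

print(search_assets(catalog, "customer", collection="Sales"))
```

Restricting the search to one collection before scoring mirrors the way a collection or classification filter shrinks the candidate list before the relevance ordering is applied.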
Lineage

Lineage is one of the most important features that Purview provides. It shows the processes that connect data assets. Sources such as Data Factory and Power BI capture these processes and provide a visual trail for the data. After Power BI has been scanned or a pipeline has been triggered in Data Factory, the lineage appears on the relevant data assets and processes.

The lineage view of an asset also shows the asset's schema in the left panel. Clicking a column name reveals how that column is used and how it was generated in earlier steps. In other words, lineage tracks data not only at the table level but also at the column level. For example, the picture below shows that 4 columns in the customer_master.csv file generate the column 'costumer_id' after the data flow activity in Data Factory. To inspect another asset in the lineage, click that asset and switch to it to see its details.

[Image: DataCatalog_Lineage, https://hubsters.de/wp-content/uploads/2021/12/DataCatalog_Lineage.png]
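Column-level lineage can be pictured as a graph in which each column points back to the columns it was derived from. The following Python sketch walks such a graph upstream. It is only an illustration of the idea, not how Purview stores or exposes lineage: apart from customer_master.csv and costumer_id, every asset and column name here is invented, as are the `UPSTREAM` mapping and the `upstream_columns` helper.

```python
from collections import deque

# Hypothetical column-level lineage: each (asset, column) maps to the
# (asset, column) pairs it was derived from. Only customer_master.csv and
# costumer_id come from the article; all other names are placeholders.
UPSTREAM = {
    ("sales_report", "customer_key"): [("dim_customer", "costumer_id")],
    ("dim_customer", "costumer_id"): [
        ("customer_master.csv", "source_col_1"),
        ("customer_master.csv", "source_col_2"),
        ("customer_master.csv", "source_col_3"),
        ("customer_master.csv", "source_col_4"),
    ],
}


def upstream_columns(start, upstream=UPSTREAM):
    """Return every column that feeds `start`, nearest ancestors first."""
    seen, order = {start}, []
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for parent in upstream.get(node, []):
            if parent not in seen:  # guard against revisiting shared ancestors
                seen.add(parent)
                order.append(parent)
                queue.append(parent)
    return order


for asset, column in upstream_columns(("sales_report", "customer_key")):
    print(f"{asset}.{column}")
```

A breadth-first walk lists the direct inputs first and the original source columns last, which matches reading a lineage diagram from right to left.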
Business glossary

Azure Purview lets users create business glossary terms to enrich their data. A glossary organizes business terms and helps key users understand what the terms mean in different situations and contexts. Terms can be mapped to resources, tables, and columns. Terms can also be created in a hierarchy, which gives the data estate a better-structured business glossary.

When adding a new term to the glossary, we can use the system default template or create a new one. The default template has Name as the only mandatory field, with Definition, Data stewards, Data experts, Parent, Acronym, Synonyms, Related terms, and Resources as optional fields. In a custom term template, we can add date, text, single-choice, or multiple-choice attributes as needed, and an attribute can be marked as required. After a term is created, the responsible person can review the content and approve the glossary term.

[Image: DataCatalog_Glossary, https://hubsters.de/wp-content/uploads/2021/12/DataCatalog_Glossary.png]

All glossary terms can be shown in a hierarchy view, which gives key users a good understanding of the glossary structure. Inside a term, you can find information such as the parent term, definition, contacts, and attributes. Clicking View assets lists all the assets associated with the term.

[Image: DataCatalog_CustomerValueModel, https://hubsters.de/wp-content/uploads/2021/12/DataCatalog_CustomerValueModel.png]

In an organization, multiple terms can describe the same object from different perspectives, and these terms can be related to each other. The same term can also stand for more than one object. We can use Synonyms to connect terms that have a similar definition and Related terms to link terms with different definitions, for example terms used by groups from different departments.
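The glossary structure described in this section (a mandatory name, optional fields, parent links, synonyms, related terms, and required custom attributes) can be sketched as a small data model. This Python example is a rough illustration under stated assumptions, not Purview's glossary schema: the `Term` class, the helper functions, the sample terms, and the attribute names "Review date" and "Owner" are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Term:
    """Toy glossary term; fields loosely mirror the default template."""
    name: str  # the only mandatory field
    definition: str = ""
    parent: Optional[str] = None
    synonyms: List[str] = field(default_factory=list)
    related_terms: List[str] = field(default_factory=list)
    attributes: Dict[str, str] = field(default_factory=dict)  # custom template values


def missing_required(term, required_attributes):
    """List custom attributes marked as required that the term leaves empty."""
    return [a for a in required_attributes if not term.attributes.get(a)]


def term_path(name, glossary):
    """Follow parent links upward (assumes no cycles) to build a hierarchy path."""
    parts = []
    current = name
    while current is not None:
        parts.append(current)
        current = glossary[current].parent
    return " > ".join(reversed(parts))


# Hypothetical sample glossary.
terms = [
    Term("Customer", definition="A party that buys goods or services."),
    Term(
        "Customer value",
        parent="Customer",
        synonyms=["Customer worth"],
        related_terms=["Customer"],
        attributes={"Review date": "2022-09-30"},
    ),
]
glossary = {t.name: t for t in terms}

print(term_path("Customer value", glossary))
print(missing_required(glossary["Customer value"], ["Review date", "Owner"]))
```

Checking required attributes before a term goes to review is one way the "required" flag of a custom template could be enforced, while the parent links are enough to rebuild the hierarchy view.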
Conclusion

The Purview data catalog provides information about the assets across the whole data estate and enables key users to explore valuable datasets. Users can use asset browsing, the lineage view, and the business glossary to gain a deep understanding of the data model and to get in touch with other departments that are also responsible for parts of it. Now start your Purview journey and get to know your data estate quickly and easily!

Qianyu Chen
Qianyu Chen is a Solution Architect for Data Analytics and Machine Learning.
qianyu.chen@hubsters.de