Channel: Data Services and Data Quality

SAP Data Insight

Data Insight

Data Insight is used to perform a DHA (Data Health Assessment), which checks whether the data is fit for use. We use Data Insight to test and profile the data before it goes into the ETL process. In other words, Data Insight is the data investigation tool for the DHA. It automates the analysis and monitors the data.

 

Using Data Insight we can perform the following tasks:


Data Profiling

Column query

Integrity test and Custom query

Scheduling

Creating trend reports

Sampling reports

 

Getting started

Creating Connection


Navigating to Data Insight


Note: Start the Data Insight Engine before using the tool.

 

To start the Data Insight Engine, follow the navigation below.

Start --> Program Files --> Business objects XI 3.0 --> Business objects Data Insight --> Data Insight Engine

Insight1.png


Clicking this opens a DOS window, which starts the engine.

Once the engine has started, launch the Data Insight GUI from the same menu path.

Once Data Insight starts, you will see the screen below.


Insight2.png

Now we need to create a project.

To create a project, go to:

File -> New Project, give the project a name, and check the box for Share Project.

Insight3.png

Sharing the project makes it accessible to other users.

 

Next, provide the connection name.

Choose Database and click the down arrow to specify the database connection.

Insight4.png


If you are using the tool for the first time, enter your SQL Server name and click OK; this opens the Data Link Properties window.

Enter the server name, username, and password. In step 3, select the SQL Server database on which you want to run the tests.

Click Test connection to verify that the credentials are correct, then click OK.

Insight5.png

The window for selecting owners, shown below, opens next; you can click OK. The Insight window then opens and shows the selected database. Expand the database to see the tables under it, then go to the table you want and expand it.

Insight6.png

Insight7.png

Here we have four tabs (Data Profile, Column Query, Referential Integrity, Custom Query), which we can use to perform different types of tests on the data.

Data Profile

Using this tab we can run the following tests on the data: Summary, Comparison, Frequency, Word Frequency, Uniqueness, and Redundancy.

Summary gives a snapshot of the data for decision making or further drill-down.
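To make the idea of a summary snapshot concrete, here is a minimal Python sketch of the kind of per-column figures a summary profile might contain. The article does not list the exact measures Data Insight reports, so the measures chosen here (row count, blanks, distinct values, minimum, maximum), the function name summarize_column, and the sample data are all assumptions for illustration only.

```python
def summarize_column(values):
    """Return a small summary profile for one column (a list of values).

    The measures are assumed examples, not Data Insight's actual output.
    """
    # Treat None and empty strings as missing values.
    non_null = [v for v in values if v is not None and v != ""]
    return {
        "rows": len(values),
        "nulls_or_blanks": len(values) - len(non_null),
        "distinct": len(set(non_null)),
        "min": min(non_null) if non_null else None,
        "max": max(non_null) if non_null else None,
    }


# Hypothetical sample column, not taken from the article.
cities = ["Pune", "Delhi", None, "Pune", ""]
summary = summarize_column(cities)
print(summary)
```

A table-level summary would simply repeat this for every column of the table.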

How can we carry out the Summary test?

You can run the summary at the table level or at the column level. Select the check box under the Summary column and click RUN.

This produces the Summary Profile shown below, which gives a complete DHA of the data.

Insight12.png

Check Save report and click Close. You are then asked to save the profile report. Click Yes, enter a profile name, and click OK.

Insight9.png

Insight10.png

Notice that the last run column is now populated with the timestamp and the result. Click the result next to the timestamp to view the results.

Comparison test

Comparison reports the count and percentage of rows with incomplete column values.
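As a rough illustration of that measure, the following Python sketch counts the rows in which at least one of the chosen columns is empty. It only mirrors the one-line definition above and makes no claim about how Data Insight computes it; the function name incomplete_rows, the rule for what counts as "incomplete" (None or blank), and the sample rows are assumptions.

```python
def incomplete_rows(rows, columns):
    """Return (count, percentage) of rows with a missing value in any given column."""

    def is_missing(value):
        # Assumption: None and whitespace-only strings count as incomplete.
        return value is None or str(value).strip() == ""

    incomplete = sum(
        1 for row in rows if any(is_missing(row.get(col)) for col in columns)
    )
    percentage = 100.0 * incomplete / len(rows) if rows else 0.0
    return incomplete, percentage


# Hypothetical sample table, not taken from the article.
customers = [
    {"id": 1, "name": "Asha", "email": "asha@example.com"},
    {"id": 2, "name": "", "email": "ravi@example.com"},
    {"id": 3, "name": "Meena", "email": None},
    {"id": 4, "name": "John", "email": "john@example.com"},
]
count, pct = incomplete_rows(customers, ["name", "email"])
print(count, pct)
```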

To run a comparison test, select the check box under Comparison at the table level or the row level and click RUN.

Insight11.png


Insight12.png

The result shows any matching or duplicate records that were found. In our case there are no duplicate or matching records.

You can also click print report to generate the report, or click export report to export it to different formats.

Insight13.png

Insight14.png

When you close this report and click Close in the main window, you are asked to save the result; save it using the same procedure as above.

Frequency (FRQ) is used to find the frequency distribution of distinct values in columns.

The procedure is the same as above. Select the check box under FRQ and click RUN to see the results. You can click print report to export the results to different formats. You can also save the result by checking Save report, clicking Close, and entering a profile name.

See the following screenshots.

Insight15.png

Insight16.png

Insight17.png
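For readers who want to see what a frequency distribution of distinct values looks like outside the tool, here is a minimal Python sketch. The function name frequency_distribution and the sample column are assumptions for illustration; the report layout in Data Insight itself may differ.

```python
from collections import Counter


def frequency_distribution(values):
    """Return (value, count, percentage) tuples, most frequent value first."""
    counts = Counter(values)
    total = len(values)
    return [
        (value, count, round(100.0 * count / total, 1))
        for value, count in counts.most_common()
    ]


# Hypothetical sample column, not taken from the article.
countries = ["IN", "US", "IN", "DE", "IN", "US"]
distribution = frequency_distribution(countries)
for value, count, pct in distribution:
    print(value, count, pct)
```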

WFRQ (Word Frequency): This gives the frequency distribution of single words.

As in the procedure above, select the check box and click RUN to see the results.
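The difference from the plain frequency test is that each value is split into words before counting. A minimal Python sketch of that idea follows; the tokenization rule (lowercase, split on non-word characters), the function name word_frequency, and the sample data are assumptions, since the article does not say how Data Insight splits words.

```python
import re
from collections import Counter


def word_frequency(values):
    """Count how often each single word occurs across all values of a column."""
    counts = Counter()
    for value in values:
        if value:  # skip None and empty strings
            # Assumption: words are case-insensitive runs of word characters.
            counts.update(re.findall(r"\w+", value.lower()))
    return counts


# Hypothetical sample column, not taken from the article.
addresses = ["Main Street", "High Street", "main road", None]
words = word_frequency(addresses)
print(words.most_common())
```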

UNQ (Unique): This gives the count and percentage of rows with non-unique column values.

As in the procedure above, select the check box and click RUN to see the results.
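The uniqueness measure can be sketched in a few lines of Python. This version counts every row whose value appears more than once in the column, which is one reasonable reading of "rows with non-unique column values"; the function name non_unique_rows and the sample data are assumptions.

```python
from collections import Counter


def non_unique_rows(values):
    """Return (count, percentage) of rows whose value occurs more than once."""
    counts = Counter(values)
    # Every row that shares its value with another row is non-unique.
    duplicates = sum(c for c in counts.values() if c > 1)
    percentage = 100.0 * duplicates / len(values) if values else 0.0
    return duplicates, percentage


# Hypothetical sample column, not taken from the article.
order_ids = [101, 102, 102, 103, 104, 104, 104, 105]
dup_count, dup_pct = non_unique_rows(order_ids)
print(dup_count, dup_pct)
```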

RDN (Redundancy): This test identifies the commonalities and outliers between columns.

As in the procedure above, select the check box and click RUN to see the results.
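One way to picture commonalities and outliers between two columns is with set operations: values present in both columns are common, and values present in only one are outliers. The Python sketch below takes that view; it is an illustration under that assumption, and the function name redundancy and the sample columns are hypothetical.

```python
def redundancy(col_a, col_b):
    """Compare the distinct values of two columns."""
    a, b = set(col_a), set(col_b)
    return {
        "common": a & b,      # values found in both columns
        "only_in_a": a - b,   # outliers: present in the first column only
        "only_in_b": b - a,   # outliers: present in the second column only
    }


# Hypothetical sample columns, not taken from the article.
order_customer_ids = [1, 2, 2, 3, 7]
customer_ids = [1, 2, 3, 4]
result = redundancy(order_customer_ids, customer_ids)
print(result)
```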

Column Query: This is used to analyze the data within Data Insight.

  1. Select the column on which you want to perform the test and right-click -> add combined column query

We can perform the following tests using the Combined column Query.

 

Insight18.png

Format

Occurrence                 Search for values that occur (<, >, =, <=, >=) ‘n’ times

Pattern                    Pattern of the data in the column

Pattern recognition        Recognize string patterns that contain special characters

Range                      Specify the min and max values for the range

Reference column           Reference column against which this column is checked

Specific value test        Search the column for a specific value
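A few of these query types can be sketched in Python to show what they check. The sketch below covers Pattern, Range, and Occurrence only. The pattern notation (9 for a digit, A for a letter, other characters kept as-is) is a common profiling convention and is an assumption here, not Data Insight's documented notation; all function names and sample values are hypothetical.

```python
import operator
from collections import Counter

# Comparison operators offered by the Occurrence query, as listed above.
OPS = {
    "<": operator.lt,
    ">": operator.gt,
    "=": operator.eq,
    "<=": operator.le,
    ">=": operator.ge,
}


def value_pattern(value):
    """Pattern: reduce a string to its shape (assumed notation: 9=digit, A=letter)."""
    shape = []
    for ch in value:
        if ch.isdigit():
            shape.append("9")
        elif ch.isalpha():
            shape.append("A")
        else:
            shape.append(ch)  # keep special characters such as '-' visible
    return "".join(shape)


def out_of_range(values, minimum, maximum):
    """Range: return the values that fall outside [minimum, maximum]."""
    return [v for v in values if not (minimum <= v <= maximum)]


def occurrence_query(values, op, n):
    """Occurrence: return the values whose occurrence count satisfies `op n`."""
    counts = Counter(values)
    return sorted(v for v, c in counts.items() if OPS[op](c, n))


# Hypothetical sample columns, not taken from the article.
phones = ["555-1234", "555-9876", "5551234", "555-1234"]
ages = [18, 25, 130, -4, 40]

patterns = Counter(value_pattern(p) for p in phones)
bad_ages = out_of_range(ages, 0, 120)
repeated = occurrence_query(phones, ">=", 2)
print(patterns, bad_ages, repeated)
```

Grouping values by pattern makes inconsistent formats stand out: in the sample, one phone number lacks the separator that the other three share.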


Select a radio button on the left side, and the corresponding options are activated on the right side.

Insight19.png

Once you select the query type on the left side, choose the respective options on the right side, select the Return data check box, and click RUN.

In our example, we use the specific value test.

Select the specific value test on the left side and specify a value on the right side. Select the Return data check box and click RUN. The result is then displayed. You can click print report to see the data in a report format, or you can select the save data check box and click Close. Click OK to save the report, give the report a name, and click OK.
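In plain terms, the specific value test filters a column for one value and, when Return data is selected, hands back the matching rows. A minimal Python sketch of that behaviour follows; the function name specific_value_test, the return_data parameter, and the sample rows are assumptions for illustration and do not reflect the tool's internals.

```python
def specific_value_test(rows, column, value, return_data=True):
    """Return (match_count, matching_rows) for rows where column equals value.

    When return_data is False, only the count is reported and the row list is empty.
    """
    matches = [row for row in rows if row.get(column) == value]
    return len(matches), (matches if return_data else [])


# Hypothetical sample table, not taken from the article.
employees = [
    {"id": 1, "dept": "Sales"},
    {"id": 2, "dept": "HR"},
    {"id": 3, "dept": "Sales"},
]
match_count, matched_rows = specific_value_test(employees, "dept", "Sales")
print(match_count, matched_rows)
```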

Happy Learning

Rakesh

