Jump to content
  • Automated Data Cleaning with Statistica


    The following example is to get a user started with the Data Health Check node.

    The following example is to get a user started with the Data Health Check node.

    After starting Statistica, select Home menu -> pull down arrow on New menu.  Now select the Workspace menu. The list of templates will display. Select the Get Data template to locate a data connector. Select the node and type Ctrl+C. 

    Note: Some of the nodes may be disabled if the user does not own the associated product. If data needs to be retrieved from the OSISoft PI database, then look for a template named PI Asset Framework or PI Asset Framework and Event Frames.

    Next, select the Home menu -> pull down the arrow on the New menu. Now select the Workspace menu. Select the Automated Data Cleaning template.  

    Connect data (see Get Data above) to the Data Health Check node. This node can be found by searching for it by name via Feature Finder (upper right corner).

    stat1.png.b89e5255f1a167cb19bf1d726a321c75.png

    Data Health Check node looks for common data issues (missing, invariant, etc...) for each variable and generates a report. This report can be used in deciding how to clean the data. This can be especially useful because the user does not want to explore all variables by hand.

    stat2.png.93005ade9a06c1242adaf33b3c97d043.png

     

     Learn more about Statistica 


    User Feedback

    Recommended Comments

    There are no comments to display.


×
×
  • Create New...