Jump to content
  • Spotfire Tips & Tricks: Data preparation using Split columns


    Spotfire has many data wrangling capabilities available from the columns flyout or the data canvas. In this article we cover one of these options which is the ability to split a column into multiple columns.

    Introduction 

    Spotfire includes in-built data wrangling functionality. One of these functionalities is the ability to split a column into multiple columns. Through the column properties flyout it is possible to easily define how to split a column into multiple columns.  See the guidance here  for details on how to access this. Below are some examples usages of this.

    Level 1 Scenario

    When delimiter or separator is obvious and numbers of new columns is unambiguous:

    Example: Location column has both latitude and longitude together. Spotfire can automatically select a default separator based on commonly used separators and can also recommend the number of new columns.  It provides flexibility to override the default and manually input values for the separator and number of new columns to generate. If required it is possible to hide the original column from Analysis.

    split_column_1.thumb.png.43bc78a04dacd1662d3c2c07d73724e5.png

     

    Level 2 Scenario

    When delimiter or separator is obvious, numbers of new columns is unambiguous but there is some extraneous information in the data:

    Example : Spotfire can help take out noise from the beginning or end of your string.  For example, an instrument sends a few extra characters at the beginning, which are not required. The user can enter the number of characters to be ignored either at the beginning or end.

    usecase2.thumb.png.4570bfafadb1e841253ac5aaef260d74.png

     

    Level 3 Scenario

    When delimiter or separator is obvious but the numbers of new columns is not well-defined:

    Example : Spotfire data functions can be used to resolve this issue. Based on the number of values, different numbers of columns would be generated. In this case since key B has three colors, three columns are generated. If the maximum number of colors for any Key will be 4 then four columns would be generated.

    usecase3.thumb.png.6b1e3a909b92c2c477905955b27b31ae.png


    User Feedback

    Recommended Comments

    There are no comments to display.


×
×
  • Create New...