Using data buckets

Data Buckets allow Visualizer users to create a new Attribute based on where a Measure falls in a range of values or as a percentile rank without intervention of a Birst administrator. Data Buckets are similar to the Flash feature Bucketed Measures. Normally, attributes are values that are provided directly in the source data. Attributes in general can be used to group measures. However, it can be useful to define groups based on measures. Data Buckets allow a designer to take a given measure and define ranges of values, or buckets. These buckets can then be used like any other attribute to group other measures.

As an example, suppose you wanted to compare discounting behavior of large versus small customers. If there is no data field assigned to a customer to indicate large versus small, you could create a data bucket to measure each customer and determine their category. If you decided that the difference between large and small customers are those that have an order volume of over 1,000 units vs. under 1,000 units, you would create a data bucket that makes this distinction and name it Customer Size Category. You would specify a category called Large with the minimum value set to 1,000 and the maximum value set to 10,000 and a Small category with a minimum value of 0 and a maximum value of 1000.

Users can edit their existing Bucketed Measures by accessing them from the Subject Area section in Visualizer.

  1. From the Visualizer menu, select the Advanced Tools icon.
  2. Select Create Data Bucket.
  3. To create your data bucket:
    1. Enter your data bucket name.
    2. Designate which subject area you want to save the data bucket in.
    3. Select a Measure.
    4. Select an Attribute.
    5. Provide a Default Bucket Category. This is a catch all category for values or items that do not fall within a defined bucket.
    6. Provide a Null Data Bucket. This is a catch all category for any null values or items.
  4. Select the number of buckets you want to create by supplying an absolute value or by designating the size of the bucket by which you want to create buckets. If you designate the size of the bucket, the number of buckets will be auto-generated by the application.
  5. You can designate the number values using the slider or No. of Buckets box.
    As an example, bucket size was selected as 10,000 and the system automatically rounded the bucket size to the increment that would populate the buckets equally.
    Note: The Data Bucket is saved in the subject area as an attribute for reports and as a separate function for quick editing. Data Buckets are available for quick editing from the Subject Area menu. Data Buckets are available as an attribute to apply to a report.