You must configure each instance of the HBase data set rule before it can read data
from and save it to an external HBase storage.
-
Create an instance of the HBase
data set rule.
-
Connect to an instance of the
Data-Admin-Hadoop
configuration rule
by performing the following actions:
-
In the
Hadoop configuration instance
field, reference the
Data-Admin-Hadoop
configuration rule
that contains HBase
storage configuration.
-
Click
Test connectivity.
-
Configure mapping between the fields that are stored in an HBase table and properties
in the
Pega Platform
by performing the following actions:
- Optional:
Click
Refresh table names.
-
In the
HBase table name
field, select a table that is
available in the HBase storage to which you are connected.
-
Click
Preview table
to see the first 100 row IDs and all
column families defined in the table schema, and then select a row ID and a column
family to view data in the selected table.
When you preview the data, it helps you to define the property mappings.
-
In the Row ID field, specify a property that contains a row ID.
A row ID uniquely identifies a single row in an HBase table. The HBase dataset
rule instance that you are configuring will perform all operations on a row identified
by the row ID.
-
Click
Add mapping.
-
In the HBase column field, specify a name of the field that is stored in the HBase
table. Use the following format
<column_family>:<column_name>, for example,
total:expenses.
Note: You can specify just a column family name and map it to the page list property
of Embed-NameValuePair type or page group property of SingleValue-Text type. In this
case, all the column values are put into a list, using the pyName or pxSubscript
property for the column name, and pyValue for the value.
-
In the
Property Name
field, specify a property in the
Pega Platform.
-
Repeat steps e-g to map more properties.
-
Click
Save.