

Building machine learning entity extraction models

Updated on July 5, 2022

Use Pega Platform machine learning capabilities to create entity extraction models to recognize named entities.

By using models that are based on the Conditional Random Fields (CRF) algorithm, you can extract information from unstructured data and label it as belonging to a particular group. For example, if the document that you want to analyze mentions Galaxy S8, the entity extraction model labels that phrase as a Phone entity.
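
As an illustration of the Galaxy S8 example, sequence models such as CRF are commonly trained on text in which every token carries a label. The sketch below assumes the widely used BIO convention (B- marks the first token of an entity, I- a continuation, O a token outside any entity); the source does not state which labeling scheme Pega Platform uses internally, and the function name `spans_from_bio` is hypothetical.

```python
def spans_from_bio(tokens, labels):
    """Group BIO-labelled tokens into (entity text, entity type) pairs.

    Assumption: labels follow the BIO convention. This is an illustration
    of token-level labelling, not Pega Platform's internal format.
    """
    spans = []
    current_tokens, current_type = [], None
    for token, label in zip(tokens, labels):
        if label.startswith("B-"):
            # A new entity starts; close any entity that was still open.
            if current_tokens:
                spans.append((" ".join(current_tokens), current_type))
            current_tokens, current_type = [token], label[2:]
        elif label.startswith("I-") and current_type == label[2:]:
            # Continuation of the entity that is currently open.
            current_tokens.append(token)
        else:
            # "O" or a stray "I-" tag: close the open entity, if any.
            if current_tokens:
                spans.append((" ".join(current_tokens), current_type))
            current_tokens, current_type = [], None
    if current_tokens:
        spans.append((" ".join(current_tokens), current_type))
    return spans


tokens = ["I", "bought", "a", "Galaxy", "S8", "yesterday"]
labels = ["O", "O", "O", "B-Phone", "I-Phone", "O"]
print(spans_from_bio(tokens, labels))
```

In this sketch, the two tokens labeled B-Phone and I-Phone are joined into a single Phone entity, which mirrors how a multi-word name such as Galaxy S8 is extracted as one unit.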
  • Preparing data for entity extraction

    In the Source selection step of the entity extraction model creation wizard, select the extraction type and provide the data for training and testing your entity extraction model.

  • Defining the training set and training the entity extraction model

    In the Sample construction step of the entity extraction model creation wizard, select the data to use to train the model and the data to use to test the model's accuracy. In the Model creation step, build the model.

  • Accessing entity extraction model evaluation reports

    After you build the model, you can evaluate it by using accuracy measures such as F-score, precision, and recall. You can view the model evaluation report in the application or download the report to a local directory. You can also view the test results for each record.

  • Saving the entity extraction model

    After you create the model, you can export the binary file that contains the model to a local directory and store it for future use. You can also create a specialized rule that contains the model and use that rule in text analyzers in Pega Platform.

  • Best practices for creating entity extraction models

    Use extraction analysis to detect and classify named entities into predefined categories, such as names of people, locations, and organizations.
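
The accuracy measures named in the evaluation reports topic above (precision, recall, and F-score) can be sketched in a few lines. This is a minimal illustration that assumes exact-match scoring, where a predicted entity counts as correct only if both its text and its type match an expected entity. The source does not describe how Pega Platform computes these values, so the function name `evaluate_entities` and the sample data are hypothetical.

```python
def evaluate_entities(expected, predicted):
    """Return (precision, recall, F-score) for two collections of entities.

    Assumption: exact-match scoring on (text, type) pairs. Real evaluation
    reports may score partial matches or per-token labels differently.
    """
    expected, predicted = set(expected), set(predicted)
    true_positives = len(expected & predicted)
    # Precision: share of predicted entities that are correct.
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall: share of expected entities that were found.
    recall = true_positives / len(expected) if expected else 0.0
    # F-score (F1): harmonic mean of precision and recall.
    if precision + recall:
        f_score = 2 * precision * recall / (precision + recall)
    else:
        f_score = 0.0
    return precision, recall, f_score


# Hypothetical test record: three expected entities, two predictions.
expected = [("Galaxy S8", "Phone"), ("London", "Location"), ("Pega", "Organization")]
predicted = [("Galaxy S8", "Phone"), ("London", "Organization")]

precision, recall, f_score = evaluate_entities(expected, predicted)
print(f"precision={precision:.2f} recall={recall:.2f} f_score={f_score:.2f}")
```

Here only one of the two predictions is correct, and only one of the three expected entities is found, so precision is higher than recall. The F-score combines the two into a single number, which makes it easier to compare models.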
