This documentation site is for previous versions.

Visit our new documentation site for current releases.

Best practices for Stream service configuration

Updated on March 11, 2021

This content applies only to On-premises and Client-managed cloud environments

Follow these guidelines for the recommended configuration of the Stream service in your system.

Expected throughput

Data throughput depends on the number of nodes, CPUs, and partitions, as well as the replication factor and bandwidth

Review the results of tests on three running stream service nodes on machines with the following configuration:

CPU cores: 2
Memory (GB): 8
Bandwidth (Mbps): 450
Number of partitions: 20
Replication factor: 2

The following table shows the test results for writing messages to the stream (producer):

Producer throughput – test results

Records	Record size (bytes)	Threads	Throughput (rec/sec)	Average latency (ms)	MB/sec
5000000	100	1	83675	2.2	8.4
		5	172276.1	11.9	17.2
		10	216682.8	40.4	21.7
	100	1	32967.7	5.1	33
		5	53033.7	48.6	53
		10	49861.7	174.3	49.8
	100	1	76812.1	2.4	7.7
		5	165317.4	13.3	16.5
		10	203216.3	46.1	19.38
	1000	1	35865.7	4.8	35.9
		5	52456.7	41.9	52.4
		10	50266.1	158.8	50.3

The following table presents the test results for reading messages from the stream (consumer):

Consumer throughput – test results

Records	Record size (bytes)	Threads	Throughput (rec/sec)	MB/sec
5,000,000	100	1	120673.8	12
		5	150465	15
		10	143395.3	14.3
	1,000	1	54128.4	54.1
		5	55903.3	55.9
		10	54674.5	54.7

Disk space requirements

By default, the Kafka cluster stores data for 60 hours (2.5 days). You can change the retention period for specific stream categories by modifying the services/stream/category/retention/categoryName property in the prconfig.xml file, where categoryName can have one of the following values:

dataset
decisioning
queueprocessor
system

For example, you can set the retention period for streams in the QueueProcessor category by using the following property: services/stream/category/retention/queueprocessor.

For example:

Your goal is to process 100,000 messages per second, 500 bytes each, and to keep messages on the disk for one day. The replication factor is set to 2.

The expected throughput is 50 MB/sec:

3 GB is used in one minute for a single copy of the data.
6 GB of disk space is used in one minute due to the replication factor of 2.
The total throughput is 360 GB in one hour and 8.64 TB in one day.
Apart from your data, the Kafka cluster uses additional disk space for internal data (around 10% of the data size).

In that sample scenario, the total minimal disk size should be 9.5 TB.

Compression

Depending on your needs, you can choose data compression using one of the algorithms that Kafka supports: gzip, Snappy, or LZ4. Consider the following aspects:

Gzip requires less bandwidth and disk space, but this algorithm might not saturate your network while the maximum throughput is reached.
Snappy is much faster than gzip, but the compression ratio is low, which means that throughput might be limited when the maximum network capacity is reached.
LZ4 maximizes the performance.

Review the following table and diagram with throughput and bandwidth usage per codec:

Throughput and bandwidth per codec (%)

Codec	Throughput %	Bandwidth %
None	100	100
Gzip	47.5	5.2
Snappy	116.3	64.1
LZ4	188.9	34.5

The bar chart shows that LZ4 has the highest throughput. Snappy has the highest bandwidth and traffic. — Throughput and bandwidth metrics per codec (%)

Previous topic Configuring the Stream service
Next topic Monitoring the Stream service

Have a question? Get answers now.

Visit the Support Center to ask questions, engage in discussions, share ideas, and help others.

Visit the Support Center

Get Started with Community

Best practices for Stream service configuration

Expected throughput

Producer throughput – test results

Consumer throughput – test results

Disk space requirements

Compression

Throughput and bandwidth per codec (%)

Have a question? Get answers now.

Ready to crush complexity?

Experience the benefits of Pega Community when you log in.

Get Started with Community

Expected throughput

Producer throughput – test results

Consumer throughput – test results

Disk space requirements

Compression

Throughput and bandwidth per codec (%)

Have a question? Get answers now.

Ready to crush complexity?

Experience the benefits of Pega Community when you log in.

We'd prefer it if you saw us at our best.