Configure Memory Settings for the DLP Exact Data Matching Indexer

Available Languages

Download Options

PDF (17.8 KB)
View with Adobe Reader on a variety of devices
ePub (82.0 KB)
View in various apps on iPhone, iPad, Android, Sony Reader, or Windows Phone
Mobi (Kindle) (67.1 KB)
View on Kindle device or Kindle app on multiple devices

Updated:October 8, 2025

Document ID:225188

Bias-Free Language

The documentation set for this product strives to use bias-free language. For the purposes of this documentation set, bias-free is defined as language that does not imply discrimination based on age, disability, gender, racial identity, ethnic identity, sexual orientation, socioeconomic status, and intersectionality. Exceptions may be present in the documentation due to language that is hardcoded in the user interfaces of the product software, language used based on RFP documentation, or language that is used by a referenced third-party product. Learn more about how Cisco is using Inclusive Language.

Introduction

This document describes how to increase available memory for the DLP Exact Data Matching Indexer to work with large data sources in Cisco Umbrella.

Prerequisites

Requirements

There are no specific requirements for this document.

Components Used

The information in this document is based on Cisco Umbrella.

The information in this document was created from the devices in a specific lab environment. All of the devices used in this document started with a cleared (default) configuration. If your network is live, ensure that you understand the potential impact of any command.

Overview

The Exact Data Match Indexer is part of the Exact Data Match feature in Umbrella DLP. The tool indexes a customer data source (CSV file) and generates fingerprints of critical records which are uploaded to Umbrella for use in DLP policies. This article explains how to increase the available memory for the indexer to work with large data sources.

Problem

When a large data source (CSV file) is indexed, this error displays:

ERROR: Out of heap space; please rerun with an increased size (-Xmx).

Solution

Run the indexing tool with -Xmx specifying the amount of memory to allocate to the indexing tool. The memory allocation can be specified in mebibytes (m) or gibibytes (g). For example:

-Xmx1000m = 1000 mebibyte (1024 megabytes)
-Xmx1g = 1 gibibyte (1074 megabytes)

The required memory depends on the file size of the source file (CSV file). Umbrella recommends allocating memory at least twice the size of the source CSV file.

For example, if the source data is 512 MB, the memory can be allocated like this:

java -X1g -jar edm-indexer.jar -i source_file.csv -e template-id

If the tool is being run in an automated way, then the memory allocation must be increased to account for changes in the source data size.

Revision History

Revision	Publish Date	Comments
1.0	08-Oct-2025	Initial Release

Contributed by Cisco Engineers

Was this Document Helpful?

Feedback

Contact Cisco

Open a Support Case
(Requires a Cisco Service Contract)

This Document Applies to These Products

Umbrella

Configure Memory Settings for the DLP Exact Data Matching Indexer

Available Languages

Download Options

Bias-Free Language

Contents

Introduction

Prerequisites

Requirements

Components Used

Overview

Problem

Solution

Revision History

Contributed by Cisco Engineers

Was this Document Helpful?

Contact Cisco

This Document Applies to These Products