This page last changed on Dec 09, 2008 by rbattaglin.

Introduction

A Jira data anonymiser is also available.

Atlassian may request a copy of the entities.xml file from a customer's exported zip file to diagnose database corruption or to find a bug in Confluence.

If your data is confidential, you can run this program over your entities.xml file, removing all your data and leaving only the structure of the export.

Usage

To run the anonymiser on your backup:

  1. Download the anonymiser JAR.
  2. Extract the entities.xml file from your zipped backup file to the same directory as the JAR.
  3. Use the command prompt to go to the directory where both files (the JAR and entities.xml) are located.
  4. To create cleaned.xml, run the command:
     java -jar confluence-export-cleaner-1.1-jar-with-dependencies.jar entities.xml cleaned.xml
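
The steps above could also be scripted. The following is a minimal Python sketch and is not part of the anonymiser itself. The function names extract_entities, build_command and run_anonymiser are hypothetical. The sketch assumes that entities.xml sits at the top level of the backup zip, that the JAR has already been downloaded into the working directory, and that java is on the PATH.

```python
import subprocess
import zipfile
from pathlib import Path

# JAR name as given in the command above.
JAR_NAME = "confluence-export-cleaner-1.1-jar-with-dependencies.jar"


def extract_entities(backup_zip, work_dir):
    """Extract only entities.xml from the backup zip into work_dir."""
    with zipfile.ZipFile(backup_zip) as archive:
        # Assumption: entities.xml is stored at the top level of the archive.
        extracted = archive.extract("entities.xml", path=work_dir)
    return Path(extracted)


def build_command():
    """Return the anonymiser command line as an argument list."""
    return ["java", "-jar", JAR_NAME, "entities.xml", "cleaned.xml"]


def run_anonymiser(backup_zip, work_dir):
    """Extract entities.xml, then run the JAR from inside work_dir."""
    extract_entities(backup_zip, work_dir)
    # check=True raises CalledProcessError if java exits with an error.
    subprocess.run(build_command(), cwd=work_dir, check=True)
    return Path(work_dir) / "cleaned.xml"
```

Calling run_anonymiser("backup.zip", ".") from the directory that holds the JAR would correspond to steps 2 to 4. The file name backup.zip is a placeholder for your own backup file.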

How it works

The Content Anonymiser application replaces all the text content in the entities.xml file with 'x' characters. For example, the word "Atlassian" will be transformed to "xxxxxxxxx". The resulting cleaned.xml file is expected to be the same size as the original file.
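
To illustrate the replacement rule described above, here is a minimal Python sketch. It is not the anonymiser's own code. The helper names mask and anonymise are hypothetical, the sample element and attribute names are made up for the example, and keeping whitespace unchanged is an assumption (the description above only says that text becomes 'x' characters).

```python
import xml.etree.ElementTree as ET


def mask(text):
    """Replace every non-whitespace character with 'x'.

    Keeping whitespace as-is is an assumption made for this sketch.
    """
    return "".join(ch if ch.isspace() else "x" for ch in text)


def anonymise(xml_string):
    """Mask element text while leaving tags and attributes untouched."""
    root = ET.fromstring(xml_string)
    for element in root.iter():
        if element.text:
            element.text = mask(element.text)
        if element.tail:
            element.tail = mask(element.tail)
    return ET.tostring(root, encoding="unicode")


# Hypothetical sample; real entities.xml content will differ.
sample = '<object class="Page"><property name="title">Atlassian</property></object>'
print(anonymise(sample))
```

Because each character is replaced one for one, the masked text has the same number of characters as the original, which is why the structure and approximate size of the export survive while the content does not.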

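This release of the export cleaner uses STX (Streaming Transformations for XML), a fast and efficient XML transformation technology. Because STX processes the document as a stream rather than loading it whole, it should not require a lot of memory to run, even for a large backup.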
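
The memory behaviour of a streaming transformation can be illustrated with a short Python sketch. This uses Python's SAX filter classes rather than STX, so it shows the general idea only and not the export cleaner's actual implementation. The names MaskingFilter and clean_stream are hypothetical, and leaving whitespace unchanged is an assumption.

```python
import io
import xml.sax
from xml.sax.saxutils import XMLFilterBase, XMLGenerator


class MaskingFilter(XMLFilterBase):
    """Pass every SAX event through unchanged except character data."""

    def characters(self, content):
        # Text may arrive in several chunks; masking per character is
        # safe regardless of where the parser splits it.
        masked = "".join(ch if ch.isspace() else "x" for ch in content)
        super().characters(masked)


def clean_stream(source, target):
    """Stream XML from source to target, masking text as it passes.

    Only the current parser event is held in memory, not the whole file.
    """
    reader = MaskingFilter(xml.sax.make_parser())
    reader.setContentHandler(XMLGenerator(target, encoding="utf-8"))
    reader.parse(source)


# Small in-memory demonstration.
demo_out = io.StringIO()
clean_stream(io.BytesIO(b"<a><b>Atlassian</b></a>"), demo_out)
print(demo_out.getvalue())
```

The same function could be pointed at files opened in binary mode instead of in-memory buffers. Note that this sketch writes plain character data, so any CDATA section markers in the input would not be preserved in the output.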

Development

For Atlassian developers:


exportcleaner.jar (application/java-archive)
dom4j-1.4-full.jar (application/java-archive)
confluence-export-cleaner-1.1-jar-with-dependencies.jar (application/octet-stream)
Document generated by Confluence on Jul 09, 2010 01:11