Confluence 3.5 has reached end of life
Check out the [latest version] of the documentation
Atlassian may request a copy of the entities.xml
file from a customer's exported zip file (backup file), in order to diagnose database corruption or to find a bug in Confluence.
If your data is confidential, you can run an anonymiser program over your entities.xml
file to remove all your data and leaving only the structure of the export.
Usage
To run the Content Anonymiser on your backup file:
- Download the anonymiser JAR (attached to this page).
- Extract the
entities.xml
file from your zipped backup file to the same directory as the JAR. - Use the command prompt to go to the directory where all three files are located.
- To create
cleaned.xml
, run the command:
java -jar confluence-export-cleaner-1.1-jar-with-dependencies.jar entities.xml cleaned.xml
How it works
The Content Anonymiser program replaces all the text content in file entities.xml
with 'x' characters. For example, the word "Atlassian" will be transformed to "xxxxxxxxx". The resulting cleaned.xml
file is expected to have the same size as the original file.
This release of the Content Anonymiser uses STX, a fast and efficient XML transformation technology. It should not require a lot of memory to run, even for a large backup.
Development
For Atlassian developers: