Content Anonymiser for Data Backups
Atlassian may request a copy of the
entities.xml file from your exported zip file (backup file), in order to diagnose database corruption or to troubleshoot a bug in Confluence.
If your data is confidential, you can run an anonymiser program over your
entities.xml file to remove all your data, leaving only the structure of the export.
To run the Content Anonymiser on your backup file:
- Download the anonymiser JAR (attached to this page).
- Extract the
entities.xml file from your zipped backup file to the same directory as the JAR.
- Use the command prompt to go to the directory where all three files are located.
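- Run the anonymiser JAR on entities.xml so that the output is written to cleaned.xml.
- Move the original entities.xml file to a different location and then rename cleaned.xml to entities.xml.
- Re-ZIP the new entities.xml file together with exportDescriptor.properties, so that Atlassian Support knows exactly which version of Confluence the XML backup was exported from.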
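The extraction and re-zipping steps around the anonymiser run can also be scripted. The following is a minimal Python sketch, not part of the Atlassian tool: the function names `extract_entities` and `repackage` are hypothetical, and it assumes the backup stores entities.xml and exportDescriptor.properties at the top level of the ZIP. Running the anonymiser JAR itself remains a manual step between the two calls.

```python
import zipfile
from pathlib import Path


def extract_entities(backup_zip, work_dir):
    """Extract entities.xml from the backup into work_dir and return its path."""
    with zipfile.ZipFile(backup_zip) as zf:
        zf.extract("entities.xml", work_dir)
    return Path(work_dir) / "entities.xml"


def repackage(backup_zip, cleaned_xml, output_zip):
    """Create a new ZIP holding the cleaned file under the name entities.xml,
    plus exportDescriptor.properties copied unchanged from the original backup.
    Nothing else from the original backup is copied (an assumption of this sketch)."""
    with zipfile.ZipFile(backup_zip) as src, \
            zipfile.ZipFile(output_zip, "w", zipfile.ZIP_DEFLATED) as dst:
        dst.write(cleaned_xml, arcname="entities.xml")
        dst.writestr("exportDescriptor.properties",
                     src.read("exportDescriptor.properties"))
```

Because the cleaned file is stored under the archive name entities.xml, the separate rename step is not needed in this variant, and the original backup file is left untouched.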
How it works
The Content Anonymiser replaces all text content in file
entities.xml with 'x' characters. For example, the word "Atlassian" will be transformed to "xxxxxxxxx". The resulting
cleaned.xml file is expected to have the same size as the original file.
This release of the Content Anonymiser uses STX, a fast and efficient XML transformation technology. It should not require a lot of memory to run, even for a large backup.
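To illustrate the idea, here is a minimal Python sketch of a streaming text replacement. It is not the Content Anonymiser's implementation: it uses Python's SAX parser rather than STX, the names `XFilter` and `anonymise` are hypothetical, and keeping whitespace and attribute values unchanged is an assumption of this sketch. Like a streaming transformation, it processes the document event by event instead of loading it fully into memory.

```python
import xml.sax
from xml.sax.saxutils import XMLGenerator


class XFilter(XMLGenerator):
    """Re-emits the XML it receives, masking element text with 'x'."""

    def characters(self, content):
        # Replace every non-whitespace character; whitespace is kept
        # so that the layout of the file is preserved (sketch assumption).
        masked = "".join(ch if ch.isspace() else "x" for ch in content)
        super().characters(masked)


def anonymise(source, out):
    """Stream XML from source (file name or binary file object) to the
    text stream out, with all element text masked."""
    handler = XFilter(out, encoding="utf-8")
    xml.sax.parse(source, handler)
```

With this sketch, an element such as `<title>Atlassian</title>` is written out as `<title>xxxxxxxxx</title>`, while element names and attributes pass through untouched.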