|
Append "useUnicode=true to your JDBC URL:
jdbc:mysql://hostname:port/database?useUnicode=true&characterEncoding=utf8 |
If you are modifying
|
Create a UTF-8 database with binary UTF-8 collation.
Binary UTF-8 provides case-sensitive collation.
CREATE DATABASE confluence CHARACTER SET utf8 COLLATE utf8_bin; |
You will also need to set the Server Characterset to utf8. This can be done by adding the following in my.ini for Windows or my.cnf for other OS. It has to be declared in the Server section, which is the section after [mysqld]:
[mysqld] default-character-set=utf8 |
status command to verify database character encoding information. ![Confluence 4.0 > Configuring Database Character Encoding > mysql-database-creation-and-status-check-via-console.png (mysql-database-creation-and-status-check-via-console.png [Screenshot of MySQL Console screen showing database creation and status check commands with encoding highlighted.])](/download/attachments/269985489/mysql-database-creation-and-status-check-via-console.png?version=1&modificationDate=1247635701761&api=v2)
In some cases, the individual tables collation and character encoding may differ from the one that the database as a whole has been configured to use. Please use the command below to ensure all tables within your Confluence database are correctly configured to use UTF-8 character encoding and binary UTF-8 collation:
use confluence; show table status; |
Check for the value listed under the Collation column, to ensure it has been set to utf8_bin (that is, case-sensitive) collation for all tables.
If not, then this can be changed by the following command, executed for each table in the Confluence database:
ALTER TABLE tablename CONVERT TO CHARACTER SET utf8 COLLATE utf8_bin; |
Please substitute the <tablename> above, with each table within the confluence database.
Relevant MySQL manual for more detailed explanation:
CREATE DATABASE confluence WITH ENCODING 'UNICODE'; |
Or from the command-line:
$ createdb -E UNICODE confluence |
For more information see the PostgreSQL documentation.
Please note that international characters sets are only fully supported and functional when using PostgreSQL 8.1 and above under Microsoft Windows.
Please make sure you check the following to ensure proper handling of international characters in your database |
When PostgreSQL creates an initial database cluster, it sets certain important configuration options based on the host enviroment. The command responsible for creating the PostgreSQL environment initdb will check environment variables such as LC_CTYPE and LC_COLLATE (or the more general LC_ALL) for settings to use as database defaults related to international string handling. As such it is important to make sure that your PostgreSQL environment is configured correctly before you install Confluence.
To do this, connect to your PostgreSQL instance using pgsql and issue the following command:
SHOW LC_CTYPE; |
If LC_CTYPE is set to either "C" or "POSIX" then certain string functions such as converting to and from upper and lower case will not work correctly with international characters. Correct settings for this value take the form <LOCALE>.<ENCODING> (en_AU.UTF8 for example).
If your LC_CTYPE is incorrect please check the PostgreSQL documentation for information on configuring database localisation. It is not easy to change these settings with a database that already contains data.
If you're using a existing database, confirm the Character Encoding by executing the query: |
Before proceeding with the following changes, please backup your database. |
This example shows how to change your database from latin1 to utf8.
mysqldump -p --default_character-set=latin1 -u <username> --skip-set-charset confluence > confluence_database.sqlconfluence_database.sql to confluence_utf8.sqlconfluence_utf8.sql in a text editor and change all character sets from 'latin1' to 'utf8'recode latin1..utf8 confluence_utf8.sql (the recode utility is described at http://directory.fsf.org/recode.html; it can actually be downloaded from http://recode.progiciels-bpi.ca/, and is available for Ubuntu via apt-get)In MySQL:
DROP DATABASE confluence;CREATE DATABASE confluence CHARACTER SET utf8 COLLATE utf8_bin;Finally, reimport the UTF-8 text file:
mysql -u <username> -p --default-character-set=utf8 --max_allowed_packet=64M confluence < /home/confluence/confluence_utf8.sqlTo support large imports, the parameter '--max_allowed_packet=64M' used above sets the maximum size of an SQL statement to be very large. In some circumstances, you may need to increase it further, especially if attachments are stored in the database.
See Troubleshooting Character Encodings for a number of tests you can run to ensure your database encoding is correct.
Character encodings in Confluence
Known Issues for MySQL