I have problem with UNICODE character showing in PDF file. There is a solution for this that it is not very efficient for me. The solution is like this.
document.add(new Paragraph("Unicode: \u0418", new Font(bfComic, 12)));
You are experiencing different problems:
Encoding of the data:
You need to make sure that you retrieve data from the database using the correct encoding. For instance:
String name1 = new String(rs.getBytes("given_name"), "UTF-8");
I use "UTF-8" in this example, because my database contains different names with special characters stored as "UTF-8". I risk that these special characters are displayed as gibberish if I would retrieve the field like this:
String name2 = rs.getString("given_name")
Encoding of the font:
You create your font like this:
Font font = new Font(bfComic, 12);
You don't show how you create
bfComic, but I assume that this object is a
BaseFont object using
IDENTITY_H as encoding.
Writing from right to left / making ligatures
Although your code will work to show a single character, it won't work to show a sentence correctly.
name1 is the Arabic version of the name "Lawrence of Arabia" and that we want to write this name to a PDF. This is done three times in the following screen shot:
The first line is wrong, because the characters are in the wrong order. They are written from left to right whereas they should be written from right to left. This is what will happen when you do:
Even if the encoding is correct, you're rendering the text incorrectly.
The second line is also wrong. The characters are now in the correct order, but no ligatures are made: Ù„ followed by Ùˆ should be combined into a single glyph: Ù„Ùˆ
You can only achieve this by adding the content to a
PdfPCell object, and by setting the run direction to
PdfWriter.RUN_DIRECTION_RTL. For instance:
Now the text will be rendered correctly.
This is explained in chapter 11 of my book. You can find a full example here: Ligatures2