Which UTF code should I use?

Which Unicode encoding should I use in my HTML document? My document contains the following codes:

Ā which renders Ā

ā which renders ā

á which renders á

í which renders í


which renders सरस्वतीनदी (Sanskrit)

ཐིག་ལེ which renders ཐིག་ལེ (Tibetan)

རླུང which renders རླུང (Tibetan)

I haven't included the semicolons because the codes would otherwise render as characters. I don't know much about Unicode, but I intend to learn about it. After realising that UTF-8 cannot be used with these characters, I wondered if another encoding could be used, such as UTF-16 or UTF-32. I've also heard of the BOM (byte order mark). Does this apply, and is there anything else I must include? Thanks.

1 Answer

  • 7 years ago
    UTF-8 can encode ALL Unicode characters, including every one in your question. In general, you should always use UTF-8.
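
    If you want to check this for yourself, here is a rough sketch in Python (my choice of language, nothing to do with HTML itself). It takes the characters from the question, encodes them as UTF-8 and decodes them again. The names samples and utf8_round_trip are made up for this illustration.

```python
import html

# Characters copied from the question: Latin letters with diacritics,
# Devanagari (Sanskrit) and Tibetan.
samples = ["Ā", "ā", "á", "í", "सरस्वतीनदी", "ཐིག་ལེ", "རླུང"]


def utf8_round_trip(text: str) -> bool:
    """Encode text as UTF-8 and decode it back; True if nothing was lost."""
    return text.encode("utf-8").decode("utf-8") == text


for s in samples:
    encoded = s.encode("utf-8")
    print(s, "->", len(encoded), "bytes, survives round trip:", utf8_round_trip(s))

# A numeric character reference is just another way of writing a code point.
# &#256; is the decimal reference for U+0100 (Ā).
print(html.unescape("&#256;") == "\u0100")
```

    Every sample should survive the round trip, which means that in a UTF-8 document you can type the characters directly and do not need the numeric codes at all.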

    A BOM (byte order mark) is almost never needed with UTF-8; just avoid it. A UTF-8 HTML file does not need one anyway.
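
    To see what a BOM actually is, here is a small Python sketch, again only an illustration. It encodes the same text with and without a BOM using Python's built-in "utf-8" and "utf-8-sig" codecs. The sample string and the variable names are my own.

```python
import codecs

text = "<p>Ā རླུང</p>"

plain = text.encode("utf-8")         # UTF-8 without a BOM
with_bom = text.encode("utf-8-sig")  # same bytes with a BOM in front

# The BOM is nothing more than three extra bytes at the start of the file.
print(codecs.BOM_UTF8)                      # the bytes EF BB BF
print(with_bom == codecs.BOM_UTF8 + plain)  # the content itself is unchanged

# A decoder that does not strip the BOM leaves U+FEFF at the start of the text.
print(repr(with_bom.decode("utf-8")[0]))

# "utf-8-sig" strips a BOM if there is one and accepts files without one.
print(with_bom.decode("utf-8-sig") == text)
print(plain.decode("utf-8-sig") == text)
```

    The text decodes to the same characters either way, so the BOM adds nothing for a UTF-8 file except three bytes that some tools may not expect.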

