Unicode Encoder

Text to Unicode

Code Point to Character

Unicode Encoder

Convert text to Unicode code points, UTF-8 hex bytes, HTML entities, and more. Enter any character or text to inspect its encoding.

FAQ

Q: What is a Unicode code point?

A: A code point is a unique number assigned to each character in the Unicode standard, written as U+XXXX (e.g. U+0041 for "A"). Unicode covers over 140,000 characters across all scripts.

Q: What is UTF-8?

A: UTF-8 is a variable-width encoding that represents Unicode code points as 1–4 bytes. It is the most widely used encoding on the web and is backward compatible with ASCII.

Q: What is an HTML entity?

A: HTML entities are sequences like A or A that represent special characters in HTML. They are used to safely include characters that might otherwise be interpreted as HTML markup.