Why can't a raw hash or raw bytes just be printed as text?

Raw bytes contain invisible control characters, characters that break URL parsing, and characters that some email systems treat as line endings or render differently across operating systems. Pasting such a string somewhere and getting it back unchanged is impossible to guarantee, so we encode the bytes into a safe character set first, accepting that the result gets a bit longer.

What's the difference between hex, Base58, and Base64?

All three represent the same underlying bytes using different alphabets. Hex uses 0-9 and a-f, so two characters equal one byte and the output is exactly twice the length. Base58 drops look-alike characters like 0, O, I, and l so humans can copy it by hand without mistakes, and Base64 uses 64 symbols including + and / to stay compact, which is common for machine-to-machine data like email attachments.

Encoding | RedDuck Academy

Q: Is encoding the same as encryption?

No, they are completely different despite the similar names. Encryption scrambles data with a key so only someone with that key can read it, while encoding just translates data into a different representation so it can travel safely through systems that restrict which characters they accept. Encoding has no key, so anyone can reverse it, which means encoding something never keeps it secret.

Encoding is not encryption

Encoding and encryption are completely different things, even though the words sound similar.

Encryption scrambles data so only someone with the right key can read it. The output is supposed to be unreadable to everyone else. The next lesson covers encryption properly.

Encoding translates data into a different representation so it can travel safely through systems that have constraints on what characters they accept. The output is supposed to be readable. There is no key. Anyone can decode it.

The same two bytes can be encoded as Hi in ASCII, 48 69 in hex, SGk= in Base64, or 6Wc in Base58. All four represent the same underlying data. Switching between them does not change the data, it changes how the data looks on the page.

If someone ever tells you they "encoded" a password to keep it safe, they didn't.

Why raw bytes can't just be displayed

The hash from the previous lesson is 256 bits of binary data. Try to print those 256 bits as ASCII characters directly and you'll get something like this in your terminal:

,Ò$Û¥û°£.&è;*ÅùZé¿b^p43 62)8¹$

That string contains invisible control characters, characters that break URL parsing, characters that some email systems treat as line endings, and characters that render differently on different operating systems. Pasting it somewhere and getting it back unchanged is impossible to guarantee.

So we encode it. Pick a representation that uses only safe characters, accept that the resulting string will be longer than the raw bytes, and move on. The different encoding schemes you'll meet make different tradeoffs about which character set to use and how long the output ends up.

The encodings you'll see

Three formats cover almost everything in the ecosystems this course touches.

Hex

Each byte becomes two characters drawn from 0–9 and a–f. The hash output from the previous lesson, 2cf24dba5fb0a30...2938b9824, is hex. So is almost any long string of letters and numbers you've seen labeled as an "address" or a "transaction hash." The leading 0x you sometimes see is a convention that says "what follows is hex," borrowed from C.

Properties:

Each hex character is exactly 4 bits, so two characters always represent one byte. Converting back and forth with raw bytes is trivial.
Output is exactly twice the length of the input. 32 bytes of hash become 64 hex characters.
Case-insensitive by default. Some systems use mixed case to encode a checksum without changing what the address means, but lowercase the same string and it still resolves to the same data.

Hex is the standard representation for anything that's "raw bytes shown as text." If you're staring at a long string of 0-9a-f characters, you're looking at hex.

Base58

Uses 58 characters: all digits and letters except 0, O, I, and l. Those four are excluded because they look alike in many fonts and would be easy to misread when a human copies a string by hand.

A string like 1A1zP1eP5...5SLmv7DivfNa is Base58. There's also a variant called Base58Check that adds a few extra checksum bytes so a typo in a copied string is detected before damage is done.

Properties:

Shorter than hex for the same data, because 58 symbols carry more information per character than 16.
Designed for humans copying strings by hand. The excluded characters are the ones people misread.
Not aligned to byte boundaries the way hex is, so converting is slightly more involved.

Base58 is the format reached for whenever a system needs to give humans something to copy without losing characters to font ambiguity. Anywhere a user might paste a long secret into a phone keypad or read it aloud, Base58 is the safer choice.

Base64

Uses 64 characters: A–Z, a–z, 0–9, plus + and /, with = for padding. Predates blockchain by more than a decade. It's how email attachments are encoded for transport over text-only systems, how images get inlined into HTML data URLs, and how API tokens get transmitted in HTTP headers.

Properties:

Slightly more compact than Base58 because it has six more symbols in its alphabet.
Not designed for humans copying strings. Uses both 0 and O, both I and l. Fine for machine-to-machine, bad for "read this aloud over the phone."
The + and / characters break URL parsing, so a URL-safe variant exists that swaps them for - and _.

You'll meet Base64 in plenty of non-blockchain web contexts, and recognising it on sight saves time. The trailing = padding is the giveaway.

Side-by-side

The same 5-byte input (hello) encoded four ways:

Encoding	Output	Length
Binary (raw)	`01101000 01100101 01101100 01101100 01101111`	40 bits
Hex	`68656c6c6f`	10 characters
Base58	`Cn8eVZg`	7 characters
Base64	`aGVsbG8=`	8 characters

Same 5 bytes. Four different visual representations. Each one is reversible to the exact same binary input.

The playground below lets you try this yourself. Type any input and watch hex, Base58, and Base64 produced from the same underlying bytes.

A practical note for later in the course

Hex and Base58 cause confusion in one specific way: when you start meeting addresses on different systems, some will look like long lowercase-hex strings and others will look like mixed-case Base58 strings. They look so different that it's tempting to assume they represent fundamentally different kinds of data. They don't. Underneath, both formats are usually wrapping the same kind of fixed-size byte sequence that comes out of a cryptographic process. The encoding is the wrapper, the data inside is the same kind of thing.

This will matter again in the wallet-creation lesson, where the same starting secret produces addresses that look very different across different systems. Part of that difference is just the encoding wrapper, and the lesson will show how much.

01.Welcome to the RedDuck Academy

02.What is web3

03.Cryptography

04.How blockchains work

05.Bitcoin

06.Beyond Bitcoin

Encoding

Encoding is not encryption

Why raw bytes can't just be displayed

The encodings you'll see

Hex

Base58

Base64

Side-by-side

A practical note for later in the course