From “Hello” to Meaning: How AI Turns Your Words Into Understanding

(Tokenization & Embeddings explained like we’re having coffee) ☕
Imagine you have a box of LEGO bricks.
You can build anything — a castle, a car, even a tiny dragon — but first you need to break your idea into smaller pieces.
That’s exactly how computers handle text.
Step 1: Tokenization — Breaking Text into Pieces
When we read:
Hello world!
We see two words and a “!”.
But a computer? It needs to turn this into tiny “tokens” — little chunks it can understand.
These chunks could be:
Whole words:
"Hello"
Parts of words:
"Hell"
+"o"
Even spaces and punctuation:
" "
or"!"
Think of it like:
Sentence → LEGO bricks of words and symbols.
Example:"I love cats"
might become:
["I", "love", "cats"]
Or even:
["I", "lo", "ve", "cat", "s"]
depending on how small we break it.
Step 2: Embeddings — Giving Meaning to the Pieces
Okay, so now the computer has a bunch of “tokens.”
But “token #42” doesn’t mean anything… unless we give it meaning.
Embeddings are like giving each LEGO brick a secret label in numbers that says:
What it is
How it relates to other bricks
Example:
In the computer’s “map of words,”
“Cat” might be near “Dog” and “Kitten”
“Apple” might be near “Banana” and “Mango”
Why?
Because the computer learns these relationships by reading tons of books, articles, and websites.
Why Do We Do This?
Computers don’t “understand” like we do.
They work with numbers.
Tokenization turns our messy human words into neat chunks.
Embeddings turn those chunks into numbers with meaning.
Once it has that, the computer can:
Write stories ✍️
Answer questions ❓
Translate languages 🌍
Even make silly jokes 😂
The Journey in One Line
Your sentence → Broken into tokens → Each token gets a meaning → Computer starts doing its magic.
If you’ve read this far, congratulations — you now understand the first step of how ChatGPT and other AI tools read!
It’s not magic. It’s LEGO… but with words.
Subscribe to my newsletter
Read articles from Vikas TC directly inside your inbox. Subscribe to the newsletter, and don't miss out.
Written by
