How Chemistry Uses Subscript Notation

In chemistry, subscript numbers in molecular formulas specify the number of atoms of each element in one molecule of a compound. The subscript appears immediately after the element symbol. In H₂O, the subscript ₂ tells us there are two hydrogen atoms bonded to one oxygen atom. In CO₂, the subscript ₂ indicates two oxygen atoms attached to one carbon atom.

This notation — called the molecular formula or chemical formula — is one of chemistry's most fundamental communication tools. It encodes the exact composition of a compound in a compact, universally understood format. The subscript numbers are not optional decoration; they carry precise stoichiometric information.

Traditionally, writing these formulas required either specialized software (LaTeX, ChemDraw), HTML formatting (<sub> tags), or word-processor subscript menus. The problem: all of these methods produce formatting that disappears when you copy the text into plain-text environments — emails, social media, chats, and databases.

Unicode solves this. The Unicode "Superscripts and Subscripts" block (U+2070–U+209F) contains dedicated subscript digit glyphs (₀₁₂₃₄₅₆₇₈₉) that are actual UTF-8 characters. When you write H₂O using Unicode ₂ instead of a formatted "2", the subscript is embedded in the character itself — it survives copy-paste into any platform that supports basic Unicode text, which is essentially every modern device.