This article shows a couple of interesting zero-width characters techniques for the invisible fingeprinting of text.
In early 2016 I realized that it was possible to use zero-width characters, likeĀ zero-width non-joinerĀ or other zero-width characters like theĀ zero-width spaceĀ to fingerprint text. Even with just a single type of zero-width character the presence or non-presence of the non-visible character is enough bits to fingerprint even the shortest text.
[…]
I also realized that it is possible to useĀ homoglyphĀ substitution (e.g., replacing the letter “a” with its Cyrillic counterpart, “Š°”), but I dismissed this as too easy to detect due to the differences in character rendering across fonts and systems. However, differences in dashes (en, em, and hyphens), quotes (straight vs curly), word spelling (color vs colour), and the number of spaces after sentence endings could probably go undetected due to their frequent use in real text.
[…]
The reason I’m writing about this now is that it appears bothĀ homoglyph substitutionĀ andĀ zero-width fingerprintinghave been discovered by others, so journalists should be informed of the existence of these techniques.