FAQ Desk: Why Does My .docx File Look Like Junk In Notepad++
Hello, and welcome to the FAQ Desk. If you find yourself linked to this post, it is likely because you asked something like “Why does my .docx file look like junk [in Notepad++]?”, or “Why doesn’t Find In Files match anything when searching a directory of PDFs?”
Notepad++ is an editor for plaintext files. The file types
.docx(modern MS Word files) and
(There are times, especially in the PDF, when there is some plain text… but there is no guarantee that the sequence you are looking for will stay contained in plaintext; it might get separated by some binary characters, or otherwise have binary control characters embedded along with it, inhibiting your search. Even here, finding what you expect might be the exception rather than the rule.)
Notepad++ was not built to read such binary files; if you want to read or search
.docxfiles, you need to use a program (usually a word processor, such as MS Word, LibreOffice, OpenOffice, or the like) that is specifically designed to read such files; similarly, for reading
What you are asking is the equivalent of “I just brought my friend, who only reads English, over to index my personal library: Why is she not able to index my Russian, Hindi, and ancient Greek books?” That friend could be reasonably expected to understand British English, American English, Canadian English, and Australian English (in my analogy, various standard encodings of the same underlying text, such as the ASCII, UTF8, …), but it is unreasonable to expect her to also understand shorthand Sanskrit (in my analogy, a compressed binary format with its own proprietary encoding).