My notepad txt file turned into weird characters

Some years ago I wrote some notes in a .txt (plain text) file in Notepad, and when I opened it recently it was full of these weird characters. I don't know at what point this happened, but it could have been during the move from Windows 7 -> external drive -> Windows 10 (current). Some other .txt files in the same folder ended up the same way, although the majority were unchanged. This makes me suspect either a conversion error between the two Windows versions or file corruption. Also, when I opened the same file in Notepad++, it looked like this. When I pasted it into the Google Translate text box, the characters with a black background turned into some kind of coded matrix, so maybe that gives a clue. I have already tried decoding it in many ways, without success. Does anyone know whether this can be solved by decoding to plain text (ASCII), or whether the files are corrupted and there is no way back? Thanks.

asked Nov 22, 2019 at 14:19

What language were the files written in? (Commented Nov 22, 2019 at 14:35)

Please post one (small) file. Could it be that you wrote in Unicode without a BOM? (Commented Nov 22, 2019 at 14:46)

3 Answers

Some years ago I wrote some notes in a .txt (plain text) file in Notepad, and when I opened it recently it was full of these weird characters. I don't know at what point this happened, but it could have been during the move from Windows 7 -> external drive -> Windows 10 (current).

Some other .txt files in the same folder ended up the same way, although the majority were unchanged. This makes me suspect either a conversion error between the two Windows versions or file corruption.

The files got corrupted. It might be a hardware problem or an OS problem, although it's much more likely that they were corrupted while being copied to or from the external drive (e.g. through a bad USB connection or a damaged drive) than during an OS upgrade.

when I opened the same file in Notepad++, it looked like this. When I pasted it into the Google Translate text box, the characters with a black background turned into some kind of coded matrix, so maybe that gives a clue.

These are "control characters": they're meant to be interpreted by programs rather than shown on screen, and normally they wouldn't occur in a text file at all (except for CR/LF/TAB, of course). They therefore have no standard visual representation, and different programs display them in different ways when they do occur. That is why the same bytes show up as labelled black boxes in Notepad++ and as something else entirely in a browser text box.
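As a rough illustration (a sketch, not something from the original post), stray control bytes can be located and made visible with a few lines of Python. The names `find_control_bytes` and `visualize` are made up for this example, and rendering with the Unicode "Control Pictures" block (U+2400 onwards) is just one of many possible display conventions:

```python
ALLOWED = {0x09, 0x0A, 0x0D}  # TAB, LF, CR are normal in text files


def find_control_bytes(data: bytes) -> list:
    """Return (offset, byte value) pairs for control bytes other than TAB/LF/CR."""
    return [
        (offset, value)
        for offset, value in enumerate(data)
        if (value < 0x20 or value == 0x7F) and value not in ALLOWED
    ]


def visualize(data: bytes) -> str:
    """Render unexpected control bytes as Unicode Control Pictures symbols."""
    out = []
    for value in data:
        if value < 0x20 and value not in ALLOWED:
            out.append(chr(0x2400 + value))  # e.g. NUL becomes U+2400
        elif value == 0x7F:
            out.append("\u2421")  # symbol for DEL
        else:
            # Assumption for display only: show every other byte as Latin-1.
            out.append(chr(value))
    return "".join(out)
```

To try it on a real file, read it in binary mode first, for example `data = open("notes.txt", "rb").read()`, where `notes.txt` is a placeholder name. A healthy plain-text file should give an empty list from `find_control_bytes`.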

I have already tried decoding it in many ways, without success. Does anyone know whether this can be solved by decoding to plain text (ASCII), or whether the files are corrupted and there is no way back?

In Notepad++, the file looks as if it's half UTF-8 and half garbage (accented 'A's such as "Ã" tend to show up when a UTF-8 file is misinterpreted as Windows-1252).
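For reference, here is a small Python sketch of how that kind of misinterpretation produces accented 'A's, and how it can be undone when the bytes themselves are still intact. The sample word and the helper name `try_repair` are assumptions made for the example, not taken from the file in question:

```python
from typing import Optional

# Hypothetical sample text; the language of the real notes is unknown.
original = "ação"

# Saving as UTF-8 and then reading the bytes as Windows-1252 gives mojibake.
mojibake = original.encode("utf-8").decode("cp1252")
print(mojibake)  # aÃ§Ã£o


def try_repair(text: str) -> Optional[str]:
    """Reverse a UTF-8-read-as-Windows-1252 mix-up, or return None if impossible."""
    try:
        return text.encode("cp1252").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        return None


print(try_repair(mojibake))  # ação
```

If `try_repair` returns `None` for text copied out of the damaged file, the accented characters are not a clean encoding mix-up, which would be consistent with real corruption.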

However, in this case it's probably just a coincidence and there's likely nothing decodable in this file anymore.
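If you want to check that for yourself, one crude test is to decode the raw bytes with a few likely encodings and see how much of the result is readable. This is only a heuristic sketch: the candidate list is my own guess at what Notepad may have used (its "Unicode" option is UTF-16 little-endian), and `score_decodings` is a name invented for the example:

```python
CANDIDATES = ("utf-8", "utf-16-le", "utf-16-be", "cp1252")


def score_decodings(data: bytes, encodings=CANDIDATES) -> dict:
    """Map each encoding to the fraction of decoded characters that look readable."""
    scores = {}
    for enc in encodings:
        # errors="replace" turns undecodable bytes into U+FFFD instead of raising.
        text = data.decode(enc, errors="replace")
        if not text:
            scores[enc] = 0.0
            continue
        readable = sum(
            1
            for ch in text
            if ch != "\ufffd" and (ch.isprintable() or ch in "\t\r\n")
        )
        scores[enc] = readable / len(text)
    return scores
```

A high score does not prove that an encoding is the right one (UTF-16 with the wrong byte order can turn English text into perfectly printable CJK characters), so the output still needs to be read by eye. If every candidate scores poorly, though, that supports the conclusion that there is nothing left to decode.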