Help Wanted: Delete 1 Language, Keep the Rest from a Text Dump

Hi!
First time posting here, so please don't pelt me with rotten tomatoes if this is the wrong forum or I made some other faux pas!
I'm working on localising Terra Phantastica with Malenko.

I have a text dump of the game kindly provided by Conan (not Derek, sorry, I guess I'm just enamored with Derek ).
But, it's basically 90% control code gobbledegook (that's the technical term for program code, right?) and 10% Japanese.
I just want the Japanese so I don't have to sift through the codes when I want to go to the next line of JP when translating.

Is there an easy way that's not mega technical of deleting all the codes so I have just the JP left?

I've been Googling for hours looking for search and replace tricks in Word or something, but couldn't find anything that works.
Saw some python scripts and stuff too, but I have no idea what to do with those😀

Maybe there's some experienced Saturn localiser who's been in a similar sitch who could help?
 
Last edited:
it would be pretty easy to strip the text out with a regex capture. but don't you kind of need the identifiers and other information to put it back into the game? otherwise you just have random text strings? I could write something in 10 minutes to do it, but I don't understand what you're trying to accomplish.
 
it would be pretty easy to strip the text out with a regex capture. but don't you kind of need the identifiers and other information to put it back into the game? otherwise you just have random text strings? I could write something in 10 minutes to do it, but I don't understand what you're trying to accomplish.
He's just translating the text, I'm going to worry about adjusting control codes to get the text to fit. There are control codes that print strings from memory, like the player name, but everything is encapsulated by the text decode/end text decode controls.

Also that dump is from @Mr Conan but @derek (ateam) also had a more manual dumping tool that was a big help in getting started with this. Continuing thanks to the both of them for all their help.
 
it would be pretty easy to strip the text out with a regex capture. but don't you kind of need the identifiers and other information to put it back into the game? otherwise you just have random text strings? I could write something in 10 minutes to do it, but I don't understand what you're trying to accomplish.
Don't know about other people who do this a lot, this is only my 2nd time doing a full game fan translation after Bomberman Fight, and I didn't need to deal with codes for that, praise Malenko.
I do this for a living, and we never have to deal with codes like this (just the odd HTML tag or variables in the text, which is easy enough).

The idea is, I want the text just laid out in a column I can plop into Excel so I can see easily how one string might relate to another for general understanding/context purposes.

I've tried feeding the dump files as-is into OmegaT (software I use in order to keep translation memory and terminology organised), and I can translate that way, but the codes are still there, so it's still a PITA to sift through that stuff.

Also, I don't know what a regex edit is...
Total non-code guy here
 
He's just translating the text, I'm going to worry about adjusting control codes to get the text to fit. There are control codes that print strings from memory, like the player name, but everything is encapsulated by the text decode/end text decode controls.

Also that dump is from @Mr Conan but @derek (ateam) also had a more manual dumping tool that was a big help in getting started with this. Continuing thanks to the both of them for all their help.
Apologies for the name mix up, I totally brain farted there!
Thanks indeed to both Conan and Derek!
 
Back
Top