Well, I haven't shown any of my work, so it's interesting you'd put so much faith into it. I guess I should at least demonstrate what I know about the Inkling numeral system:
This was derived from a combination of the signs hanging from the ceiling in Walleye Warehouse, a small sign posted in Camp Triggerfish that happened to read "1234," the datestamps and timestamps on the photographs of the initial news reports about the missing Great Zapfish and the "UFO" sighting, the markings on the shipping crates in various stages (mostly Port Mackerel, as the lighting is brightest there), and the advertisements in Arowana Mall indicating some event that has happened. Through this, I can also deduce
Splatoon is set in the Inkling year 2015 and that those events in Arowana Mall have already happened. You will also find that, no matter where in the game the characters in this numeral set are located, they are always found together.
Although I don't know the values of all of the numerals, or why there's an excess one (I'm guessing the "A" looking one and its upside-down counterpart are the same numeral, or the one at the end is the punctuation mark/mathematical sign I mentioned earlier), I can now produce Inkling Sudoku:
(Go ahead and try solving it if you'd like.)
Progress on Inkling English is still very incomplete, and I don't feel ready to show what I've figured out so far. However, I am pretty sure I am on the right track--of the vowels I have figured, they fall into logical patterns.
By the way, I think I should explain a few basics of cryptanalysis, as I don't know if I can do this alone.
What some of you had assumed was that this would be a
simple substitution cipher or
monoalphabetic substitution cipher. In such a cipher, each letter corresponds to one and only one other letter. This is the sort of cipher you see in newspapers and used for decoder rings in those old-fashioned cereal boxes. You almost never see any other sort of cipher, so I can see why most people think that this is the only kind of cipher that exists. This is the most easily broken cipher, and even without spaces, simple substitution ciphers have been broken almost as soon as they were made.
Inkling English is a
polyalphabetic substitution cipher, albeit a nontraditional one. In such a cipher, multiple characters can stand for one letter. Perhaps the most famous of these kinds of ciphers is the
Vigenère cipher, a type of encryption invented in the 16th century. In a Vigenère cipher, each letter is encoded in a different way, in a repeating pattern using a key known to both the sender and receiver of the message. Vigenère ciphers remained unbreakable for three hundred years (when Friedrich Kasiski and Charles Babbage independently came up with techniques to decrypt such messages). Inkling English is not nearly as complicated as a Vigenère cipher, but I just used that as an example on how encryption got complex real fast.
Finally, the original message is called
plaintext.
(A bit of trivia: If any of you are familiar with
Gravity Falls, you may have noticed an encrypted message at the end of each episode. Each Season 1 episode uses a simple substitution cipher, and each Season 2 episode uses a Vigenère cipher with a key hidden in each episode.)
Now, onto the meat of the matter: Decryption. Every cipher that has ever been broken was done so through knowlege of patterns in the language of the plaintext message and through intuition. Substitution ciphers, at least up to a certain complexity (one-time pads are incredibly difficult to crack, and modern computer-based encryption is downright impossible without a powerful decryption program of your own), can be solved using a technique called
frequency analysis. This technique involves finding the most common characters in an encrypted message and assigning them the most common characters in the plaintext's language. In the case of English, the most common letter is E, followed by T, then A, then O, then U, then I, then N. Hence, if you're given a substitution cipher, especially a simple one, the best way to tackle it is to find which letters are the most common and keep trying combinations until they make sense. (Frequency analysis won't always get you the correct answer on the first try, but the longer the message, the easier it will be.) I'll show you an example of how powerful a tool frequency analysis is through an example. Here is a message encrypted with a simple substitution cipher:
Code:
DWVLWSGWDDWVLDMHDRIDMLKYLIDRWGZMLDMLSDRIGWVJLSRGDMLBRGCDWIYUULSDMLIJRGNIHGC
HSSWZIWUWYDSHNLWYIUWSDYGLWSDWDHFLHSBIHNHRGIDHILHWUDSWYVJLIHGCVEWAAWIRGNLGCD
MLBDWCRLDWIJLLAGWBWSLHGCVEHIJLLADWIHEZLLGCDMLMLHSDHQMLHGCDMLDMWYIHGCGHDYSHJ
IMWQFIDMHDUJLIMRIMLRSDW
Looks daunting? It actually isn't. If you apply frequency analysis, you'll see that each letter's frequency, in order of most to least, are as follows:
L: 31
D: 30
W: 26
H: 21
I: 21
G: 18
M: 16
S: 15
R: 11
C: 9
Y: 8
J: 7
U: 6
V: 6
A: 4
B: 4
N: 4
E: 3
Z: 3
F: 2
Q: 2
K: 1
O: 0
P: 0
T: 0
X: 0
Since L and D are the two most common letters, respectively, let's replace each "L" with "e" and each "D" with "t" and see what happens:
Code:
tWVeWSGWttWVetMHtRItMeKYeItRWGZMetMeStRIGWVJeSRGtMeBRGCtWIYUUeStMeIJRGNIHGC
HSSWZIWUWYtSHNeWYIUWStYGeWStWtHFeHSBIHNHRGItHIeHWUtSWYVJeIHGCVEWAAWIRGNeGCt
MeBtWCRetWIJeeAGWBWSeHGCVEHIJeeAtWIHEZeeGCtMeMeHStHQMeHGCtMetMWYIHGCGHtYSHJ
IMWQFItMHtUJeIMRIMeRStW
It probably still looks like gobbledygook, but don't let that deter you. If you look carefully, you'll start to see some patterns here. The string "tMe" occurs over and over in this message, at least five times. What absurdly common English word uses "t_e"? That would be "the." Going off this assumption, let's replace "M" with "h," and we get this:
Code:
tWVeWSGWttWVethHtRItheKYeItRWGZhetheStRIGWVJeSRGtheBRGCtWIYUUeStheIJRGNIHGC
HSSWZIWUWYtSHNeWYIUWStYGeWStWtHFeHSBIHNHRGItHIeHWUtSWYVJeIHGCVEWAAWIRGNeGCt
heBtWCRetWIJeeAGWBWSeHGCVEHIJeeAtWIHEZeeGCtheheHStHQheHGCthethWYIHGCGHtYSHJ
IhWQFIthHtUJeIhRIheRStW
Something else that you'll see is "thHt" occurring towards the beginning of this message and again towards the end of it. The word that most fits in with "th_t" is "that." Looking at the frequency chart above, "H" occurs very frequently, the fourth-most, after "W." If you substitute "H" with "a," the message becomes as such:
Code:
tWVeWSGWttWVethatRItheKYeItRWGZhetheStRIGWVJeSRGtheBRGCtWIYUUeStheIJRGNIaGC
aSSWZIWUWYtSaNeWYIUWStYGeWStWtaFeaSBIaNaRGItaIeaWUtSWYVJeIaGCVEWAAWIRGNeGCt
heBtWCRetWIJeeAGWBWSeaGCVEaIJeeAtWIaEZeeGCtheheaStaQheaGCthethWYIaGCGatYSaJ
IhWQFIthatUJeIhRIheRStW
We've only substituted four letters, but you can see how far this message has been decoded. By now, you can probably make out certain words, though you might not necessarily be sure they're the right ones. Next up, do you see "GC" repeated pretty frequently? It always seems to be "aGC" and "eGC," but mostly "aGC." The most common word in English that's three letters long, begins with "a," and does not have "h," "t," or "e" in it is "and." Let's replace "G" with "n" and "C" with "d."
Code:
tWVeWSnWttWVethatRItheKYeItRWnZhetheStRInWVJeSRntheBRndtWIYUUeStheIJRnNIand
aSSWZIWUWYtSaNeWYIUWStYneWStWtaFeaSBIaNaRnItaIeaWUtSWYVJeIandVEWAAWIRnNendt
heBtWdRetWIJeeAnWBWSeandVEaIJeeAtWIaEZeendtheheaStaQheandthethWYIandnatYSaJ
IhWQFIthatUJeIhRIheRStW
And now, entire words can be parsed out. Towards the end, do you see "endtheheaStaQhe"? Removing "end the" from there, you have "heaStaQhe." Seeing it as "hea_ta_he," it must be the word "heartache." "S" must be "r," and "Q" must be "c." (There are only two Q's in this message, but every little bit helps.) Now it becomes:
Code:
tWVeWrnWttWVethatRItheKYeItRWnZhethertRInWVJerRntheBRndtWIYUUertheIJRnNIand
arrWZIWUWYtraNeWYIUWrtYneWrtWtaFearBIaNaRnItaIeaWUtrWYVJeIandVEWAAWIRnNendt
heBtWdRetWIJeeAnWBWreandVEaIJeeAtWIaEZeendtheheartacheandthethWYIandnatYraJ
IhWcFIthatUJeIhRIheRrtW
The next obvious world is "natYraJ." "nat_ra_." It must be "natural." Let's change "Y" into "u" and "J" into "l." (Yeah, that makes upper-case I and lower-case "l" confusing now. I hope your font distinguishes them.)
Code:
tWVeWrnWttWVethatRItheKueItRWnZhethertRInWVlerRntheBRndtWIuUUertheIlRnNIand
arrWZIWUWutraNeWuIUWrtuneWrtWtaFearBIaNaRnItaIeaWUtrWuVleIandVEWAAWIRnNendt
heBtWdRetWIleeAnWBWreandVEaIleeAtWIaEZeendtheheartacheandthethWuIandnatural
IhWcFIthatUleIhRIheRrtW
By now, the only vowels left are "i" and "o" (not including "y"). If you continue looking towards the end, you'll see "theheartacheandthethQuIandnatural." That "Q" must be a vowel (the only trineme of consonants involving "th_" is "thr," and "r" has already been solved), and no words in English start with "thiu." Therefore, "Q" must be "o."
Code:
toVeornottoVethatRItheKueItRonZhethertRInoVlerRntheBRndtoIuUUertheIlRnNIand
arroZIoUoutraNeouIUortuneortotaFearBIaNHRnItaIeaoUtrouVleIandVEoAAoIRnNendt
heBtodRetoIleeAnoBoreandVEaIleeAtoIaEZeendtheheartacheandthethouIandnatural
IhocFIthatUleIhRIheRrto
You know what this message is now, don't you? That's right, it's the
Hamlet soliloquy. Suppose you don't know anything past "To be or not to be, that is the question." You will still know that "V" is "b," "R" is "i," "I" is "s," "K" is "q." Plugging them in, we have:
Code:
tobeornottobethatisthequestionZhethertisnoblerintheBindtosuUUertheslinNsand
arroZsoUoutraNeousUortuneortotaFearBsaNainstaseaoUtroublesandbEoAAosinNendt
heBtodietosleeAnoBoreandbEasleeAtosaEZeendtheheartacheandthethousandnatural
shocFsthatUleshisheirto
And now the code has been blown wide open. "Zhether" tells you that "Z" is "w." You have "outraNeous," so "N" must be "g." "Uortune" shows "U" is "f."
Code:
tobeornottobethatisthequestionwhethertisnoblerintheBindtosuffertheslingsand
arrowsofoutrageousfortuneortotaFearBsagainstaseaoftroublesandbEoAAosingendt
heBtodietosleeAnoBoreandbEasleeAtosaEweendtheheartacheandthethousandnatural
shocFsthatfleshisheirto
"shocFs"? That must be "shocks." "F" must stand for "k."
Also, "andbEasleeA" can be parsed out to become "and bE asleeA." All of the normal vowels have been used, so "E" must be "y." And it's pretty clear by now that "A" must be "p."
Code:
tobeornottobethatisthequestionwhethertisnoblerintheBindtosuffertheslingsand
arrowsofoutrageousfortuneortotakearBsagainstaseaoftroublesandbyopposingendt
heBtodietosleepnoBoreandbyasleeptosayweendtheheartacheandthethousandnatural
shocksthatfleshisheirto
"B" is the only one left. Not all of them are obvious, but the one that can help you solve it the best is "fortune or to take arBs against." The only word that can make sense there is "arms," so "B" stands for m." And now you have:
Code:
tobeornottobethatisthequestionwhethertisnoblerinthemindtosuffertheslingsand
arrowsofoutrageousfortuneortotakearmsagainstaseaoftroublesandbyopposingendt
hemtodietosleepnomoreandbyasleeptosayweendtheheartacheandthethousandnatural
shocksthatfleshisheirto
"to be or not to be that is the question whether tis nobler in the mind to suffer the slings and arrows of outrageous fortune or to take arms against a sea of troubles and by opposing end them to die to sleep no more and by a sleep to say we end the heartache and the thousand natural shocks that flesh is heir to"
I hope this wasn't too confusing for you. If you don't understand it all, try re-reading it. But yeah, this is the technique I am using to attack the Inkling code. Even if you don't see how I cracked that message, hopefully you at least see that there's a logic to it all.