====== Lab 2 - Hash Collisions Galore ====== Upon its first execution the ''L2'' binary asks the user to generate 1024 passwords. With this information we can assume that the password is not directly hardcoded in the binary but will be processed and compared against a reference value and that this process will map at least 1024 passwords to this value. Using this assumption we can assume that the binary computes a hash of the given password and check it against a reference hash. Moreover due to the need to provide multiple passwords, we can assume that this hash function is weak against collitions. ==== Reverse Engineering ==== The first step is to use IDA to disassemble the ''L2'' binary. This gives us a symbolic representation of the program's code. After analysis of the general program flow, we can see that the program looks for the presence of a single argument which we can assume is the user-supplied password. The program then checks the password (pictured left) and prints a message if the password matches an expected input. === Algorithm Analysis === {{ :heig:su16:sre:lab2-hash.png?nolink&200|Lab 2 hash function}} The first block of code checks for the length of the string using the ''repne scasb'' instruction. This instruction is a complex instruction that decrements the ''ecx'' register while the byte at the address of the ''edi'' register is non zero. With this knowlege in mid we can infer that the program expects a password of length ''0xFFFFFFFF - 0xFFFFFFF2'' or ''0xD16'' or ''1310'' including the null-byte terminating the string so 12 significative characters. If the string does not contains 12 characters, the program jumps to the error block, otherwise it jumps to the second block. The second to fourth block perform the hashing on the string. The hashing algorithm can be expressed as the following pseudocode: chunks is [[int8 * 4] * 3] := partition password in chunks of 4 characters hash is int32 := 0 for each chunk in chunks do segment is int32 := chunk[0] || chunk[1] || chunk[3] || chunk[2] hash := hash xor segment end is hash equals to 0x5A155E39 === Breaking the Hash === Since this algorithm performs a xor compression it is trivial to find three individual characters whose xor compression is equal to an 8 bit chunk out of the reference value. A script that can generate all possible passwords is then a trivial endeavour. Firstly we need to generate all character triplets that xor to one of the desired values. pairs = { 0x5a: [], 0x15: [], 0x5e: [], 0x39: [] } char_range = [c for c in range(33, 127)] permutations = it.product(char_range, char_range, char_range) for chars in permutations: delta = chars[0] ^ chars[1] ^ chars[2] if delta in [0x5a, 0x15, 0x5e, 0x39]: pairs[delta].append(chars) Then the cross product of the pairs allows to generate all possible passwords using the following code which computes the cross product of the four sets and creates a password for each entry. for pps in it.product(pairs[0x5a], pairs[0x15], pairs[0x5e], pairs[0x39]): pwd = str.join("", sum([[chr(pp[n]) for pp in pps] for n in range(0, 3)], [])) print("Password:", pwd) Note that the last two values are swapped from their order in the desired hash. The total number of passwords can be obtained by multiplying the size of the four sets and is **1781102812020000** (or 18.98Pb of data or 1.58Pb by compressing data).