Description
This post aim to just remove obvious opaque predicates in control flow graph. Using Binary Ninja, it is possible to patch to de-obfuscate the sample since the strategy used is the same.
HASH : 0581f0bf260a11a5662d58b99a82ec756c9365613833bce8f102ec1235a7d4f7
The Pattern
Firstly, the basic blocks are really fragmented which are glued together via conditional and unconditional jumps. For unconditional jump statements, there are instances where ONLY one path is taken for all cases. Notice how there are a few comparisons whereby left equals right via cmp
instruction which would always set zero flag to zero. Depending on the jump conditions, it would only go down ONE path.
To understand this, we can reference the instruction graph for this comparison. The main IL Operation is LLIL_IF
to check for the condition of LLIL_CMP_E
. In the script, we should cover all the different IL related to CMP
. The pattern exists where the left and right operands are of LLIL_REG
which we can double check. Finally, to detect this, we can also make sure that the src
for both left
and right
are the same which is the pre-requisites for the opaque predicate.
Changing Control Flow
We can make use of the always_branch
and never_branch
from the patch menu which can also be done so programmatically.
bv.always_branch(<instr>.address)
bv.never_branch(<instr>.address)
What it does is to patch to become an unconditional jump to the true statement (always_branch
) or false statement (never_branch
).## Writing the Script
We have to keep track of the locations to patch with the *_branch
API.
for f in bv.functions:
func = f.low_level_il
patch_locations = []
for bb in func:
for instr in bb:
if handle_cmp_same_regs(instr):
print()
Next, we can parse the instruction in Low Level IL as per the IL Graph shown in the previous section. Note that when appending the patch location, I have added another value (1 or 0) to indicate if we should always branch or to never branch. This ensures that the unconditional jump statement would jump to the correct location. Also, the different CMP
conditions are accounted for as well.
def handle_cmp_same_regs(instr):
if instr.operation == LowLevelILOperation.LLIL_IF:
comparison_statement = instr.operands[0]
print("Operation : ", comparison_statement.operation)
print(hex(instr.address))
try:
left_comparator = comparison_statement.left
right_comparator = comparison_statement.right
false_instr = instr.false
true_instr = instr.true
except:
print("Skipping instruction " , instr , "@", hex(instr.address))
return False
if hasattr(left_comparator, 'src') == False or hasattr(right_comparator, 'src') == False:
return False
# Testing agains the following
"""
LLIL_CMP_NE - not equal
LLIL_CMP_SLT - signed less than
LLIL_CMP_ULT - unsigned less than
LLIL_CMP_SLE - signed less than or equal
LLIL_CMP_ULE - unsigned less than or equal
LLIL_CMP_SGE - signed greater than or equal
LLIL_CMP_UGE - unsigned greater than or equal
LLIL_CMP_SGT - signed greater than
LLIL_CMP_UGT - unsigned greater than
"""
if left_comparator.src == right_comparator.src and right_comparator.operation == LowLevelILOperation.LLIL_REG and left_comparator.operation == LowLevelILOperation.LLIL_REG:
if comparison_statement.operation == LowLevelILOperation.LLIL_CMP_E:
patch_locations.append((instr.address, 1))
return True
elif comparison_statement.operation == LowLevelILOperation.LLIL_CMP_NE:
patch_locations.append((instr.address,0))
return True
elif comparison_statement.operation == LowLevelILOperation.LLIL_CMP_SLE:
patch_locations.append(instr.address, 1)
return True
elif comparison_statement.operation == LowLevelILOperation.LLIL_CMP_UGE:
patch_locations.append(instr.address,1)
return True
elif comparison_statement.operation == LowLevelILOperation.LLIL_CMP_ULE:
patch_locations.append(instr.address,1)
return True
elif comparison_statement.operation == LowLevelILOperation.LLIL_CMP_SGE:
patch_locations.append(instr.address,1)
return True
elif comparison_statement.operation == LowLevelILOperation.LLIL_CMP_SGT:
patch_locations.append(instr.address,0)
return True
elif comparison_statement.operation == LowLevelILOperation.LLIL_CMP_UGT:
patch_locations.append(instr.address,0)
return True
elif comparison_statement.operation == LowLevelILOperation.LLIL_CMP_SLT:
patch_locations.append(instr.address,0)
return True
elif comparison_statement.operation == LowLevelILOperation.LLIL_CMP_ULT:
patch_locations.append(instr.address,0)
return True
return False
Next, we can do the actual manipulation of the control flow graph via patching.
for i in patch_locations:
if i[1] == 1 and bv.is_always_branch_patch_available(i[0]):
bv.always_branch(i[0])
elif i[1] == 0 and bv.is_always_branch_patch_available(i[0]):
bv.never_branch(i[0])