This repository contains initial code for comprehensive testing of binary translators.
For Python dependencies, see the requirements.txt
. We also require at least LLDB version 17 for fs_base
/gs_base
register support.
I had to compile LLDB myself; these are the steps I had to take (you also need swig version >= 4):
git clone https://github.com/llvm/llvm-project <llvm-path>
cd <llvm-path>
cmake -S llvm -B build -DCMAKE_BUILD_TYPE=Release -DLLVM_ENABLE_PROJECTS="clang;lldb" -DLLDB_ENABLE_PYTHON=TRUE -DLLDB_ENABLE_SWIG=TRUE
cmake --build build/ --parallel $(nproc)
# Add the built LLDB python bindings to your PYTHONPATH:
PYTHONPATH="$PYTHONPATH:$(./build/bin/lldb -P)"
It will take a while to compile.
focaccia.py
is the main executable. Invoke focaccia.py --help
to see what you can do with it.
The tools/
directory contains additional utility scripts to work with focaccia.
convert.py
: Convert logs from QEMU or Arancini to focaccia's snapshot log format.
The following files belong to a rough framework for the snapshot comparison engine:
-
focaccia/snapshot.py
: Structures used to work with snapshots. TheProgramState
class is our primary representation of program snapshots. -
focaccia/compare.py
: The central algorithms that work on snapshots. -
focaccia/arch/
: Abstractions over different processor architectures. Currently we have x86 and aarch64.
The following files belong to a prototype of a data-dependency generator based on symbolic execution:
-
focaccia/symbolic.py
: Algorithms and data structures to compute and manipulate symbolic program transformations. This handles the symbolic part of "concolic" execution. -
focaccia/lldb_target.py
: Tools for executing a program concretely and tracking its execution using LLDB. This handles the concrete part of "concolic" execution. -
focaccia/miasm_util.py
: Tools to evaluate Miasm's symbolic expressions based on a concrete state. Ties the symbolic and concrete parts together into "concolic" execution.
-
focaccia/parser.py
: Utilities for parsing logs from Arancini and QEMU, as well as serializing/deserializing to/from our own log format. -
focaccia/match.py
: Algorithms for trace matching.
To add support for an architecture , do the following:
-
Add a file
focaccia/arch/<arch>.py
. This module declares the architecture's description, such as register names and an architecture class. The convention is to declare state flags (e.g. flags in RFLAGS for x86) as separate registers. -
Add the class to the
supported_architectures
dict infocaccia/arch/__init__.py
. -
Depending on Miasm's support for , add register name aliases to the
MiasmSymbolResolver.miasm_flag_aliases
dict infocaccia/miasm_util.py
. -
Depending on the existence of a flags register in , implement conversion from the flags register's value to values of single logical flags (e.g. implement the operation
RFLAGS['OF']
) in the respective concrete targets (LLDB, GDB, ...).