Analysis Configuration Options

Mariana Trench is highly configurable and we recommend that you invest time into adjusting the tool to your specific use cases. At Meta, we have dedicated security engineers that will spend a significant amount of their time adding new rules and model generators to improve the analysis results.

This page will cover the more important, non-trivial configuration options. Note that you will spend most of your time configuring Mariana Trench writing model generators. These are covered in the Models and Model Generators.

Command Line Options

You can get a full set of options by running mariana-trench --help. The following is an abbreviated version of the output.

$ mariana-trench --help

Target arguments:
  --apk-path APK_PATH   The APK to analyze.

Output arguments:
  --output-directory OUTPUT_DIRECTORY
                        The directory to store results in.

Configuration arguments:
  --system-jar-configuration-path SYSTEM_JAR_CONFIGURATION_PATH
                        A JSON configuration file with a list of paths to the system jars.
  --rules-paths RULES_PATHS
                        A `;`-separated list of rules files and directories containing rules files.
  --repository-root-directory REPOSITORY_ROOT_DIRECTORY
                        The root of the repository. Resulting paths will be relative to this.
  --source-root-directory SOURCE_ROOT_DIRECTORY
                        The root where source files for the APK can be found.
  --model-generator-configuration-paths MODEL_GENERATOR_CONFIGURATION_PATHS
                        A `;`-separated list of paths specifying JSON configuration files. Each file is a list of paths to JSON model generators relative to the
                        configuration file or names of CPP model generators.
  --model-generator-search-paths MODEL_GENERATOR_SEARCH_PATHS
                        A `;`-separated list of paths where we look up JSON model generators.
  --maximum-source-sink-distance MAXIMUM_SOURCE_SINK_DISTANCE
                        Limits the distance of sources and sinks from a trace entry point.

`--apk-path`

Mariana Trench analyzes Dalvik bytecode. You provide it with the android app (APK) to analyze.

`--output-directory OUTPUT_DIRECTORY`

The output of the analysis is a file containing metadata about the particular run in JSON format as well as sharded files containing data flow specifications for every method in the APK. These files need to be processed by SAPP (see Getting Started) after the analysis. The flag specifies where these files are saved.

`--system-jar-configuration-path SYSTEM_JAR_CONFIGURATION_PATH`

This path points to a json file containing a list of .jar files that the analysis should include in the analysis. It's important that this contains at least the path to android.jar on your system. This file is typically located in your android SDK distribution at $ANDROID_SDK/platforms/android-30/android.jar. Without the android.jar, Mariana Trench will not know about many methods from the standard library that might be important for your model generators.

`--rules-paths RULES_PATHS`

A ; separated search path pointing to files and directories containing rules files. These files specify what taint flows Mariana Trench should look for. Check out the rules.json that's provided by default. It specifies that we want to find flows from user controlled input (ActivityUserInput) into CodeExecution sinks and that this constitutes a remote code execution.

`--source-root-directory SOURCE_ROOT_DIRECTORY`

Mariana Trench will do a source indexing path before the analysis. This is because Dalvik/Java bytecode does not contain complete location information, only filenames (not paths) and line numbers. The index is later used to emit precise locations.

`--model-generator-configuration-paths MODEL_GENERATOR_CONFIGURATION_PATHS`

A ; separated set of files containing the names of model generators to run. See default_generator_config.json for an example.

`--model-generator-search-paths MODEL_GENERATOR_SEARCH_PATHS`

A ; separated search path where Mariana Trench will try to find the model generators specified in the generator configuration.

`--maximum-source-sink-distance MAXIMUM_SOURCE_SINK_DISTANCE`

For performance reasons it can be useful to limit the maximum length of a trace Mariana Trench tries to find (note that longer traces also tend to be harder to interpret). Due to the modular nature of the analysis the value specified here limits the maximum length from the trace root to the source, and from the trace root to the sink. This means found traces can have length of 2 x MAXIMUM_SOURCE_SINK_DISTANCE.

`--heuristics HEURISTICS_FILE_PATH`

Mariana Trench uses various heuristics parameters during the analysis. It is possible to set some of them with a JSON configuration file. The complete list of configurable parameters is reported in the heuristics parameters section. It is optional to specify a configuration for the heuristics parameters, and the the parameters that are not specified are set to a default value.

Heuristics Parameters

`join_override_threshold INT`

When a method has a set of overrides greater than this threshold, Mariana Trench does not join all overrides at call sites.

`android_join_override_threshold INT`

When an android/java/google method has a set of overrides which is greater than this threshold, Mariana Trench does not join all overrides at call sites.

`warn_override_threshold INT`

When a method which has a set of overrides greater than this threshold that is not marked with NoJoinVirtualOverrides is called at least once, Mariana Trench prints a warning.

`generation_max_port_size INT`

Maximum size of the port of a generation.

`generation_max_output_path_leaves INT`

Maximum number of leaves in the tree of output paths of generations. When reaching the maximum, Mariana Trench collapses all the subtrees into a single node.

`parameter_source_max_port_size INT`

Maximum size of the port of a parameter source.

`parameter_source_max_output_path_leaves INT`

Maximum number of leaves in the tree of output paths of parameter sources. When reaching the maximum, Mariana Trench collapses all the subtrees into a single node.

`sink_max_port_size INT`

Maximum size of the port of a sink.

`sink_max_input_path_leaves INT`

Maximum number of leaves in the tree of input paths of sinks. When reaching the maximum, Mariana Trench collapses all the subtrees into a single node.

`call_effect_source_max_port_size INT`

Maximum size of the port of a call effect source.

`call_effect_source_max_output_path_leaves INT`

Maximum number of leaves in the tree of output paths of call effect sources. When reaching the maximum, Mariana Trench collapses all the subtrees into a single node.

`call_effect_sink_max_port_size INT`

Maximum size of the port of a call effect sink.

`call_effect_sink_max_input_path_leaves INT`

Maximum number of leaves in the tree of input paths of call effect sinks. When reaching the maximum, Mariana Trench collapses all the subtrees into a single node.

`max_number_iterations INT`

Maximum number of global iterations before Mariana Trench aborts the analysis.

`max_depth_class_properties INT`

Maximum depth of dependency graph traversal to find class properties.

`max_call_chain_source_sink_distance INT`

Maximum number of hops that can be tracked for a call chain issue.

`propagation_max_input_path_size INT`

Maximum size of the input access path of a propagation.

`propagation_max_input_path_leaves INT`

Maximum number of leaves in the tree of input paths of propagations.

Heuristics Parameter Configuration Example

The following JSON document is a valid configuration file for the heuristics parameters. Typically, we try to find a balance between precision of the analysis and performance.

{
    "join_override_threshold": 100,
    "android_join_override_threshold": 100,
    "warn_override_threshold": 100,
    "generation_max_port_size": 10,
    "generation_max_output_path_leaves": 30,
    "parameter_source_max_port_size": 10,
    "parameter_source_max_output_path_leaves": 30,
    "sink_max_port_size": 10,
    "sink_max_input_path_leaves": 30,
    "call_effect_source_max_port_size": 10,
    "call_effect_source_max_output_path_leaves": 30,
    "call_effect_sink_max_port_size": 10,
    "call_effect_sink_max_input_path_leaves": 30,
    "max_number_iterations": 300,
    "max_depth_class_properties": 30,
    "max_call_chain_source_sink_distance": 30,
    "propagation_max_input_path_size": 10,
    "propagation_max_input_path_leaves": 10
}

Command Line Options​

--apk-path​

--output-directory OUTPUT_DIRECTORY​

--system-jar-configuration-path SYSTEM_JAR_CONFIGURATION_PATH​

--rules-paths RULES_PATHS​

--source-root-directory SOURCE_ROOT_DIRECTORY​

--model-generator-configuration-paths MODEL_GENERATOR_CONFIGURATION_PATHS​

--model-generator-search-paths MODEL_GENERATOR_SEARCH_PATHS​

--maximum-source-sink-distance MAXIMUM_SOURCE_SINK_DISTANCE​

--heuristics HEURISTICS_FILE_PATH​

Heuristics Parameters​

join_override_threshold INT​

android_join_override_threshold INT​

warn_override_threshold INT​

generation_max_port_size INT​

generation_max_output_path_leaves INT​

parameter_source_max_port_size INT​

parameter_source_max_output_path_leaves INT​

sink_max_port_size INT​

sink_max_input_path_leaves INT​

call_effect_source_max_port_size INT​

call_effect_source_max_output_path_leaves INT​

call_effect_sink_max_port_size INT​

call_effect_sink_max_input_path_leaves INT​

max_number_iterations INT​

max_depth_class_properties INT​

max_call_chain_source_sink_distance INT​

propagation_max_input_path_size INT​

propagation_max_input_path_leaves INT​

Heuristics Parameter Configuration Example​