Android’s defense-in-depth technique applies not solely to the Android OS working on the Utility Processor (AP) but additionally the firmware that runs on gadgets. We notably prioritize hardening the mobile baseband given its distinctive mixture of working in an elevated privilege and parsing untrusted inputs which can be remotely delivered into the system.
This submit covers the way to use two high-value sanitizers which may forestall particular lessons of vulnerabilities discovered throughout the baseband. They’re structure agnostic, appropriate for bare-metal deployment, and ought to be enabled in present C/C++ code bases to mitigate unknown vulnerabilities. Past safety, addressing the problems uncovered by these sanitizers improves code well being and total stability, lowering assets spent addressing bugs sooner or later.
As we outlined beforehand, safety analysis centered on the baseband has highlighted a constant lack of exploit mitigations in firmware. Baseband Distant Code Execution (RCE) exploits have their very own categorization in well-known third-party marketplaces with a comparatively low payout. This means baseband bugs might probably be considerable and/or not too complicated to seek out and exploit, and their outstanding inclusion within the market demonstrates that they’re helpful.
Baseband safety and exploitation has been a recurring theme in safety conferences for the final decade. Researchers have additionally made a dent on this space in well-known exploitation contests. Most not too long ago, this space has turn out to be outstanding sufficient that it’s widespread to seek out sensible baseband exploitation trainings in high safety conferences.
Acknowledging this development, mixed with the severity and obvious abundance of those vulnerabilities, final yr we launched updates to the severity tips of Android’s Vulnerability Rewards Program (VRP). For instance, we contemplate vulnerabilities permitting Distant Code Execution (RCE) within the mobile baseband to be of CRITICAL severity.
Frequent lessons of vulnerabilities may be mitigated by the usage of sanitizers supplied by Clang-based toolchains. These sanitizers insert runtime checks in opposition to widespread lessons of vulnerabilities. GCC-based toolchains may present some degree of help for these flags as properly, however won’t be thought of additional on this submit. We encourage you to test your toolchain’s documentation.
Two sanitizers included in Undefined Habits Sanitizer (UBSan) shall be our focus – Integer Overflow Sanitizer (IntSan) and BoundsSanitizer (BoundSan). These have been extensively deployed in Android userspace for years following a data-driven strategy. These two are properly suited to bare-metal environments such because the baseband since they don’t require help from the OS or particular structure options, and so are typically supported for all Clang targets.
Integer Overflow Sanitizer (IntSan)
IntSan causes signed and unsigned integer overflows to abort execution until the overflow is made express. Whereas unsigned integer overflows are technically outlined habits, it could usually result in unintentional habits and vulnerabilities – particularly once they’re used to index into arrays.
As each intentional and unintentional overflows are seemingly current in most code bases, IntSan might require refactoring and annotating the code base to stop intentional or benign overflows from trapping (which we contemplate a false optimistic for our functions). Overflows which should be addressed may be uncovered through testing (see the Deploying Sanitizers part)
BoundsSanitizer (BoundSan)
BoundSan inserts instrumentation to carry out bounds checks round some array accesses. These checks are solely added if the compiler can not show at compile time that the entry shall be secure and if the scale of the array shall be identified at runtime, in order that it may be checked in opposition to. Word that this won’t cowl all array accesses as the scale of the array might not be identified at runtime, equivalent to operate arguments that are arrays.
So long as the code is appropriately written C/C++, BoundSan ought to produce no false positives. Any violations found when first enabling BoundSan is no less than a bug, if not a vulnerability. Resolving even these which aren’t exploitable can tremendously enhance stability and code high quality.
Modernize your toolchains
Adopting fashionable mitigations additionally means adopting (and sustaining) fashionable toolchains. The advantages of this transcend using sanitizers nonetheless. Sustaining an outdated toolchain is just not free and entails hidden alternative prices. Toolchains comprise bugs that are addressed in subsequent releases. Newer toolchains deliver new efficiency optimizations, useful within the extremely constrained bare-metal setting that basebands function in. Safety points may even exist within the generated code of out-of-date compilers.
Sustaining a contemporary up-to-date toolchain for the baseband entails some prices by way of upkeep, particularly at first if the toolchain is especially outdated, however over time the advantages, as outlined above, outweigh the prices.
Each BoundSan and IntSan have a measurable efficiency overhead. Though we have been in a position to considerably scale back this overhead prior to now (for instance to lower than 1% in media codecs), even very small will increase in CPU load can have a considerable influence in some environments.
Enabling sanitizers over all the codebase offers probably the most profit, however enabling them in security-critical assault surfaces can function a primary step in an incremental deployment. For instance:
- Capabilities parsing messages delivered over the air in 2G, 3G, 4G, and 5G (particularly features dealing with pre-authentication messages that may be injected with a false/malicious base station)
- Libraries encoding/decoding complicated codecs (e.g. ASN.1, XML, DNS, and many others…)
- IMS, TCP and IP stacks
- Messaging features (SMS, MMS)
Within the specific case of 2G, the very best technique is to disable the stack altogether by supporting Android’s “2G toggle”. Nonetheless, 2G remains to be a essential cell entry expertise in sure elements of the world and a few customers would possibly have to have this legacy protocol enabled.
Having a transparent plan for deployment of sanitizers saves a variety of effort and time. We consider the deployment course of as having three phases:
- Detecting (and fixing) violations
- Measuring and lowering overhead
- Soaking in pre-production
We additionally introduce two modes through which sanitizers ought to be run: diagnostics mode and trapping mode. These shall be mentioned within the following sections, however briefly: diagnostics mode recovers from violations and offers useful debug data, whereas trapping mode actively mitigates vulnerabilities by trapping execution on violations.
Detecting (and Fixing) Violations
To efficiently ship these sanitizers, any benign integer overflows should be made express and unintended out-of-bounds accesses should be addressed. These must be uncovered by testing. The upper the code protection your assessments present, the extra points you may uncover at this stage and the simpler deployment shall be afterward.
To diagnose violations uncovered in testing, sanitizers can emit calls to runtime handlers with debug data such because the file, line quantity, and values resulting in the violation. Sanitizers can optionally proceed execution after a violation has occurred, permitting a number of violations to be found in a single check run. We seek advice from utilizing the sanitizers on this method as working them in “diagnostics mode”. Diagnostics mode is just not supposed for manufacturing because it offers no safety advantages and provides excessive overhead.
Diagnostics mode for the sanitizers may be set utilizing the next flags:
-fsanitize=signed-integer-overflow,unsigned-integer-overflow,bounds -fsanitize-recover=all
Since Clang doesn’t present a UBSan runtime for bare-metal targets, a runtime will should be outlined and supplied at hyperlink time:
// integer overflow handlers __ubsan_handle_add_overflow(OverflowData *knowledge, ValueHandle lhs, ValueHandle rhs) __ubsan_handle_sub_overflow(OverflowData *knowledge, ValueHandle lhs, ValueHandle rhs) __ubsan_handle_mul_overflow(OverflowData *knowledge, ValueHandle lhs, ValueHandle rhs) __ubsan_handle_divrem_overflow(OverflowData *knowledge, ValueHandle lhs, ValueHandle rhs) __ubsan_handle_negate_overflow(OverflowData *knowledge, ValueHandle old_val) // boundsan handler __ubsan_handle_out_of_bounds_overflow(OverflowData *knowledge, ValueHandle old_val)
For instance, see the default Clang implementation; the Linux Kernels implementation may be illustrative.
With the runtime outlined, allow the sanitizer over all the baseband codebase and run all obtainable assessments to uncover and deal with any violations. Vulnerabilities ought to be patched, and overflows ought to both be refactored or made express by the usage of checked arithmetic builtins which don’t set off sanitizer violations. Sure features that are anticipated to have intentional overflows (equivalent to cryptographic features) may be preemptively excluded from sanitization (see subsequent part).
Except for uncovering safety vulnerabilities, this stage is extremely efficient at uncovering code high quality and stability bugs that would lead to instability on person gadgets.
As soon as violations have been addressed and assessments are now not uncovering new violations, the following stage can start.
Measuring and Decreasing Overhead
As soon as shallow violations have been addressed, benchmarks may be run and the overhead from the sanitizers (efficiency, code dimension, reminiscence footprint) may be measured.
Measuring overhead should be completed utilizing manufacturing flags – particularly “trapping mode”, the place violations trigger execution to abort. The diagnostics runtime used within the first stage carries vital overhead and isn’t indicative of the particular efficiency sanitizer overhead.
Trapping mode may be enabled utilizing the next flags:
-fsanitize=signed-integer-overflow,unsigned-integer-overflow,bounds -fsanitize-trap=all
A lot of the overhead is probably going attributable to a small handful of “sizzling features”, for instance these with tight long-running loops. Tremendous-grained per-function efficiency metrics (much like what Simpleperf offers for Android), permits evaluating metrics earlier than and after sanitizers and offers the best means to determine sizzling features. These features can both be refactored or, after handbook inspection to confirm that they’re secure, have sanitization disabled.
Sanitizers may be disabled both inline within the supply or by the usage of ignorelists and the -fsanitize-ignorelist flag.
The bodily layer code, with its extraordinarily tight efficiency margins and decrease probability of exploitable vulnerabilities, could also be a superb candidate to disable sanitization wholesale if preliminary efficiency appears prohibitive.
Soaking in Pre-production
With overhead minimized and shallow bugs resolved, the ultimate stage is enabling the sanitizers in trapping mode to mitigate vulnerabilities.
We strongly advocate an extended interval of inner soak in pre-production with check populations to uncover any remaining violations not found in testing. The extra thorough the check protection and size of the soak interval, the much less threat there shall be from undiscovered violations.
As above, the configuration for trapping mode is as follows:
-fsanitize=signed-integer-overflow,unsigned-integer-overflow,bounds -fsanitize-trap=all
Having infrastructure in place to gather bug reviews which end result from any undiscovered violations might help decrease the chance they current.
The advantages from deploying sanitizers in your present code base are tangible, nonetheless in the end they deal with solely the bottom hanging fruit and won’t lead to a code base freed from vulnerabilities. Different lessons of reminiscence security vulnerabilities stay unaddressed by these sanitizers. A long run resolution is to start transitioning at the moment to memory-safe languages equivalent to Rust.
Rust is prepared for bare-metal environments such because the baseband, and we’re already utilizing it in different bare-metal parts in Android. There isn’t any have to rewrite every part in Rust, as Rust offers a powerful C FFI help and simply interfaces with present C codebases. Simply writing new code in Rust can quickly scale back the variety of reminiscence security vulnerabilities. Rewrites ought to be restricted/prioritized just for probably the most important parts, equivalent to complicated parsers dealing with untrusted knowledge.
The Android crew has developed a Rust coaching meant to assist skilled builders rapidly ramp up Rust fundamentals. A complete day for bare-metal Rust is included, and the course has been translated to a lot of totally different languages.
Whereas the Rust compiler might not explicitly help your bare-metal goal, as a result of it’s a front-end for LLVM, any goal supported by LLVM may be supported in Rust by customized goal definitions.
Because the high-level working system turns into a tougher goal for attackers to efficiently exploit, we count on that decrease degree parts such because the baseband will appeal to extra consideration. By utilizing fashionable toolchains and deploying exploit mitigation applied sciences, the bar for attacking the baseband may be raised as properly. When you have any questions, tell us – we’re right here to assist!