Kuboid
Black Hat 2024

The Devil is in the (Micro-) Architectures: Uncovering New Side-Channel and Bit-Flip Attack Surfaces in DNN Executables

Black Hat · 748 views · 41:07 · 12 months ago

This talk demonstrates novel side-channel and bit-flip attack vectors targeting Deep Neural Network (DNN) executables generated by modern deep learning compilers. The researchers show that compiler-specific optimizations, such as blocking and vectorization, create unique, identifiable cache access patterns that allow for model architecture recovery. Furthermore, they present a bit-flip attack technique that exploits these compiled structures to degrade model intelligence without requiring white-box access to model weights. The research highlights the security implications of deploying compiled DNNs in resource-sharing cloud environments.

Compiling Neural Networks Creates New Side-Channel and Bit-Flip Attack Surfaces

TLDR: Modern deep learning compilers like TVM and Glow optimize neural networks for specific hardware, but these optimizations inadvertently leak model architecture through cache access patterns. Researchers demonstrated that these identifiable patterns allow attackers to recover model structures and perform bit-flip attacks on the compiled binary code. The work shows that, even without white-box access to model weights, attackers can degrade model intelligence by flipping specific bits in the executable's code region.

Deep learning models are no longer just static blobs of weights sitting in a container. To squeeze performance out of commodity hardware, organizations are increasingly using deep learning compilers to transform high-level model definitions into optimized, hardware-specific binaries. While this shift improves inference speed, it introduces a massive, overlooked attack surface. By turning a model into a standard executable, you are no longer just dealing with tensor math; you are dealing with memory layouts, instruction streams, and cache behavior.

The Compiler as an Information Leak

The core of this research centers on how compilers like TVM and Glow handle operator optimization. To maximize performance, these compilers employ techniques like blocking and vectorization. Blocking breaks down large matrix multiplications into smaller, cache-friendly chunks, while vectorization leverages SIMD instructions to process multiple data points simultaneously.
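To make blocking concrete, here is a minimal Python sketch of a tiled matrix multiply. The `TILE` constant is an illustrative assumption; a real compiler like TVM tunes the tile sizes per layer based on its dimensions, which is exactly why the resulting access pattern fingerprints the layer.

```python
# Illustrative sketch (not TVM's actual codegen): a matrix multiply
# decomposed into cache-sized tiles ("blocking"). The tile size a
# compiler picks depends on the layer's hyper-parameters.

TILE = 4  # hypothetical tile size chosen by the compiler


def blocked_matmul(A, B, n):
    """Multiply two n x n matrices using cache-friendly tiles."""
    C = [[0.0] * n for _ in range(n)]
    for ii in range(0, n, TILE):          # tile rows of C
        for jj in range(0, n, TILE):      # tile columns of C
            for kk in range(0, n, TILE):  # tiles of the reduction axis
                for i in range(ii, min(ii + TILE, n)):
                    for j in range(jj, min(jj + TILE, n)):
                        for k in range(kk, min(kk + TILE, n)):
                            C[i][j] += A[i][k] * B[k][j]
    return C
```

The three outer loops walk over tiles; the three inner loops stay within a tile, so each block of `A` and `B` is reused while it is still resident in cache.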

These optimizations are not generic. They are highly dependent on the specific hyper-parameters of the neural network layers. Because the compiler makes deterministic decisions based on these parameters, the resulting binary exhibits unique, repeatable cache access patterns.

During the research, the team used Intel PIN to instrument the execution of these compiled DNNs. By monitoring the L1 cache, they observed that different operators—such as a 1x1 convolution versus a 3x3 convolution—produce distinct "fingerprints" in the cache access trace. For a pentester, this means that if you can monitor cache activity on a shared system, you can effectively reverse-engineer the architecture of a proprietary model. You are not just guessing; you are reading the model's blueprint directly from the CPU's cache behavior.
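The idea behind the PIN-based tracing can be illustrated with a toy simulation: record the sequence of cache lines a convolution's input loads would touch. The line size and row-major layout below are assumptions for illustration; real traces come from instrumented hardware execution.

```python
# Toy model of a cache-access fingerprint: which cache lines would a
# k x k convolution's input reads touch, and in what order? The line
# size and float32 layout are illustrative assumptions.

LINE = 64  # typical x86 cache-line size in bytes
ELEM = 4   # bytes per float32 element


def conv_trace(h, w, k):
    """Sequence of cache-line indices touched by a k x k convolution
    sliding over an h x w single-channel input (row-major)."""
    trace = []
    for y in range(h - k + 1):
        for x in range(w - k + 1):
            for dy in range(k):
                for dx in range(k):
                    addr = ((y + dy) * w + (x + dx)) * ELEM
                    trace.append(addr // LINE)
    return trace


# Different kernel sizes yield clearly distinct traces.
assert conv_trace(8, 8, 1) != conv_trace(8, 8, 3)
```

Even in this toy model, a 1x1 and a 3x3 kernel produce traces of different lengths and orderings, which is the property the researchers exploited to distinguish operators.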

Bit-Flip Attacks Without Weight Access

Perhaps the most alarming finding is the shift in bit-flip attack methodology. Traditionally, bit-flip attacks (often leveraging Rowhammer) required white-box knowledge of the model's weights to calculate gradients and identify the most sensitive bits to flip. This is a high bar for an attacker.

This research changes the game by targeting the compiled code region rather than the weight data. By analyzing the binary, the researchers identified "superbits"—specific bits in the code that, when flipped, cause the model to skip critical computations or return uninitialized results.

The attack flow is straightforward:

  1. The attacker compiles a local version of the target model using the same compiler and settings.
  2. They perform a local scan of the binary to identify bits that, when flipped, cause the desired degradation in model intelligence.
  3. They apply these specific flips to the victim's running executable.
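Step 2 above can be sketched as a brute-force single-bit scan. The `evaluate` callback and the byte buffer are stand-ins; in the real attack, the mutated executable is run against a test set and the metric is model accuracy.

```python
# Minimal sketch of the local "superbit" scan (step 2): flip each bit
# of a local copy of the compiled code, measure the quality metric,
# and keep flips that degrade it. `evaluate` is a hypothetical
# stand-in for running the mutated binary on a test set.


def scan_superbits(code: bytearray, evaluate, baseline, threshold):
    candidates = []
    for byte_idx in range(len(code)):
        for bit in range(8):
            code[byte_idx] ^= 1 << bit        # flip one bit in place
            score = evaluate(bytes(code))
            code[byte_idx] ^= 1 << bit        # restore the original bit
            if score < baseline - threshold:  # metric dropped enough
                candidates.append((byte_idx, bit))
    return candidates
```

Because each flip is restored before the next trial, the scan is exhaustive but non-destructive, and the surviving `(byte, bit)` pairs are the candidates applied to the victim in step 3.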

Because the attacker is targeting the instruction stream, they do not need to know the model's weights. They only need to know the model's structure and the compilation pipeline. In their experiments, the researchers achieved a 45% attack success rate with an average of only 1.4 bit-flips. This is a massive improvement over previous techniques that required significantly more effort and precise knowledge of the model's internal state.

Real-World Implications for Cloud Deployments

If you are testing an environment where multiple tenants share the same physical hardware—a standard setup for most Machine Learning as a Service (MLaaS) providers—this is a critical finding. The ability to recover model architecture via cache side-channels allows an attacker to perform reconnaissance on a competitor's model. Once the architecture is known, the bit-flip attack provides a low-cost, high-impact method to sabotage the model's accuracy.

During an engagement, you should look for instances where high-performance inference services run on shared infrastructure. If you can gain unprivileged code execution on the same host, you can use cache-monitoring techniques to profile the target's inference calls. The shared cache hierarchy between the monitoring process and the target process, rather than any breach of memory isolation, is the primary enabler here.

Defensive Realities

Defending against these attacks is difficult because they exploit the very optimizations that make these models viable in production. Disabling blocking or vectorization would render many models too slow for real-time use.

Blue teams should focus on hardening the execution environment. This includes strict cache partitioning, or hardware features such as Intel's Cache Allocation Technology (CAT) that isolate cache ways between processes. Furthermore, monitoring for unusual cache-miss rates or unexpected process crashes (which often occur when a bit-flip lands on an invalid instruction) can serve as a detection signal for ongoing exploitation attempts.
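As a rough illustration of the detection idea, the sketch below flags sample windows whose cache-miss rate deviates sharply from a rolling baseline. The sample values would come from hardware performance counters (e.g. collected via `perf stat`); the window size and threshold factor are arbitrary assumptions.

```python
# Illustrative anomaly check for the blue-team signal above: flag any
# sample whose cache-miss rate far exceeds the rolling mean of the
# preceding window. Window size and factor are arbitrary assumptions.

from collections import deque


def flag_anomalies(samples, window=8, factor=3.0):
    """Return indices of samples exceeding factor * rolling mean."""
    recent = deque(maxlen=window)
    alerts = []
    for i, miss_rate in enumerate(samples):
        if len(recent) == window:
            mean = sum(recent) / window
            if miss_rate > mean * factor:
                alerts.append(i)
        recent.append(miss_rate)
    return alerts
```

A production detector would need per-workload baselines, since legitimate inference bursts also shift miss rates, but the principle of baselining and thresholding is the same.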

The industry has spent years focusing on adversarial examples and data poisoning, but we have largely ignored the reality that a model is just another piece of software. When we compile it, we inherit all the classic vulnerabilities of compiled code. It is time to start treating DNN executables with the same level of scrutiny we apply to any other high-value binary.
