Deep Unfolding: Between Algorithms and Deep Learning - Complex Systems Science Laboratory Hiroshima University

Introduction

Deep learning has demonstrated strong performance in areas such as image processing and natural language processing. However, have you ever felt that it is difficult to understand why it works so well, or what is actually happening inside the model? For those who value explicit computational procedures and algorithmic structures, the black-box nature of deep learning can sometimes be a source of discomfort.

Deep unfolding is an approach motivated by this concern. It reformulates conventional iterative algorithms directly into the structure of deep neural networks. While incorporating learning, it places emphasis on explicitly preserving the computational meaning and structure inherent in the original algorithm.

By performing learning while maintaining the algorithmic structure, it becomes possible to improve performance while understanding why each computation is necessary. This characteristic is important not only for achieving higher accuracy, but also for making results easier to interpret and refine.

Topics that students in information science programs have studied—such as programming, discrete mathematics, algorithms, probability and statistics, and linear models—are closely related to the concept of deep unfolding. The ability to decompose processing flows and express computations as code can be directly applied in this field.

Rather than extensive knowledge of advanced mathematics, what matters is the attitude of following computational flows, implementing them, and reasoning through observed behavior. For this reason, deep unfolding is a field that students can readily engage in even at the undergraduate level.

What Is Deep Unfolding?

Many algorithms approach solutions by repeatedly applying the same computation. This iterative structure reflects the core design philosophy of the algorithm, with each step playing a clearly defined role.

In deep unfolding, these repeated computations are unfolded along the time axis, and each step is treated as a layer of a neural network. A key feature is that each layer corresponds one-to-one with the original update rule, rather than simply stacking generic layers.

Because the operations in each layer are derived from the original algorithm, the computational meaning is easy to trace. As a result, even after introducing learning, it remains possible to reason about how each computation affects the outcome.

By keeping this structure intact while making certain parameters learnable, deep unfolding can achieve high performance with fewer iterations than conventional methods. The ability to balance theoretical insight and practical performance is one of the major strengths of deep unfolding.

Example: Deep Unfolding of an Iterative Algorithm

A representative example is the iterative algorithm ISTA, which is widely known as a method for finding solutions by repeatedly applying a fixed update rule.

In standard ISTA, coefficients and thresholds in the update rule are fixed, based on theory or experience. In deep unfolding, these parameters are treated as independent across layers and are made learnable from data.

This allows the algorithm to adapt its behavior to specific problems or datasets while preserving its original meaning. Unlike black-box learning approaches, the structure of the updates remains central to the design.

In this way, deep unfolding enables the simultaneous treatment of theoretically meaningful computations and data-driven optimization.

Broadening Applications

Deep unfolding is not limited to a single application domain. Any setting in which iterative algorithms or optimization methods are used can potentially be reformulated through deep unfolding.

As a result, research has been conducted in a wide range of areas, including image reconstruction, data analysis, communication systems, and biomedical signal processing. Although application domains differ, the underlying principles remain the same.

What these applications share is the idea of transforming algorithms with clearly defined computational flows into learnable structures. This is what supports the generality of deep unfolding.

Extension to Control: Temporal Deep Unfolding

In our laboratory, we extend the concept of deep unfolding to control problems that involve temporal evolution. In control systems, states change over time, and these changes are causally determined by inputs.

In spatio-temporal (phase) deep unfolding, the structure of the controlled system and the control law are unfolded along the time axis, with computations at each time step represented as layers. This allows learning to be incorporated without breaking temporal causality.

By making part of the control law learnable, it becomes possible to naturally integrate theory-based design with data-driven adjustment.

Validation through Numerical Simulation

The behavior of spatio-temporal deep unfolding has been validated through numerical simulations. Stable performance has been confirmed even for nonlinear systems and systems with complex dynamics.

Compared with conventional control methods, spatio-temporal deep unfolding can reach solutions with fewer computations and exhibits greater flexibility under changing conditions.

An important feature is that, while performance is improved through learning, the computational flow and the role of each operation remain clear. This makes it easier to analyze not only the results, but also the reasons behind the observed behavior.

Advantages of Temporal Deep Unfolding

A key advantage of spatio-temporal deep unfolding lies in its integration of reliability based on mathematical models with the flexibility of AI within a single framework. This enables designs that do not rely exclusively on either theory or learning.

In terms of effectiveness, high performance has been demonstrated across a wide range of control problems, including nonlinear, time-varying, and large-scale systems. Another advantage is that optimizers commonly used in existing AI frameworks can be directly applied.

As a result, faster convergence and higher control success rates have been achieved compared with conventional methods. Moreover, when a mathematical model of the controlled system is available, a working system can often be constructed within a relatively short time.

The code structure is simple and flexible, allowing easy modification of parameters and model structures. The ability to handle diverse control problems within a single framework is a major strength of spatio-temporal deep unfolding.

A Message to Students

Research does not require complete mastery of advanced knowledge from the outset. In many cases, it begins with simple questions such as “What does this computation do?” or “Why is this process repeated?”

In research on deep unfolding and spatio-temporal deep unfolding, the core activity is repeatedly tracing algorithms, implementing them, running experiments, and examining the results. Thinking, coding, observing outcomes, and thinking again form a continuous cycle.

There is room for students who enjoy programming as well as those who prefer carefully reasoning about the meaning of computations.

The necessary theoretical background is acquired gradually as research progresses. The field may not appear flashy, but for those interested in continually asking “why does this work?”, it may be worth taking a closer look.