DVFS Thermal Model and Algorithm

[Pro-Temp] Temperature Control in 2008

SMP-Temperature_Control_Using_Convex_Optimization

Ref: S. Murali et al., “Temperature control of high-performance multi-core platforms using convex optimization,” in Proc. of DATE, 2008.

Details of the design

1st phase [Design-Time]

DTPM-2008-Phase1

Denote

Core 1 ...... Core i ..... Core j ..... Core n
Power-p1p_1 ..... Power-pip_i Power-pnp_n

Compact Model in Thermal-design

There exists a well-known duality [Krum 2000]. The rationale behind is that current and heat flow are described by exactly the same differential equations for a potential difference.

This duality provides a convenient basis for an architecture-level thermal model. For a microarchitectural unit, heat conduction to the thermal package and to neighboring units are the dominant mechanisms that determine the temperature.

Heat Transfer Electrical Phenomena
Heat flow passing through a thermal resistance Current passing through a resistance
Heat:QQ Charge:qq
Temperature Difference:ΔT \Delta T Voltage:VV
Thermal capacitance:Ct=ΔQΔT C_t =\frac{\Delta Q}{\Delta T} Capacitance:C=qVC =\frac{q}{V}
RCtdT(t)dt+T(t)=RP(t) R C_t \frac{d T(t)}{dt} +T(t) = R P(t) RCdV(t)dt+V(t)=VI R C \frac {d V(t)}{dt} +V(t) = V_I

Explanation:

[2] K. Skadron, M. Stan, W. Huang, S. Velusamy, K. Sankaranarayanan and D. Tarjan, “Temperature-aware microarchitecture: Modeling and implementation,” ACM Trans. Archit. Code Optim., vol. 1, no. 1, pp. 94–125, Mar. 2004.

Thermal Equation

We discretize the equation above assuming a sampling period of Ts seconds.

The thermal equation of core i at current time (k+1) is based on the previous one timestamp (k).

ti=ti+jAdjiai,j(tjti)+bipit_i ' = t_i + \sum_{\forall j \in Adj_i} a_{i,j} (t_j - t_i) + b_i p_i

Need to be smaller than maximum allowable temperature := t_max

s.t.titmax(i,timestampk)s.t. \quad t_i' \leq t_{max}\quad (\forall i, timestamp_k)

Temperature Gradients

a physical quantity that describes in which direction and at what rate the temperature changes the most rapidly around a particular location.

Assumption: Uniform spatial temperature gradients across the cores

titjtgrad(i,j)t_i - t_j \leq tgrad \quad (\forall i,j)

Objective

Find a vector fi0f_i \geq 0 that satisfies the above restrictions and

minipi+tgradmin \quad \sum_i p_i + tgrad

minipmaxfi2fmax2+tgradmin \quad \sum_i p_{max} \frac{f_i^2}{f_{max}^2} + tgrad

Solving convex optimization models

Design Time Implementation

We use the execution characteristics of tasks from a mix of different benchmarks, ranging from web-accessing to playing multimedia files [26]. The maximum task/thread lengths of the benchmarks is around 10 ms. The experiments are conducted using a large trace with around 60,000 tasks, modeling several hundredseconds of actual system execution.

DTPM-2008-Phase1-Output

2nd phase [Run-Time]

Effects

With the integration of the efficient task assignment policy with our Pro-Temp method, the spatial temperature difference across the cores reducesfurther (by 16%).

High level of DTPM

Variable Frequency Setting considering floorplan of the chip

The processors near the cooler caches ( compare with near other hotter processors )

[13] A. K. Coskun, J. L. Ayala, D. Atienza, T. S. Rosing, and Y. Leblebici, “Dynamic thermal management in 3d multicore architectures,” in Proc. of DATE, 2009.

[14] V. Hanumaiah, S. Vrudhula, and K. S. Chatha, “Performance optimal online dvfs and task migration techniques for thermally constrained multi-core processors,” IEEE Trans on CAD of Integrated Circuits and Syst., vol. 30, no. 11, pp. 1677–1690, 2011.

[15] F. Zanini, D. Atienza, L. Benini, and G. De Micheli, “Multicore thermal management with model predictive control,” in European Conf. on Circuit Theory and Design, 2009.

[16] Y. Wang, K. Ma, and X. Wang, “Temperature-constrained power control for chip multiprocessors with online model estimation,” in AC M SIGARCH Comp. Arch. News, vol. 37, no. 3, 2009, pp. 314–324.

Agent [17] M.A.AlFaruque,J.Jahn,T.Ebi,andJ.Henkel,“Runtimethermalmanagement using software agents for multi-and many-core architectures,”IEEE Design & Test of Computers, vol. 27(6), pp. 58–68, 2010.

[4] T. S. Muthukaruppan et al., “Hierarchical power management for asymmetric multi-core in dark silicon era,” in Proc. of DAC, 2013.

-----  Appendix : List of related literature ----

[PROMETHEUS-framework] Thermal Power in HMP Embedded

Proactive_Method_for_Thermal

Ref: PROMETHEUS: A Proactive Method for Thermal Management of Heterogeneous MPSoCs, Shervin Sharifi, Member, IEEE, Dilip Krishnaswamy, Senior Member, IEEE, and Tajana Sˇimunic ́ Rosing, Member, IEEE

Thermal Model

2013-Network Analytical Solution 2013-TempBreakDown

Proactive Temperature Aware Scheduling

DTPM-2013-Scheduler

Power State Decision Modules

Two scheduling techniques

Multi-parametric programming

Multiparametric optimization approach splits the optimization process into offline and online stages. We describe how we deal with the general case of systems with multiple cores and different representation of performance requirements.

the Varying Parameters (Inputs)

s.t.T(α[k+1],α[k],To[k+1])<TThresholds.t. \quad T(\alpha[k+1],\alpha[k],T_o[k+1])< T_{Threshold}

Objective

minαPtotal(α[k+1],To[k+1])min_\alpha \quad P_{total}(\alpha[k+1],T_o[k+1])

TempoMP

DTPM-2013-Multi-Prog

TemPromp

To avoid this overhead of storing and retreiving, use a heuristic for its power state decision module.

Task Assignment to the cores

[DTPM] Thermal Power in HMP Mobile

HMP-AM-2015-USA_Singla-Thermal_Power-Mobile

Ref: Predictive_dynamic_thermal_and_power_management_for_heterogeneous_mobile_platforms, 2015

DTPM

Linux implementation of DTPM: please refer to DTPM Linux

High level of DTPM

The proposed DTPM approach is non-intrusive when the temperature is within permissible levels (i.e. when Thermal Violation == NO).

Power and Thermal Models

with experimental validation using one of the first commercial big.LITTLE architectures (Samsung Exynos 5410 octa-core chip).

Compute power budget using temperature prediction

Thermal Equation also enables temperature prediction at an arbitrary number of time steps tin t_i^{n} ahead. In particular, the temperature at time step k + n can be derived.

ti=ti+jAdjiai,j(tjti)+bipit_i ' = t_i + \sum_{\forall j \in Adj_i} a_{i,j} (t_j - t_i) + b_i p_i

ti=ti+jAdjiai,j(tjti)+bipit_i '' = t_i' + \sum_{\forall j \in Adj_i} a_{i,j} (t_j' - t'_i) + b_i p'_i

A prediction interval of “1s” since it is sufficient to control the temperature of our target platform. In general, accurate predictions up to“5s” can be made.

Algorithm

Effects

Exhaustive experimental evaluation which demonstrates effective thermal regulation with 6× smaller variance and as much as 16% reduction in total platform power.

In particular, it regulates the temperature more effectively than the default configuration which uses a fan, and on average offers 10% platform power savings with 3.3% loss in performance.

Appendix: TO DO

HMP-AM-2017-Bhat-Thermal_Power

Ref: Algorithmic Optimization of Thermal and Power Management for Heterogeneous Mobile Platforms, 2017

HMP-AM-2017-Pakistan_Khan-Thermal_Power

Ref: Scheduling_based_energy_optimization_technique_in_multiprocessor_embedded_systems