Close

Presentation

RAGE - The 1st International workshop on Real-time And intelliGent Edge computing
TimeSunday, July 10th8:00am - 5:00pm PDT
Location3002, Level 3
Event Type
Workshop
Topics
AI
DescriptionThe edge computing paradigm is becoming increasingly popular as it facilitates real-time computation, reduces energy consumption and carbon footprint, and fosters security and privacy preservation by processing the data closer to its origins, thereby drastically reducing the amount of data sent to the cloud. On the application side, there is a growing interest in using edge computing as a key pillar to support decentralized artificial intelligence by implementing federated learning and adaptive deep learning inference at the edge. However, many edge applications tightly interact with the surrounding environment and are required to deliver a result (e.g., perform actuation or send a message through a 5G network) within a predefined deadline. Therefore, a key requirement in edge computing is the need to be predictable across the edge-to-cloud continuum while also efficiently utilizing the system resources.

However, meeting the above requirements is non-trivial. Modern edge devices can be very diverse (from hand-held devices to large in-premise servers) and can include complex embedded platforms with multiple heterogeneous cores and hardware accelerators such as GPUs, TPUs, and FPGAs. This complexity introduces considerable challenges when trying to guarantee timing requirements of real-time applications: for example, due to scheduling policies implemented by the hardware accelerators (often hidden by vendors), or due to the memory contention experienced by the cores when accessing concurrently main memory. Secondly, the network transmission time (TSN over Ethernet to 5G links) can lead to variability in the end-to-end latencies incurred by edge applications.

Furthermore, the operating system (OS) also plays a crucial role in enabling the edge computing paradigm, but quite often at the price of increasing the difficulty in deriving timing guarantees: for example, think of a complex deep neural network that needs to leverage a Linux-based OS (which is far more complicated than a real-time operating system), since it makes available all the software stacks (e.g., TensorRT) and device drivers to interact with NVIDIA GPUs.

The complexity of the problem is further increased by the usage of middleware frameworks, which simplify the development of applications, but at the cost of introducing additional scheduling policies that add to those implemented by the underlying operating system, hindering predictability. Some relevant examples are ROS, in the context of robotics, TensorFlow for artificial intelligence, TensorRT for efficient deep neural network inference on GPUs, and others. Virtualization technologies are also becoming crucial in implementing the edge paradigm, but again, at the expense of creating a more complex operating environment, where guaranteeing temporal properties is really challenging. These problems are common to many application domains, including cyber-physical systems, future-generation autonomous-driving applications, robotics, Industry 4.0, smart-buildings, and more.

In this workshop, we solicit the submission of work-in-progress papers. Workshop topics include, but are not limited to:

-Real-time edge computing

-QoS mechanisms for temporal isolation

-Mechanisms for end-to-end latency guarantees in the edge-to-cloud continuum

-Predictability in middleware frameworks (ROS, TensorFlow, TensorRT, and more)

-Real-time edge computing use cases

-Real-time network protocols for edge computing

-Real-time distributed artificial intelligence

-Predictable and efficient parallel applications

-Timing predictability for artificial intelligence