Perspectives: DPU-based hardware acceleration: a software perspective

Data-processing units (DPUs) promise greater data-center efficiency, but low-level programming requirements have hindered broad adoption.

Promising more-efficient data centers, DPUs add another element to the heterogeneous processing mix. DPUs are important to data-center disaggregation, allowing server processors to perform only compute tasks while the DPU handles data movement between networked compute and storage.

Several vendors now offer processors positioned as DPUs. After examining the product field, The Linley Group defines a DPU as a programmable network SoC that integrates all major functions from the network ports to the PCI Express (PCIe) interface. A high-bandwidth PCIe interface separates DPUs from programmable Ethernet switch chips as well as legacy embedded processors. Combined with an integrated data plane for high-rate packet processing, the PCIe interface suits DPUs to network-traffic termination in smart NICs and for connecting SSDs in storage-controller cards.

Portability requires developers to use high-level APIs, avoiding any dependencies on underlying hardware. Conversely, adopting DPUs has required custom low-level code, creating a barrier to application developers.

With DOCA, Nvidia aims to remove this obstacle by providing a higher level of abstraction for DPU programming. By providing runtime binaries and high-level APIs, the framework allows developers to focus on application code rather than learning DPU-hardware intricacies.

For AI, there are similar tensions between running code on an x86 server processor and accelerating it using optimized hardware such as a GPU. Despite increasing competition, Nvidia remains the leader in AI acceleration due in part to the maturity and breadth of its CUDA software. Open-source neural-network frameworks essentially use CUDA as the default solution for acceleration.

This paper provides an in-depth discussion of the software-related issues and solutions.

https://digiconasia.net/sponsored/dpu-based-hardware-acceleration-a-software-perspective

First Name

Last Name

Business Email

Company Name

Country

Job Title

Direct Number

Which area of research are you working on?

Others; please indicate

Where does your research compute mainly take place?

Other (please specify)

Stay connected! Stay connected! By checking this box, I agree that DigiconAsia can share my data with NVIDIA so that NVIDIA and its partners may contact me by email or phone to provide more information about this content.

NVIDIA Privacy Statement: NVIDIA Privacy Statement: Send me the latest enterprise news, announcements and more from NVIDIA. I can unsubscribe at any time. Your information will be handled in accordance with NVIDIA's Privacy Policy.

UTM

Featured

Generative AI in the workplace

Featured

Cloud data and storage challenges brought on by AI

Featured

India’s data center landscape is edging closer to responsible climate action

Featured

Can we expect less monetary easing by the Fed this year?

Featured

South Korea’s research communities can now collaborate on a 600G network

Featured

What factors are driving IT modernization in the global healthcare industry?

Perspectives: DPU-based hardware acceleration: a software perspective

https://digiconasia.net/sponsored/dpu-based-hardware-acceleration-a-software-perspective