How Will Arm’s New V9 Architecture Transform IoT and Edge AI?

February 26, 2025
How Will Arm’s New V9 Architecture Transform IoT and Edge AI?

The recent announcement from Arm, a UK-based chip design company, marks a significant advancement in the integration of AI workloads with IoT devices. Arm has extended its v9 architecture, known as Armv9, to the far edge to handle AI tasks directly on IoT devices, including vehicles, cameras, machinery, and various cellular IoT sensors. The new platform, designed as an Armv9 edge AI platform, is optimized specifically for IoT applications. The key highlight of this new platform is its combination with the latest Cortex-A320 and Ethos-U85 central processing units (CPU) and neural processing units (NPU), hailed as a pioneering step in the industry.

Advancements in AI and IoT Integration

Enhanced AI Capabilities

Arm emphasized that the AI revolution is breaking free from its cloud constraints. The combination of Cortex-A320 and Ethos-U85 enables the processing and acceleration of AI models with over one billion parameters directly on devices. This combination leads to a remarkable tenfold (1,000 percent) improvement in machine learning (ML) performance and a 30 percent increase in scalar performance compared to the previous Cortex-A35. The introduction of these advanced AI capabilities aims to provide developers with significant benefits when working on IoT projects.

These enhancements are not just evolutionary but revolutionary in terms of their potential to transform IoT devices’ capabilities. On-device AI processing reduces latency and improves response times, making IoT devices more efficient and capable of handling complex tasks in real time. This transformation is poised to have far-reaching implications for industries relying on precise and immediate decision-making processes. Particularly for sectors like healthcare, autonomous vehicles, and industrial automation, this leap in AI processing power will enable intelligent, data-driven actions without relying heavily on cloud connectivity. Consequently, IoT devices can operate more autonomously, enhancing both their usefulness and reliability.

Comprehensive Coverage for IoT Devices

This move by Arm ensures comprehensive coverage of the IoT sector with its Armv9 CPUs, offering solutions spanning from high-performance devices to those where cost and energy efficiency are critical. The v9 architecture, and specifically the v9.2 iteration, supports SVE2 for improved ML performance and security features such as Pointer Authentication (PAC), Branch Target Identification (BTI), and Memory Tagging Extension (MTE). These features bolster the security and performance of IoT devices handling sensitive data in critical business and mission environments.

An important aspect is that these security enhancements are essential as IoT devices interact with sensitive data and critical infrastructures. By incorporating PAC, BTI, and MTE, Armv9 ensures that its devices uphold stringent security standards, mitigating the risk of exploitation in adversarial scenarios. This capability is crucial for industries such as finance, healthcare, and defense, where secure data handling and real-time processing are paramount. Moreover, the architecture is designed to accommodate diverse requirements, from high-performance devices to energy-efficient solutions, ensuring that a wide array of IoT applications can benefit from these advancements without compromising on functionality or security.

Software Enhancements and Developer Benefits

Kleidi Software Suite

Moreover, Arm has extended its Kleidi software, meant for AI inference workloads, to edge IoT devices. This software suite includes compute libraries that optimize AI and ML tasks on Arm-based CPUs, eliminating the need for additional developer efforts. Existing AI frameworks, like Llama.cpp and ExecuTorch (or LiteRT), have already integrated KleidiAI via the XNNPACK library, which includes neural network inference operators to enhance the performance of key models like Meta Llama 3 and Phi-3. According to the blog, Kleidi AI boosts the performance of the Cortex-A320 CPU by up to 70 percent when running certain datasets, like Microsoft’s Tiny Stories on Llama.cpp.

The performance boost offered by Kleidi AI represents a significant step towards making sophisticated AI accessible on IoT devices. By simplifying the developmental processes and improving the integration of AI frameworks, Arm alleviates a considerable amount of workload from developers. This efficiency allows them to focus on innovating rather than devoting time and resources to integrate various software functionalities manually. Furthermore, the seamless compatibility of Kleidi AI with popular AI frameworks means that developers can leverage existing tools and libraries, hastening their project’s time-to-market. In the competitive landscape of IoT, such advancements can differentiate products and define their success.

Time-to-Market and Software Compatibility

Paul Williamson, Arm’s senior vice president and general manager of the IoT business, highlighted the strategic importance of time-to-market, noting that it can be a decisive factor for new products. The new platform’s software compatibility with higher-end Cortex-A units was another feature accentuated by Williamson. Williamson further elaborated that AI’s shift towards the edge, facilitated by this new platform, would be a catalyst for the next wave of IoT innovation. Bringing intelligent decision-making closer to the source of data collection reduces latency and enhances privacy.

Reducing latency is critical in applications where real-time processing and decision-making are required. This includes autonomous systems, such as self-driving cars or real-time facial recognition systems, where milliseconds can be the difference between success and failure. By processing data at the edge, the need for round-trip data communication to centralized cloud infrastructures is minimized, resulting in faster and more reliable outputs. Additionally, local data processing enhances privacy since sensitive information can be analyzed on-device rather than transmitted across networks. Such privacy enhancements are not just beneficial but often mandatory in sectors dealing with personal or classified data, like healthcare and defense.

Industry Support and Future Implications

Industry Enthusiasm

Supporting voices from various companies in the industry echoed Arm’s enthusiasm. Miller Chang, president of the embedded sector at Advantech, recognized the acceleration of edge AI and the potential broad impacts of Arm’s innovative IoT computing architecture on various industry applications. Yasser Alsaied, vice president of IoT at AWS, was optimistic about how the integration with nucleus lite, a lightweight runtime of AWS IoT Greengrass, would benefit their edge AI applications in areas like precision agriculture, smart manufacturing, and autonomous vehicles.

The widespread industry support underscores Armv9’s promising capabilities to redefine IoT and edge AI applications. Companies across different fields see the extended Armv9 platform as a game-changer, capable of driving substantial advancements in AI-based innovations. For example, in precision agriculture, real-time data analytics can lead to more refined and immediate agricultural practices, improving yield and resource management. Similarly, smart manufacturing can benefit from autonomous systems that monitor and maintain production lines with minimal human intervention, effectively boosting productivity and reducing downtime. The shared enthusiasm signifies a unified belief in Arm’s architecture as a foundation for future technological strides.

Transformative Potential

Arm, a chip design company based in the UK, has made an important announcement that marks a major step forward in merging AI capabilities with IoT devices. The company has expanded its v9 architecture, called Armv9, to the far edge, enabling AI tasks to be performed directly on IoT devices like vehicles, cameras, machinery, and various cellular IoT sensors. This new Armv9 edge AI platform is optimized specifically for IoT applications.

A standout feature of this platform is its integration of the latest Cortex-A320 CPUs and Ethos-U85 NPUs, which are central and neural processing units, respectively. These advancements are seen as groundbreaking in the industry. The combination of these cutting-edge processors enhances the capability of IoT devices to handle complex AI workloads right at the edge, reducing latency and increasing efficiency. This significant move by Arm is set to drive advancements in various sectors, offering improved performance and new possibilities for IoT technology.

Subscribe to our weekly news digest.

Join now and become a part of our fast-growing community.

Invalid Email Address
Thanks for Subscribing!
We'll be sending you our best soon!
Something went wrong, please try again later