aiMotive ships first aiWare4 NPU production RTL

News & insights

Check our latest stories on automated driving

Engineer holding production validated neural processing unit&imagePreview=1

Written by aiMotive / Posted at 5/5/22

aiMotive ships first aiWare4 NPU production RTL

Budapest, Hungary, 5th May 2022 – aiMotive, one of the world's leading modular automated driving technology suppliers, announced today that it has shipped final production-validated RTL of the latest generation of its ultra-high efficiency NPU aiWare4 to lead customers. The latest aiWare4 RTL shipped delivers up to 5x the performance of the previous generation aiWare3 NPUs, while using less than 2x the silicon area. This demonstrates the exceptional scalability, PPA and performance per mm2 of aiWare4, while extending the feature set and operational “sweet spot” for high-efficiency CNN acceleration.

“Our aiWare team has relentlessly refined our production validation processes to enable us to deliver customer configurations at record speed to full automotive quality for aiWare4,” says Márton Fehér, SVP hardware engineering at aiMotive. “Thanks to our sophisticated wavefront processing making full use of our new WFRAM technology, plus many other architectural advances from aiWare3, we have been able to achieve exceptional PPA for our lead customers without compromising our leadership in high-efficiency execution up to 95% of the most demanding automotive inference CNN workloads”.

To achieve extremely demanding PPA constraints from customers, aiMotive was able to fine-tune the exact feature set of the aiWare4 production RTL to best meet customers’ requirements. Making full use of the physical tile-based layout and dataflow methodologies, the aiWare team demonstrated clock speeds for the production RTL of up to 1.3GHz over the full automotive AEC-Q100 Grade 2 temperature range on a 14nm process. The aiWare4 hardware IP has been assessed externally as suitable for certification to ASIL-B or higher as an SEooC.

The aiWare4 NPU scales from 1 to 256 TOPS and is supported by an exceptionally comprehensive SDK featuring highly accurate offline performance estimation, enabling customers to accurately estimate and fine-tune their CNN workloads to within 5% of final silicon performance prior to first silicon.

aiWare4 hardware IP is available now for licensing.

For more details, click here.