NXP i.MX95 - Introduction - What is i.MX95?-Copy
This section provides an overview of the i.MX 95 applications processor, highlighting its advanced features in graphics and video processing, machine learning acceleration, and efficient CPU performance. The processor is equipped with robust real-time processing capabilities and enhanced security through the integrated EdgeLock® secure enclave, making it ideal for energy-efficient edge computing. Additionally, it supports a comprehensive array of peripherals and processor cores, ensuring versatility and high performance for a wide range of applications. The figure 1 shows the i.MX 95 Verdin Evaluation Kit.

The i.MX 95 chip integrates a range of advanced hardware units, enhancing its capabilities for various applications. The Central Processing Unit (CPU) comprises six Cortex-A55 cores, one Cortex-M7 core, and 512 kBytes of Cortex-M33 TCM. For graphics processing, it includes an Arm Mali-G310 GPU. Video processing is managed by a dedicated Video Processing Unit (VPU) alongside a JPEG encoder and decoder. Machine learning tasks are accelerated by four eIQ® Neutron N3-1024S Neural Processing Units (NPUs). Additionally, the chip features an Image Signal Processor (ISP) for handling RAW camera images through both streaming and DDR-to-DDR modes. Table 1 summarizes the hardware characteristics of the chip.
Feature | Specifications |
---|---|
Arm Cortex-A55 MPCore platform |
|
Arm Cortex-M33 and Cortex-M7 platforms |
|
Memory |
|
Machine learning | High-performance NPU |
Graphics |
|
Video processors |
|
Display controllers |
|
Camera and ISP |
|
Audio |
|
Connectivity |
|
Low-speed communication peripherals |
|
Timers and PWMs |
|
GPIO and pin multiplexing |
|
Analog |
|
Clocking |
|
Safety |
|
Security |
|
System debug |
|
Power management |
|
Packages |
|
Central Processing Unit (CPU)
Cortex A55
The Cortex-A55 is a high efficiency processor with Media Processing Engine (MPE) with NEON technology supporting Advanced Single Instruction Multiple Data architecture.
There are 6 Cortex-A55 cores on this chip.
Cortex-M7
The Arm Cortex-M7 processor is a highly efficient, high-performance, embedded processor that features low interrupt latency, low-cost debug, and has backward compatibility with existing Cortex-M profile processors.
There are 1 Cortex-M7 core on this chip.
Cortex-M33
In this chip, 512 kByte of Cortex-M33 TCM is present.
Graphics Processing Unit (GPU)
The chip has an Arm Mali-G310 and it is the "V2" configuration.
Mali GPU supports OpenGL, OpenGl ES, and Vulkan texture compression APIs.
The main feature of this GPU is to render complex graphics data and performs general purpose processing tasks under the control of the main application processor.
Video Processing Unit (VPU)
In addition to a VPU, the chip contains a JPEG encoder and a JPEG decoder:
- The VPU supports simultaneous encoding or decoding of 32 multiple streams. It features both H.264/AVC and H.265/HEVC encoders and decoders, with maximum throughput of 4K@60fps.
- The JPEG Encoder offers Motion JPEG payload encoding and supports up to 4 simultaneous streams. It achieves up to 400 MSamples/s and conforms to DICOM JPEG standards.
- The JPEG Decoder supports Motion JPEG payload decoding and can handle up to 4 simultaneous streams. It achieves up to 400 MSamples/s.
Neural Processing Unit (NPU)
The chip comes with four eIQ® Neutron N3-1024S NPU.
The NPU provides efficient processing of machine-learning workloads. It works in conjunction with the chip's CPU to deliver higher performance and lower power consumption for neural network (NN) inference.
This unit natively supports:
- Convolutional neural networks (CNN) including Atrous Separable Convolution networks and Depthwise Separable Convolutional networks, with matrix sizes of:
- 1x1, 3x3, 5x5, and 7x7, with stride and padding support
- 1x3, 3x1, 1x5, 5x1, 1x7, and 7x1
- Recurrent Neural networks (RNN), including minimum gated unit (MGU) networks
- Activation function for determining final value of a neuron. Includes support the activation functions: ReLU, ReLUx, HSWISH, Sigmoid, Tanh, eLU, SeLU, LeakyReLU, PReLU and exp(x)
- Sampling or reshaping functions
- Element-wise operations
- Layer operations
- Normalization functions
- Pooling functions
- Fully connected layer output
Image Signal Processor (ISP)
The ISP integrates several functions for handling RAW camera images and supports two modes:
- Streaming mode: Pixel stream is received from one of the two CSI-2 interfaces, and de-bayered pixels are written to DDR via DMAs.
- DDR to DDR mode: The ISP can process more than one camera stream. Processing is frame-based and can support multiple RAW cameras, with DDR as the source of input frames.