09:04 hallo1: Rockchip changed the `struct rknpu_submit`in their latest driver 0.9.6 (20240322)
09:07 tomeu: hallo1: hi, what do you mean by stating that?
09:11 hallo1: I tested your driver with mobilenetv1 and it not working on the latest kernel
09:12 tomeu: oh, right, yeah, that's the downside of out-of-tree kernel drivers and userspace blobs
09:12 hallo1: dmesg shows `failed to wait job` error
09:12 tomeu: I will be starting to write a proper kernel driver soon
09:13 hallo1: The latest driver most obviously adds a iommu_domain_id field
09:13 hallo1: seems used to bypass 4GB memory limit, for their LLM inference library
09:15 tomeu: 🤦
09:24 phh: ah, I assumed iommu_domain_id was used for secure stuff
09:25 phh: like REE/Linux domain vs TEE/OP-TEE domain
09:25 hallo1: You can learn about this at the RK document `02_Rockchip_RKNPU_User_Guide_RKNN_SDK_V2.0.0beta0_EN.pdf` page 55
09:26 hallo1: > iommu_domain_id int32_t "Index of the IOMMU address space domain where the matrix context resides. The IOMMU address space corresponds to the context one-to-one, and the size of each IOMMU address space is 4GB. This parameter is mainly used when the parameter specifications of matrices A, B, and C are large, and the memory allocated by the NPU in a certain domain exceeds 4GB and needs to be switched to another domain."
09:27 phh: ah, makes sense, thanks