Running LLMs on the NPU of the Rockchip RK3588

  2,454 views

LivingLinux


In this video I show how to run a Large Language Model (LLM) on the NPU of the Rockchip RK3588.
The Ubuntu 24.04 image by Joshua Riek for Rockchip RK3588 SBCs comes with NPU driver version 0.9.6.
You can find it here: github.com/Joshua-Riek/ubuntu...
Someone on Reddit posted this information: / psa_ubuntu_rockchip_fr...
You can find the program to use the NPU here: github.com/Pelochus/ezrknn-llm
Converted model files: huggingface.co/Pelochus/ezrkl...
Allow more open files: ulimit -n 16384
Check NPU load: sudo cat /sys/kernel/debug/rknpu/load
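A minimal sketch for reading the NPU load from a script instead of by hand. The output format assumed here (`NPU load:  Core0: 25%, Core1:  0%, Core2:  0%,`) is an assumption and may differ between driver versions; the function name `parse_npu_load` is hypothetical. Reading the file needs root, just like the `sudo cat` command above.

```python
import re
from pathlib import Path

# Path taken from the command above; reading it normally requires root.
LOAD_PATH = Path("/sys/kernel/debug/rknpu/load")


def parse_npu_load(text: str) -> dict:
    """Return per-core load percentages from text such as
    'NPU load:  Core0: 25%, Core1:  0%, Core2:  0%,' (assumed format)."""
    return {
        f"core{idx}": int(pct)
        for idx, pct in re.findall(r"Core(\d+):\s*(\d+)%", text)
    }


if __name__ == "__main__":
    try:
        print(parse_npu_load(LOAD_PATH.read_text()))
    except OSError as exc:
        # Missing debugfs entry or insufficient permissions.
        print(f"Could not read {LOAD_PATH}: {exc}")
```

If the file uses a different layout, only the regular expression needs to change.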
00:00 Intro
00:13 Reddit RockchipNPU Community
01:13 ezrknpu ezrknn-llm
02:26 Installation
04:33 ulimit
05:23 Llama 2 7B on the NPU
09:25 Llama 2 7B on the CPU
If this video was helpful, please like, comment and subscribe!
Bluesky: bsky.app/profile/livinglinux....
#Radxa #Rock5 #rockchip #RK3588 #ubuntu #ai

Comments: 16
@ribeiro4642 23 days ago
Thanks for the video!
@devlogschannel 28 days ago
Hi, thanks for sharing this great video. Is there any way to fully use all 3 cores of the NPU?
@stegofroggy a month ago
Was the power consumption 8 watts when running two of the three NPU cores? I wonder how the power consumption changes as a function of the number of NPU cores in use.
@LivingLinux a month ago
Idle is around 4W. Yes, it was running on 2 cores (but not at 100%) at around 8W. So my estimate is 12W total when running all 3 NPU cores at full load. Do note that I was running from a microSD card. Running from an M.2 drive might use even more power.
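The estimate above can be sketched as a simple linear model: idle power plus a fixed cost per fully loaded NPU core. This is only an illustration. The function name and the `avg_utilization` parameter are hypothetical, and the video does not state the actual per-core utilization.

```python
def estimate_npu_power(idle_w: float, measured_w: float, cores_used: int,
                       avg_utilization: float, target_cores: int) -> float:
    """Linearly extrapolate board power to `target_cores` at full load."""
    if cores_used <= 0 or not 0 < avg_utilization <= 1:
        raise ValueError("cores_used must be positive and utilization in (0, 1]")
    # Power attributed to one NPU core running at 100%.
    per_core_full_w = (measured_w - idle_w) / (cores_used * avg_utilization)
    return idle_w + per_core_full_w * target_cores


# Figures from the reply: 4 W idle, about 8 W with 2 cores active.
# If both cores had been at 100%, three cores would land at 10 W.
print(estimate_npu_power(4.0, 8.0, 2, 1.0, 3))
# An assumed 75% average utilization gives roughly the 12 W estimate.
print(round(estimate_npu_power(4.0, 8.0, 2, 0.75, 3), 1))
```

A real board is unlikely to scale perfectly linearly, so a measurement with all three cores loaded would be the only reliable answer.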
@StasNsky a month ago
What was the exact speed in t/s for Llama 2 on the NPU?
@Crftbt a month ago
Is the NPU failing to complete with the 7-billion-parameter model because it is running out of memory? Is there a log file somewhere?
@LivingLinux a month ago
I don't think it's running out of memory. According to htop, memory usage is around 72% and stable. Perhaps there is a reason the NPU driver version number is still below 1.0.
@Freshbott2 a month ago
Hi, sorry, this isn't really related to your video, but did you compile U-Boot for this device? I'm at my wit's end trying to follow the Rockchip Wiki for U-Boot.
@LivingLinux a month ago
No, I have never compiled U-Boot. Do you have a Radxa or Orange Pi board (or another one)? It's probably better to ask in their forums. forum.radxa.com/ www.orangepi.org/orangepibbsen/
@Freshbott2 a month ago
@LivingLinux I've got the FriendlyElec CM3588 and a lot of regret, as I don't want to depend on someone's Google Drive for OS support, now or in the future. But thank you, I'll see if someone has more detail for an Orange Pi.
@user-gq7kq7ju4t a month ago
Hi. As I understand it, this video covers Ubuntu on the RK3588 and using the RK3588's NPU. If I have an RK3568, can I use this source?
@LivingLinux a month ago
It needs NPU driver 0.9.6. You can check it with this command: dmesg | grep -i rknpu
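A minimal sketch of automating that check. It assumes the kernel log contains a line such as `Initialized rknpu 0.9.6 ...`; the exact wording is an assumption and may vary by kernel build. The helper names are hypothetical, and reading `dmesg` may require root on some systems.

```python
import re
import subprocess
from typing import Optional, Tuple

REQUIRED = (0, 9, 6)  # driver version named in the reply above


def rknpu_version(log_text: str) -> Optional[Tuple[int, ...]]:
    """Extract the first 'rknpu X.Y.Z' version found in kernel log text."""
    match = re.search(r"rknpu\s+(\d+)\.(\d+)\.(\d+)", log_text, re.IGNORECASE)
    return tuple(int(part) for part in match.groups()) if match else None


def driver_is_new_enough(log_text: str) -> bool:
    """True if a driver version was found and it is at least REQUIRED."""
    version = rknpu_version(log_text)
    return version is not None and version >= REQUIRED


if __name__ == "__main__":
    try:
        log = subprocess.run(["dmesg"], capture_output=True,
                             text=True, check=False).stdout
    except OSError:
        log = ""  # dmesg not available on this system
    print("rknpu version:", rknpu_version(log))
    print("meets 0.9.6 requirement:", driver_is_new_enough(log))
```

Comparing versions as integer tuples avoids the string-comparison trap where "0.9.10" would sort before "0.9.6".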
@user-gq7kq7ju4t a month ago
Thanks for the response. But the RK3568's rknpu driver version is 0.9.0. I tried updating the kernel, but it isn't easy. Could you tell me which development board you have?
@LivingLinux a month ago
@user-gq7kq7ju4t I have the Radxa Rock 5B and 5A. I also have some Mekotronics devices, but I mainly use Android on them.
@ps3301 a month ago
It is so slow. It might as well be useless.
@LivingLinux a month ago
It's not fast, but it is energy-efficient.