Make LLMs Efficient Again!

The BJTU Edge Computing & Edge Intelligence Group is pioneering the integration of large language models with edge devices. Our current focus areas include:
Token Optimization
Distributed Inference
Speculative Decoding
LLM Routing

Jetson Deployment

Llama2 on Jetson Orin Nano

Windows Optimization

DeepSeek-R1 on Windows Laptop

Android Demo

Gemma on Android Phone