N
Hacker Next
new
show
ask
jobs
submit
login
Accelerating Gemma 4: faster inference with multi-token prediction drafters
blog.google
685 points by
amrrs
7 days ago
|
328 comments
add comment