content type videopublished September 23, 2025
Apple Workshop on Natural Language and Interactive Systems 2025: Speculative Streaming: Fast LLM Inference Without Auxiliary Models
AuthorsIrina Belousova (Apple)
Apple Workshop on Natural Language and Interactive Systems 2025: Speculative Streaming: Fast LLM Inference Without Auxiliary Models
AuthorsIrina Belousova (Apple)