Master Thesis: Semantic Chunk Sparse Attention for Large Language Model
Master Thesis: Semantic Chunk Sparse Attention for Large Language Model slides Lecturer : Prof. Torsten Hoefler, Dr Maciej Besta, Dr Grzegorz Kwasniewski 1. Introduction Large Language Models(LLMs)&