I am a researcher focusing on LLM optimization. Before that, I was an research at Microsoft Research. I am also collaborating with Anima Anandkumar. at Caltech, Beidi Chen at CMU. I am currently a researcher at TikTok
I am interested in bridging hardware constraints with the principles of LLM. I focus on developing efficient reasoning, training and inference. Check out my research for more details.
Additionally, I am a passionate community builder, which I founded the Efficient Reasoning workshops.