- π Lead Software Engineer at Baseten, focusing on model performance optimization
- πΌ Previously at Meituan, specialized in GPU inference using TensorFlow/TensorRT for CTR and PyTorch for LLMs. Earlier at Baidu, familiar with bRPC and Babylon
- π» Open Source: Team Member at LMSYS Org, core developer of SGLang, committer for FlashInfer and LMDeploy
- π Check out my talk on SGLang at GPU MODE
- π« Contact: [email protected] | Telegram
- π More: LinkedIn | Homepage
zhyncs
Follow
π―
Pinned Loading
-
sgl-project/sglang
sgl-project/sglang PublicSGLang is a fast serving framework for large language models and vision language models.
-
flashinfer-ai/flashinfer
flashinfer-ai/flashinfer PublicFlashInfer: Kernel Library for LLM Serving
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.