Search "AI Inference"

6 results

huggingface/transformers
Open SourcePythonAIMachine LearningNLP

A Python library that lets anyone easily use top-tier AI models for text, images, and audio without training from scratch.

159,560
Tencent/ncnnDeveloper Tools
Tencent/ncnn
GitHubDeep LearningMobilencnnAI Inference

Tencent's open-source mobile AI inference framework, runs deep learning models offline on phones with high speed and no dependencies.

23,117
google-ai-edge/gallery
大模型AIgoogle离线隐私

A mobile AI app by Google that lets you experience large language models and generative AI features offline, ensuring privacy and speed.

21,448
Apache TVMDeveloper Tools
apache/tvm
PythonAImachine-learningcompilerdeployment

Apache TVM is an open-source compiler framework that compiles AI models into efficient code, making them run faster and more compatible on various hardware like CPUs and GPUs.

13,279
deepseek-ai/DeepGEMMDeveloper Tools
deepseek-ai/DeepGEMM
LLMDeepSeekCUDAGPU OptimizationAI Infrastructure

A high-performance computing library for NVIDIA GPUs that significantly accelerates large model training and inference, boosting underlying code efficiency.

6,493
DFlashAI & Automation
z-lab/dflash
大模型加速推理优化Speculative DecodingvLLMSGLang

A tool that speeds up large model generation by predicting blocks of content to reduce waiting time.

1,850