Llama.cpp
llama.cpp python library is a simple Python bindings for
@ggerganovllama.cpp.This package provides:
- Low-level access to C API via ctypes interface.
 - High-level Python API for text completion
 
OpenAI-like APILangChaincompatibilityLlamaIndexcompatibility- OpenAI compatible web server
 
- Local Copilot replacement
 - Function Calling support
 - Vision API support
 - Multiple Models
 
Overviewโ
Integration detailsโ
| Class | Package | Local | Serializable | JS support | 
|---|---|---|---|---|
| ChatLlamaCpp | langchain-community | โ | โ | โ | 
Model featuresโ
| Tool calling | Structured output | JSON mode | Image input | Audio input | Video input | Token-level streaming | Native async | Token usage | Logprobs | 
|---|---|---|---|---|---|---|---|---|---|
| โ | โ | โ |