wecoso
  • bm
  • stills
  • 2025-07-06
    • container
    • dustynv
    • jetson
    • rag
    • speech

    NanoLLM is a lightweight, high-performance library using optimized inferencing APIs for quantized LLM’s, multimodality, speech services, vector databases with RAG, and web frontends with (jetson container)