{"id":"lemonade","name":"Lemonade","af_score":62.0,"security_score":0.0,"reliability_score":null,"what_it_does":"Local AI inference server supporting text generation (LLM), image generation, speech-to-text, and text-to-speech across CPU, GPU (Vulkan/ROCm), NPU (XDNA2), and Apple Silicon. Exposes an OpenAI-compatible REST API on localhost:8000 for drop-in integration with existing tools.","best_when":"You want to run AI models locally behind an OpenAI-compatible API, especially on AMD hardware, NPUs, or Apple Silicon, without paying cloud costs or letting data leave your machine.","avoid_when":"You need NVIDIA CUDA optimization, production-scale serving, or models that exceed your local hardware capacity. In those cases, use vLLM, Ollama, or a cloud API instead.","last_evaluated":"2026-03-01T09:50:05.778214+00:00"}