Infinity is a high-throughput, low-latency REST API for serving text-embeddings, reranking models and clip.