Inference

Inference.net is a technology company that pools idle GPU capacity from around the globe into a single computing platform for large language model (LLM) inference. Its network connects more than 5,000 GPUs and hundreds of terabytes of VRAM. Based in downtown San Francisco, the company operates with a small, well-funded team that values collaboration, craftsmanship, and innovation. Investors include a16z CSX and Multicoin. Inference.net focuses on building user-friendly, high-performance web applications while fostering a culture of mentorship and continuous improvement.