Menu
Sign in

Together AI has just closed a $305 million Series B round, pushing its valuation to $3.3 billion. The capital fuels a rapid rollout of GPU‑accelerated clusters and a new self‑service platform that lets developers spin up NVIDIA‑powered resources in minutes.

Key product milestones include: - * a self‑service GPU cluster provisioning tool that automates deployment; - * an expanded AI Acceleration Cloud featuring NVIDIA HGX B200 and upcoming Blackwell GPUs; - * a growing library of over 200 open‑source generative models that can be fine‑tuned via a single API.

With data centers now live in Sweden and plans for large‑scale Blackwell GPU deployments, Together AI is positioning itself as the bridge between open‑source innovation and enterprise‑grade performance. The company’s roadmap points to deeper integrations with NVIDIA’s ecosystem and a broader European footprint.

Serverless AI is reshaping how developers deploy models

Replicate’s integration with Cloudflare Workers AI unlocks a massive library of production-ready models, enabling instant, scalable inference without managing infrastructure. The platform’s API-first design lets teams spin up custom models in minutes, while the serverless GPU tier keeps costs predictable.

Key takeaways: - Zero‑ops deployment: Deploy models directly to the edge. - Scalable GPU inference: Pay per request, no upfront hardware. - Ecosystem synergy: Cloudflare Workers, Vercel, and AWS Lambda all support Replicate’s JavaScript client.

Whether you’re prototyping a chatbot or running large‑scale generative workloads, Replicate’s serverless approach delivers speed, reliability, and cost‑efficiency.

OpenRouter offers a single API endpoint that unlocks access to over 400 AI models from 50+ providers, eliminating vendor lock‑in and simplifying multi‑model integration.

  • One key, one call: Use a single OpenAI‑compatible endpoint to switch between models.
  • Transparent reasoning: Enable step‑by‑step reasoning in responses for auditability.
  • Easy onboarding: Create an account, generate an API key, and start calling models in minutes.

To get started, visit the OpenRouter website, connect your API key, and explore the extensive documentation and tutorials available on YouTube and the official docs.

Web Results

Blog | Together AI

Dive into our latest product launches, company news, and technical guides on open-source AI models.

www.together.ai/blog

Together AI | Company Overview & News

Earlier this year, the company reached a valuation of $3.3 billion after closing a $305 million round led by General Catalyst, and acquired Refuel.ai in May. Read Less ... Forbes Technology Senior Writer Richard Nieva sits down with Luminance CEO Eleanor Lightbody, Abridge Cofounder & CTO Zachary Lipton, and Together AI Cofounder & CTO Ce Zhang to gain insights about the AI revolution - from foundational models and enterprise platforms to breakthrough innovations shaping the future.

www.forbes.com/companies/together-ai/

Together AI launches self-service AI clusters - DCD

News · The Cloud &amp; Hybrid Channel ... 2025 · By · Georgia Butler Have your say · <strong>Together AI has launched a service automating the provisioning of GPU clusters for customers</strong>....

www.datacenterdynamics.com/en/news/to...

Our Investment in Together AI

<strong>Together AI is pioneering a new paradigm: AI infrastructure that combines the best of open-source innovation with enterprise-grade performance</strong>. The platform provides over 200 generative AI models, enabling developers to experiment, customize, ...

www.generalcatalyst.com/stories/our-i...

Together AI | The AI Native Cloud

Reliably deploy models with unmatched price-performance at scale. Benefit from inference-focused innovations like the ATLAS speculator system and Together Inference Engine.

www.together.ai

Together AI - Crunchbase Company Profile & Funding

<strong>A platform enabling users to fully fine-tune or adapt open-source AI models with complete ownership and easy-to-use APIs</strong>. ... Access a complete feed of recent news and press, from product launches to leadership changes.

www.crunchbase.com/organization/together-1a7e

Documentation – Replicate

Learn to build, deploy, and scale your own custom model on Replicate. ... Create and deploy a web app with a serverless backend and a React frontend in under 60 seconds.

replicate.com/docs

Videos