Building the rails for software on-demand

From the beginning, Relace was about building the right tool for the right job. We isolated the tasks that coding agents struggled with and trained specialized SLMs to do them instead. Starting with apply and embed/rerank, our models have made code generation faster, better, and cheaper for everyone.

We envision a future where coding agents can be built into all software systems -- the static SaaS pages of today will become malleable interfaces for the user to decide. At Relace, we want to build the models and infrastructure to make this possible.

Our team is based in-person in SF, where we work together. We're a team of ex-academics, founders, and hackers. If you're stubbornly optimistic and don't back away from hard technical problems, join us.

Join the team

Meet the team

Trusted by the best leading brands:

FAQs

Frequently asked questions

Why Relace?
Relace is purpose-built for coding workflows. Instead of relying on general-purpose LLMs, our in-house models specialize in retrieval, merging, and code generation — making them faster, more reliable, and easier to integrate into engineering pipelines. Teams use Relace to cut down errors, accelerate development, and gain a real competitive edge.
Is Relace SOC 2 compliant?
Yes. Relace is built with enterprise security in mind, and our systems are SOC 2 compliant. This ensures your data is handled with the highest standards of confidentiality and integrity.
Can I self-host Relace models?
Absolutely. For teams with stricter compliance or latency requirements, we offer on-premise and VPC-isolated deployments. You get full control of your environment while still benefiting from Relace's optimized inference stack.
How does Relace handle sensitive code?
Your code never leaves your controlled environment when using self-hosted or VPC deployments. Even on our hosted tier, all data is encrypted in transit and at rest.
What’s the main advantage for source control workflows?
Relace outperforms frontier LLMs at code retrieval and merging. Our models can search a codebase in under a second and merge at 10,000 tokens per second — speeding up PR reviews, automated fixes, and CI/CD processes.
How fast in onboarding?
You can start experimenting within minutes using our hosted API. For enterprise and self-hosted setups, our team provides guided onboarding to help you deploy Relace quickly and securely in your stack.