Senior Software Engineer – Toronto We're looking for a Senior Software Engineer to join our team in Toronto, focusing on building and optimizing state‑of‑the‑art LLM‑powered agents that can reason, plan, and automate workflows for users. You will lead the design and development of search and retrieval agent systems that enable users to generate competitive insights for their business. In this role, you will own projects end‑to‑end, guiding architecture decisions, experimentation strategy, and production readiness for LLM‑powered retrieval and generation workflows. You will shape how we integrate retrieval‑augmented generation (RAG), dense retrieval, query understanding, and agentic reasoning loops to deliver fast, accurate, and trusted search experiences at scale. What You’ll Do Build and ship backend systems that power agentic workflows, designing retrieval pipelines, orchestration layers, and multi‑step agent architectures that turn millions of competitive data points (news, press releases, webpage changes, Slack posts, emails, reviews, CRM data) into actionable intelligence for our customers. Own evaluation of agentic systems at scale. Develop and operate evaluation frameworks (automated, offline, and human‑in‑the‑loop) that measure relevance, quality, latency, and end‑to‑end task success across our agent pipelines. Define what "good" looks like and build the infrastructure to measure it continuously. Design and optimize retrieval and ranking systems. Work across hybrid retrieval, re‑ranking, query rewriting, and post‑retrieval synthesis to ensure our agents surface the right information at the right time. Understand the tradeoffs between BM25, dense retrieval, and hybrid approaches and know when each matters. Improve LLM‑powered workflows end to end. From prompt design and retrieval strategy to caching and latency optimization, make our agent responses faster, more accurate, and more reliable in production. Ship with the customer in mind. Connect technical decisions to customer outcomes. Prioritize what to build next based on how customers use the product, ship iteratively, measure impact, and course‑correct quickly. Collaborate across product, infrastructure, and data teams—align technical direction with product goals, contribute to architecture decisions, and help the team move faster by establishing patterns and best practices for production‑grade agentic systems. Stay on the frontier. Evaluate and integrate advances in LLMs, retrieval architectures, and agentic reasoning. Bring strong opinions about where this space is heading. What You Bring Experience building and operating backend systems in production, with meaningful experience in at least one of: search/retrieval, data pipelines, distributed systems, or API‑heavy service architectures. Hands‑on experience with search, retrieval, or ranking systems. Built or significantly improved retrieval pipelines and understand information retrieval fundamentals (hybrid retrieval, relevance tuning, query understanding). Experience building or evaluating agentic/LLM‑powered systems. Worked with retrieval‑augmented generation, multi‑step agent workflows, or similar architectures and have thought critically about how to evaluate their output quality at scale. Strong software engineering fundamentals. Write clean, maintainable, well‑tested code. Comfortable with Python and experienced with backend frameworks, APIs, and production infrastructure. Care about reliability, observability, and CI/CD. Familiarity with vector databases and search infrastructure. Worked with tools like FAISS, PGVector, Pinecone, Weaviate, Elasticsearch, or OpenSearch and understand operational tradeoffs. Experience with cloud infrastructure (AWS, GCP, or Azure) and building systems that handle scale, large data volumes, low‑latency requirements, and high availability. Use AI coding tools to accelerate your work. Integrated tools like Copilot, Cursor, or similar into your development workflow and can speak to how they've changed the way you build software. Customer‑oriented mindset. Shipped features where you understood the end‑user problem, not just the technical specification. Ability to lead projects and provide technical direction. Own a problem end‑to‑end, make sound architectural decisions, and help others on the team level‑up. Nice to Have Experience designing multi‑agent systems or complex orchestration workflows. Background in conversational search or dialogue systems. Contributions to open‑source projects in search, retrieval, or the LLM ecosystem. Interest in sharing learnings externally (blog posts, talks, open‑source contributions). What Success Looks Like Take ownership and run with ambiguous problems. Jump into new areas and rapidly learn what’s needed to deliver solutions. Bring scientific rigor while maintaining a pragmatic delivery focus. See unclear requirements as an opportunity to shape the solution. Our Tech Stack LLM platforms: OpenAI, Anthropic, open‑source models ML frameworks: PyTorch, Transformers, spaCy Search/Vector DBs: Elasticsearch, Pinecone, PostgreSQL MLOps tools: Weights & Biases, MLflow, Langfuse Infrastructure: Docker, Kubernetes, GCP Development: Python, Git, CI/CD Compensation Range: CA$145K - CA$183K By submitting your application, you agree that Klue may collect your personal data for recruiting, global organization planning, and related purposes. We may use automated or AI‑assisted tools to support our recruitment process. All hiring decisions are made by our hiring team. #J-18808-Ljbffr