We are seeking a highly progressive Platform Engineer specializing in AI infrastructure and agentic execution environments to join our core cloud enablement team in Vancouver. In this role, you will bridge the gap between traditional Site Reliability Engineering (SRE) and cutting-edge Agentic AI operations. You will design, build, and operate secure multi-cloud landing zones, developer "golden paths," and reusable automated frameworks that empower application teams to safely deploy AI agents at scale.
...
This role is tailored for an engineer with a deep passion for automation, policy-as-code, and distributed systems. You will play a foundational role in defining how our enterprise architecture orchestrates, monitors, and secures large language model (LLM) agent frameworks, runtime function calling, and automated failover mechanics for high-volume retail environments.
Location: Vancouver, BC (Hybrid – 4 days per week onsite)
Contract Duration: 6-month contract with high likelihood of extension
Advantages
Pioneering Technical Landscape: Lead the implementation of modern agentic platform engineering frameworks for a world-renowned brand.
Elite Multi-Cloud Exposure: Deepen your infrastructure mastery by operating simultaneously across production AWS and Azure environments.
High Extensibility Indicators: Enter an initial 6-month contract with highly anticipated ongoing extension cycles as the AI platform grows.
Premier Workspace: Collaborate within a dynamic, culture-led, and people-first onsite setting in Vancouver.
Responsibilities
1. AI Platform Delivery & Agentic Orchestration
Agentic Tool Enablement: Build integration patterns, API mediation layers, and approval workflows supporting autonomous AI agent tool execution and runtime function calling.
Observability Ingestion: Integrate advanced distributed telemetry for agent runs (execution traces, evaluation metrics, latency logs, and token cost analytics).
Failover & Guardrails: Establish runtime safety controls for AI applications, embedding automated rollback scripts, cost control ceilings, and master kill-switches.
Landing Zone Architecture: Build and scale highly secure, automated multi-cloud landing zones (AWS and Azure) utilizing reusable Terraform modules.
CI/CD Pipeline Engineering: Construct and maintain robust GitLab CI/CD pipelines, package registries, and automated infrastructure release strategies.
2. Security, Policy-as-Code & SRE Controls
Policy-as-Code Enforcements: Implement strict automated infrastructure guardrails using Open Policy Agent (OPA), Conftest, or Azure Policies to guarantee security without breaking developer velocity.
Security Architecture: Embed least-privileged access, zero-trust network segmentation, private endpoints, KMS encryption keys, and advanced secrets management.
SRE Practices: Champion Site Reliability Engineering standards by managing Service Level Objectives (SLOs), calculating error budgets, configuring autoscaling matrices, and leading chaos engineering simulations.
FinOps Optimization: Apply cloud financial management protocols (structured resource tagging, budget alarms, anomaly detection, and cluster right-sizing).
3. Developer Enablement & Community Support
Golden Path Documentation: Author clear, accessible developer guides and self-service templates that streamline the adoption of core AI platform features.
Incident Response: Form part of a formal production on-call rotation, managing real-time incident resolution and driving exhaustive post-mortem evaluations.
Qualifications
Must-Have Technical Skills
Core Experience: 3–5 years of dedicated cloud platform engineering or SRE experience working with high-volume distributed systems natively in AWS and Azure.
Infrastructure as Code: Elite proficiency with Terraform, with an emphasis on creating modular, reusable code structures and multi-environment pipelines.
Runtime Languages: Coding proficiency in Python or Go, with a solid history of integrating with complex REST/JSON APIs.
CI/CD & Containers: Strong operational working knowledge of GitLab CI/CD, Docker containerization, and cloud orchestration layers.
AI & Agentic Literacy: Proven, hands-on exposure to AI/LLM development concepts (advanced prompting, tool/skill integration, and Retrieval-Augmented Generation [RAG]).
Coding Approach: Extensive experience leveraging AI and Agentic Coding tools to accelerate software delivery and maintain platform scripts.
Nice-to-Have Skills (Or Will Learn On the Job)
Direct experience building or operating internal Agent Frameworks (tool catalogs, runtime orchestration layers, prompt management).
Hands-on tracing setup using monitoring tools such as Datadog or Splunk.
Formal background in public sector or retail e-commerce compliance (PII protection, data masking rules).
Post-secondary degree in Computer Science, Software Engineering, or an equivalent technical field.
Summary
If you are a Forward-thinking Platform Engineer who loves policy-as-code, writes clean Python/Go, and is passionate about building the infrastructure that fuels enterprise AI agents, we encourage you to apply online at www.randstad.ca. Only qualified candidates will be contacted for the next steps. We look forward to hearing from you!
Randstad Canada is committed to fostering a workforce reflective of all peoples of Canada. As a result, we are committed to developing and implementing strategies to increase the equity, diversity and inclusion within the workplace by examining our internal policies, practices, and systems throughout the entire lifecycle of our workforce, including its recruitment, retention and advancement for all employees. In addition to our deep commitment to respecting human rights, we are dedicated to positive actions to affect change to ensure everyone has full participation in the workforce free from any barriers, systemic or otherwise, especially equity-seeking groups who are usually underrepresented in Canada's workforce, including those who identify as women or non-binary/gender non-conforming; Indigenous or Aboriginal Peoples; persons with disabilities (visible or invisible) and; members of visible minorities, racialized groups and the LGBTQ2+ community.
Randstad Canada is committed to creating and maintaining an inclusive and accessible workplace for all its candidates and employees by supporting their accessibility and accommodation needs throughout the employment lifecycle. We ask that all job applications please identify any accommodation requirements by sending an email to accessibility@randstad.ca to ensure their ability to fully participate in the interview process.
show more
We are seeking a highly progressive Platform Engineer specializing in AI infrastructure and agentic execution environments to join our core cloud enablement team in Vancouver. In this role, you will bridge the gap between traditional Site Reliability Engineering (SRE) and cutting-edge Agentic AI operations. You will design, build, and operate secure multi-cloud landing zones, developer "golden paths," and reusable automated frameworks that empower application teams to safely deploy AI agents at scale.
This role is tailored for an engineer with a deep passion for automation, policy-as-code, and distributed systems. You will play a foundational role in defining how our enterprise architecture orchestrates, monitors, and secures large language model (LLM) agent frameworks, runtime function calling, and automated failover mechanics for high-volume retail environments.
Location: Vancouver, BC (Hybrid – 4 days per week onsite)
Contract Duration: 6-month contract with high likelihood of extension
Advantages
Pioneering Technical Landscape: Lead the implementation of modern agentic platform engineering frameworks for a world-renowned brand.
...
Elite Multi-Cloud Exposure: Deepen your infrastructure mastery by operating simultaneously across production AWS and Azure environments.
High Extensibility Indicators: Enter an initial 6-month contract with highly anticipated ongoing extension cycles as the AI platform grows.
Premier Workspace: Collaborate within a dynamic, culture-led, and people-first onsite setting in Vancouver.
Responsibilities
1. AI Platform Delivery & Agentic Orchestration
Agentic Tool Enablement: Build integration patterns, API mediation layers, and approval workflows supporting autonomous AI agent tool execution and runtime function calling.
Observability Ingestion: Integrate advanced distributed telemetry for agent runs (execution traces, evaluation metrics, latency logs, and token cost analytics).
Failover & Guardrails: Establish runtime safety controls for AI applications, embedding automated rollback scripts, cost control ceilings, and master kill-switches.
Landing Zone Architecture: Build and scale highly secure, automated multi-cloud landing zones (AWS and Azure) utilizing reusable Terraform modules.
CI/CD Pipeline Engineering: Construct and maintain robust GitLab CI/CD pipelines, package registries, and automated infrastructure release strategies.
2. Security, Policy-as-Code & SRE Controls
Policy-as-Code Enforcements: Implement strict automated infrastructure guardrails using Open Policy Agent (OPA), Conftest, or Azure Policies to guarantee security without breaking developer velocity.
Security Architecture: Embed least-privileged access, zero-trust network segmentation, private endpoints, KMS encryption keys, and advanced secrets management.
SRE Practices: Champion Site Reliability Engineering standards by managing Service Level Objectives (SLOs), calculating error budgets, configuring autoscaling matrices, and leading chaos engineering simulations.
FinOps Optimization: Apply cloud financial management protocols (structured resource tagging, budget alarms, anomaly detection, and cluster right-sizing).
3. Developer Enablement & Community Support
Golden Path Documentation: Author clear, accessible developer guides and self-service templates that streamline the adoption of core AI platform features.
Incident Response: Form part of a formal production on-call rotation, managing real-time incident resolution and driving exhaustive post-mortem evaluations.
Qualifications
Must-Have Technical Skills
Core Experience: 3–5 years of dedicated cloud platform engineering or SRE experience working with high-volume distributed systems natively in AWS and Azure.
Infrastructure as Code: Elite proficiency with Terraform, with an emphasis on creating modular, reusable code structures and multi-environment pipelines.
Runtime Languages: Coding proficiency in Python or Go, with a solid history of integrating with complex REST/JSON APIs.
CI/CD & Containers: Strong operational working knowledge of GitLab CI/CD, Docker containerization, and cloud orchestration layers.
AI & Agentic Literacy: Proven, hands-on exposure to AI/LLM development concepts (advanced prompting, tool/skill integration, and Retrieval-Augmented Generation [RAG]).
Coding Approach: Extensive experience leveraging AI and Agentic Coding tools to accelerate software delivery and maintain platform scripts.
Nice-to-Have Skills (Or Will Learn On the Job)
Direct experience building or operating internal Agent Frameworks (tool catalogs, runtime orchestration layers, prompt management).
Hands-on tracing setup using monitoring tools such as Datadog or Splunk.
Formal background in public sector or retail e-commerce compliance (PII protection, data masking rules).
Post-secondary degree in Computer Science, Software Engineering, or an equivalent technical field.
Summary
If you are a Forward-thinking Platform Engineer who loves policy-as-code, writes clean Python/Go, and is passionate about building the infrastructure that fuels enterprise AI agents, we encourage you to apply online at www.randstad.ca. Only qualified candidates will be contacted for the next steps. We look forward to hearing from you!
Randstad Canada is committed to fostering a workforce reflective of all peoples of Canada. As a result, we are committed to developing and implementing strategies to increase the equity, diversity and inclusion within the workplace by examining our internal policies, practices, and systems throughout the entire lifecycle of our workforce, including its recruitment, retention and advancement for all employees. In addition to our deep commitment to respecting human rights, we are dedicated to positive actions to affect change to ensure everyone has full participation in the workforce free from any barriers, systemic or otherwise, especially equity-seeking groups who are usually underrepresented in Canada's workforce, including those who identify as women or non-binary/gender non-conforming; Indigenous or Aboriginal Peoples; persons with disabilities (visible or invisible) and; members of visible minorities, racialized groups and the LGBTQ2+ community.
Randstad Canada is committed to creating and maintaining an inclusive and accessible workplace for all its candidates and employees by supporting their accessibility and accommodation needs throughout the employment lifecycle. We ask that all job applications please identify any accommodation requirements by sending an email to accessibility@randstad.ca to ensure their ability to fully participate in the interview process.
show more