About this role
Role Summary

We are seeking an experienced Elastic Platform Architect to lead the architecture, design, governance, and implementation of a next-generation enterprise observability and operational intelligence platform based on the Elastic ecosystem.

The successful candidate will be responsible for defining enterprise observability architecture, designing scalable Elastic platform solutions, establishing platform engineering standards, and driving the implementation of cloud-native monitoring capabilities across complex enterprise environments.

The role requires strong expertise in Elastic Stack technologies, Kubernetes-based platforms, GitOps deployment methodologies, Infrastructure-as-Code, DevSecOps practices, and enterprise integration architecture. The candidate will work closely with enterprise architects, cybersecurity teams, infrastructure teams, platform engineers, and business stakeholders to deliver highly resilient, secure, and scalable observability platforms supporting critical business operations.

Key Responsibilities

Enterprise Architecture & Platform Strategy
- Define and maintain the target-state architecture roadmap for enterprise observability and monitoring platforms.
- Develop architecture blueprints, reference architectures, design standards, and governance frameworks for Elastic-based platforms.
- Lead architecture reviews, technical assurance activities, and solution governance processes.
- Establish platform engineering standards covering scalability, availability, security, performance, and operational resilience.
- Evaluate emerging observability technologies and recommend strategic platform enhancements.
- Collaborate with enterprise architecture teams to align observability capabilities with broader technology transformation initiatives.

Elastic Platform Architecture & Design
- Architect and govern enterprise deployments of:
  - Elasticsearch
  - Kibana
  - Logstash
  - Elastic Agent
  - Fleet Server
  - Elastic APM
  - Elastic Cloud on Kubernetes (ECK)
- Design distributed Elastic platform architectures supporting large-scale telemetry ingestion and analytics workloads.
- Establish platform standards for:
  - Index Management
  - Shard Allocation
  - Cluster Scaling
  - Data Retention
  - Performance Optimization
  - Disaster Recovery
- Define and govern Index Lifecycle Management (ILM) and Snapshot Lifecycle Management (SLM) strategies.
- Design platform resiliency and high-availability frameworks supporting enterprise-grade service levels.

Observability & Operational Intelligence
- Design enterprise observability frameworks covering:
  - Infrastructure Monitoring
  - Application Performance Monitoring (APM)
  - Distributed Tracing
  - Centralized Logging
  - Security Event Monitoring
  - Service Health Monitoring
  - Operational Analytics
- Establish telemetry collection and data normalization standards across multiple technology domains.
- Define alerting, event correlation, escalation, and incident visibility frameworks.
- Develop monitoring standards, service health models, and operational dashboards supporting business and technical stakeholders.
- Define SLI, SLO, SLA, and reliability measurement frameworks.

Cloud-Native & Kubernetes Architecture
- Design Kubernetes-native observability platforms leveraging Elastic Cloud on Kubernetes (ECK).
- Define deployment architectures supporting:
  - Kubernetes
  - OpenShift
  - Hybrid Cloud
  - Air-Gapped Environments
- Architect GitOps deployment frameworks utilizing:
  - FluxCD
  - GitLab
  - Helm
- Establish platform lifecycle management standards covering upgrades, deployments, configuration management, and operational governance.
- Design secure multi-cluster deployment architectures supporting enterprise scalability requirements.

Platform Integration Architecture
- Define enterprise integration patterns for onboarding infrastructure, application, network, and security telemetry sources.
- Architect integrations across:
  - Enterprise Applications
  - Cloud Platforms
  - Security Solutions
  - Network Infrastructure
  - Storage Platforms
  - Kubernetes Environments
  - Identity & Access Management Platforms
- Design data ingestion and enrichment frameworks supporting operational analytics and observability use cases.
- Lead integration of observability platforms with IT Service Management and operational workflow systems.

Security, Compliance & Governance
- Define platform security architecture including:
  - RBAC
  - Authentication & Authorization
  - Encryption Standards
  - Certificate Management
  - Secrets Management
  - Audit Controls
- Collaborate with cybersecurity teams to implement observability-driven security monitoring capabilities.
- Ensure compliance with enterprise security policies, governance standards, and regulatory requirements.
- Establish platform governance processes supporting change management, operational controls, and risk management.

Automation, DevSecOps & Platform Engineering
- Define Infrastructure-as-Code standards utilizing:
  - Terraform
  - Ansible
- Establish CI/CD and GitOps operating models supporting platform lifecycle automation.
- Design automated deployment, testing, upgrade, rollback, and recovery processes.
- Drive platform engineering best practices to improve operational efficiency and reliability.
- Promote automation initiatives supporting scalability, consistency, and operational excellence.

Technical Leadership & Stakeholder Engagement
- Provide technical leadership and architectural guidance to engineering and operations teams.
- Facilitate architecture workshops, technical reviews, and stakeholder engagement sessions.
- Mentor platform engineers and observability specialists on architecture standards and best practices.
- Develop architecture documentation, technical standards, solution blueprints, operational procedures, and implementation guidelines.
- Support strategic planning, technology assessments, and platform modernization initiatives.

Required Qualifications & Experience

Technical Expertise
- Extensive experience designing and implementing enterprise observability platforms.
- Strong hands-on expertise in:
  - Elasticsearch
  - Kibana
  - Logstash
  - Elastic Agent
  - Fleet Management
  - Elastic APM
  - Elastic Cloud on Kubernetes (ECK)
- Strong understanding of distributed systems, large-scale data platforms, and enterprise monitoring architectures.

Cloud & Platform Technologies
- Kubernetes
- OpenShift
- Docker
- AWS
- Azure
- Hybrid Cloud Environments

Automation & Platform Engineering
- Strong experience with:
  - Terraform
  - Ansible
  - GitOps
  - GitLab
  - FluxCD
  - Helm
  - CI/CD Pipelines

Observability & Operations
- Experience designing:
  - Monitoring Frameworks
  - Logging Architectures
  - Tracing Solutions
  - Alerting Frameworks
  - Event Correlation Models
  - Operational Intelligence Platforms

Enterprise Integration
- Experience integrating observability platforms with:
  - ServiceNow
  - Enterprise Security Platforms
  - Identity Management Solutions
  - Network and Infrastructure Technologies

Preferred Qualifications
- Elastic Certified Observability Engineer
- Certified Kubernetes Administrator (CKA)
- HashiCorp Terraform Associate
- Red Hat OpenShift Certification
- GitLab Professional Certifications
- AWS and/or Azure Certifications
- Experience delivering observability platforms within government agencies, financial institutions, telecommunications organizations, or large enterprise environments.
- Experience with AIOps, OpenTelemetry, Service Intelligence, and advanced analytics platforms.

Employment Type
Contract Position (6 Months). Renewable subject to project requirements, performance, and business needs.