Principal Engineer, AI
SG
Job Description
a) IP & Broadband (Core + Access + Transmission)
• Design and implement automation frameworks across IP Core (BNG, routing), Broadband Access (OLT/ONT), and transmission and transport networks.
• Enable automated provisioning, configuration management, and lifecycle operations.
• Develop standard APIs and integration interfaces for network actions.
• Implement configuration compliance and drift management mechanisms.
• Support high availability and resilience readiness (failover, rerouting support).
• Integrate network telemetry into centralized platforms to support AI-driven diagnostics and closed-loop automation workflows.
• Ensure all automation workflows are secure, auditable, and compliant.
b) Data Centre Operations (Infrastructure Automation + Energy & Space Optimization)
• Automate compute, storage, and network provisioning across data center environments.
• Develop runbook automation for operational tasks (restart, failover, scaling).
• Automate patching, upgrades, and lifecycle management.
• Enable infrastructure data pipelines and integrations to support AI-based anomaly detection and predictive maintenance systems.
• Enable event-driven execution workflows from monitoring systems.
• Maintain automation pipelines (CI/CD) for infrastructure operations.
• Ensure robust execution frameworks with rollback and validation mechanisms.
• Enable centralized monitoring of power consumption (UPS, PDU), cooling systems (HVAC, CRAC), and environmental metrics (temperature, airflow, humidity).
• Enable automation readiness for cooling optimization and airflow balancing, and for power utilization tracking (PUE and efficiency metrics).
• Enable execution of optimization actions to improve energy efficiency and reduce costs.
• Support data collection and execution readiness for AI-driven energy optimization (cooling efficiency, power balancing).
c) Business Innovation & Strategic Projects
• Embed “automation-first” principles into transformation programs (e.g., iBNG, SiX AntiDDoS, IP-Optical SRv6 Network).
• Enable zero-touch provisioning (ZTP) for new deployments.
• Develop API-driven and programmable interfaces for new systems.
d) Monitoring, Visibility & Dashboard Enablement
• Implement centralized monitoring frameworks across network and data center domains.
• Enable real-time visibility of performance, utilization, and environmental metrics.
• Integrate telemetry into DCIM and monitoring platforms.
• Support development of operational dashboards (NOC / CXOps) and executive dashboards (capacity, utilization, risk).
• Enable alarm ingestion and visualization (without owning RCA logic).
• Enable dashboards that incorporate AI-driven insights (e.g., anomaly indicators, predictive alerts).
• Ensure telemetry pipelines support AI/ML consumption (real-time, structured, high-quality data feeds).
e) Automation of Operations
• Develop automation for provisioning and configuration management, infrastructure and network lifecycle operations, and runbooks for repetitive tasks.
• Enable execution of network actions (configuration updates, resets) and infrastructure actions (restart, scaling).
• Provide secure, standardized APIs for execution.
Qualifications
Bachelor’s Degree in Engineering, Computer Science, or related field
8–10 years of experience in network or data center operations and automation / system integration
Strong understanding of IP networking and broadband architecture, and of data center infrastructure (power, cooling, monitoring)
Experience with automation tools (Python, Ansible, scripting), API integrations and system orchestration, monitoring and observability platforms, and AI/ML-enabled operations (AIOps) concepts and data pipelines
Preferred Skills: DCIM tools, cloud environments (AWS / GCP / Azure), TR-069 / TR-369 (device management), and familiarity with data platforms (e.g., telemetry systems, data lakes)