Must Have Technical/Functional Skills:
- Define the target architecture for AI enabled AMS: telemetry ingestion, unified observability (metrics, logs, traces), event correlation, and signal driven operations.
- Design an agentic orchestration layer (tooling agnostic) to coordinate AIOps, LLM agents, runbooks, and ITSM workflows.
- Establish integration patterns with ITSM (e.g., ServiceNow/Jira), chat/collab platforms, CMDB, CI/CD, and cloud monitors.
- Create data contracts and a normalized telemetry schema across apps/infra/cloud/edge; ensure lineage and data quality for AI features.
- Own data governance: access controls, retention, PII handling, encryption, auditability, and lawful basis for processing.
- Implement Responsible AI guardrails: policy checks, red teaming, bias tests, safety filters, and human in the loop approvals for impactful actions.
- Define next gen roles: AI Ops Engineer, Agent Supervisor, Prompt/Runbook Engineer, Reliability Engineer.

