With 15+ years supporting mission-critical platforms in finance, messaging, and defense sectors, I help teams reduce incident resolution time, build reliable monitoring, and automate repetitive operational tasks. Available for contract, part-time, and project-based engagements.

πŸ“Š

Splunk & Observability

  • SPL query development for log analysis and spam investigation
  • Alert configuration and dashboard creation
  • Prometheus + Grafana setup and PromQL alert rules
  • Dynatrace and EM7 integration for APM
☸️

Kubernetes & DevOps

  • EKS/AKS cluster design and deployment manifests
  • CI/CD pipeline setup (GitHub Actions, Jenkins)
  • Docker containerization and multi-stage builds
  • Infrastructure as Code with Terraform
☁️

Cloud Infrastructure (AWS)

  • AWS architecture design and migration planning
  • EC2, S3, Lambda, and CloudWatch configuration
  • IAM policies and security best practices
  • Cost optimization and resource right-sizing
🐧

Linux & Automation

  • Advanced log analysis with grep, awk, sed
  • Python and Bash automation scripts
  • Health check watchdogs and auto-remediation
  • Performance tuning and troubleshooting
πŸ€–

AI & Intelligent Operations

  • LLM-assisted incident summarization and RCA drafts
  • Log anomaly detection with Python/ML
  • AWS Bedrock and SageMaker integration guidance
  • AI workflow automation for support teams

Let's Work Together

Whether you need a production support engineer, monitoring specialist, or DevOps consultant β€” reach out to discuss your project.