Rally Ventures unites an intersecting portfolio of companies at the frontier of business technology.

Discover job opportunities across our portfolio.

Staff Cloud Engineer



Software Engineering
Mountain View, CA, USA
Posted on Saturday, June 1, 2024
Harness is a high-growth company that is disrupting the software delivery market. Our mission is to enable the 30 million software developers in the world to deliver code to their users reliably, efficiently, securely and quickly, increasing customers’ pace of innovation while improving the developer experience. We offer solutions for every step of the software delivery lifecycle to build, test, secure, deploy and manage reliability, feature flags and cloud costs. The Harness Software Delivery Platform includes modules for CI, CD, Cloud Cost Management, Feature Flags, Service Reliability Management, Security Testing Orchestration, Chaos Engineering, Software Engineering Insights and continues to expand at an incredibly fast pace.
Harness is led by technologist and entrepreneur Jyoti Bansal, who founded AppDynamics and sold it to Cisco for $3.7B. We’re backed with $425M in venture financing from top-tier VC and strategic firms, including J.P. Morgan, Capital One Ventures, Citi Ventures, ServiceNow, Splunk Ventures, Norwest Venture Partners, Adage Capital Partners, Balyasny Asset Management, Gaingels, Harmonic Growth Partners, Menlo Ventures, IVP, Unusual Ventures, GV (formerly Google Ventures), Alkeon Capital, Battery Ventures, Sorenson Capital, Thomvest Ventures and Silicon Valley Bank.

Position Summary

Harness seeking a highly skilled Cloud Engineer to join our team at our Mountain View location. As a cloud engineer, you will play a pivotal role in ensuring the smooth operation of our systems by effectively managing incidents and enhancing our observability capabilities through the use of Prometheus/Grafana. In collaboration with our Platform Engineering team, you will have the opportunity to contribute to the design and implementation infrastructure required to support the scalable growth of our Harness platform. This is an excellent opportunity for individuals with 5-7 years of experience as a Site Reliability Engineer (SRE) in GCP/AWS, specifically with a strong background in Kubernetes and HELM and a proven track record in working with observability tooling.

About the role

  • Design and implement scalable and reliable cloud solutions on GCP / AWS.
  • Collaborate with development teams to provide guidance and support in adopting cloud-native architectures and best practices.
  • Deploy and manage cloud resources, including virtual machines, containers, and storage systems, to meet application requirements.
  • Develop and automate infrastructure-as-code (IaC) using tools like Terraform or Deployment Manager.
  • Implement and optimize CI/CD pipelines for continuous integration and deployment of cloud applications.
  • Monitor and troubleshoot cloud infrastructure, including performance monitoring, log analysis, and incident response.
  • Implement security best practices and ensure compliance with data protection and privacy regulations.
  • Collaborate with the operations team to define and implement backup, disaster recovery, and business continuity strategies.
  • Stay updated with the latest developments and advancements in GCP services and recommend their adoption when appropriate.
  • Participate in capacity planning, performance optimization, and cost management activities.
  • Provide technical expertise and support to internal teams and stakeholders regarding GCP-related initiatives.

About you

    • 5+ years of experience in Cloud Engineering / SRE roles
    • Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent experience).
    • Demonstrable background in Prometheus/Grafana
    • Proven experience as a Cloud Engineer with experience in GCP and/or AWS
    • In-depth knowledge of GCP or AWS services, including Compute Engine, Kubernetes Engine etc
    • Strong understanding of cloud-native architectures, microservices, and containerization (Docker, Kubernetes).
    • Proficiency in scripting and automation using languages like Python, PowerShell, or Bash.
    • Experience with infrastructure-as-code tools such as Terraform, Deployment Manager, or Cloud Deployment Manager.
    • Familiarity with CI/CD concepts and tools (e.g., Jenkins, GitLab CI/CD, Cloud Build).
    • Solid understanding of networking principles and experience with VPCs, VPNs, load balancers, and firewall rules.
    • Knowledge of security best practices and experience implementing security controls in a cloud environment.
    • Experience with monitoring and logging tools (e.g., Stackdriver, Prometheus, ELK stack).
    • Strong problem-solving skills and the ability to troubleshoot complex issues in a distributed environment.
    • Excellent communication and collaboration skills to work effectively with cross-functional teams

Work Location

  • Mountain View, CA - Hybrid - 3 days per week

What you will have at Harness

  • Competitive salary
  • Comprehensive healthcare benefits
  • Flexible Spending Account (FSA)
  • Flexible work schedule
  • Employee Assistance Program (EAP)
  • Flexible Time Off and Parental Leave
  • Monthly, quarterly, and annual social and team building events
  • Monthly internet reimbursement
Pay transparency
$165,000$195,000 USD