hero

Come build with us

26
companies
446
Jobs

Senior Site Reliability Engineer

Arcadia

Arcadia

Software Engineering
Chennai, Tamil Nadu, India
Posted on Feb 9, 2024

Lead Engineer – Site Reliability Engineering

Who We Are

Arcadia is a technology company empowering energy innovators and consumers to fight the climate crisis. Our software and APIs are revolutionizing an industry held back by outdated systems and institutions by creating unprecedented access to the data and clean energy needed to make a decarbonized energy grid possible.

In 2014, Arcadia set out on its mission to break the fossil fuel monopoly and since then we have been knocking down the institutional barriers to unlock decarbonization. To date, we have connected hundreds of thousands of consumers and small businesses with high-quality clean energy options. Fast forward to today, and now, we're thinking even bigger. We have launched Arc, an industry-defining SaaS platform that empowers developers and energy innovators to deliver their own custom, personalized energy experiences, accelerating the transformation of the industry from an analog energy system into a digitized information network.

Tackling one of the world's biggest challenges requires out-of-the-box thinking & diverse perspectives. We are building a team of individuals from different backgrounds, industries, & educational experiences. If you share our passion for ushering in the era of the clean electron, we look forward to learning what you would uniquely bring to Arcadia! Visit www.arcadia.com.

HO: Washington, DC

$1.5B valuation: $380 million funding to date

Job Summary:
As a Lead Engineer - Site Reliability Engineering, you will directly contribute to democratizing access to clean energy by building the technology and infrastructure that make it happen. You’ll work across the infrastructure and application stack contribute to scalable system and dive headfirst into technical material. In doing so you will unlock a more human relationship with energy; accelerating everyone’s agency to choose renewables and hopefully stabilize our climate before its too late.

What we’re looking for:
We are seeking a curious and resourceful Site Reliability Engineer to join our Chennai SRE team. The ideal candidate is a low-Ego team player who has background building scalable web infrastructure strongly believes in infrastructure as code and relishes the chance to take on a highly visible role within a collaborative engineering team. We are looking for an inquisitive problem-solver who approaches engineering problem and potential solution with a unique holistic and long-term perspective and is genuinely excited to build and support software expanding renewable energy access to millions of households across the country.

This person will report to an Engineering manager in Chennai and will also collaborate closely with SRE team members in the US. This is an exceptional opportunity for someone who relishes the chance to engage with cutting-edge technology, influence how our team builds and stays relevant and work in a fast-paced environment. Our engineering values are deeply ingrained in our culture-- you can read more about them here.

Our infrastructure is primarily AWS- based managed by Terraform and CloudFormation and deployed using best CI/CD practices. In your application, please include a link to GitHub or another place where your code is published, though we understand that not everyone has public code online.

What you’ll do:

  • Partner with Engineering product and other stakeholders to deliver new application feature third party tooling and functionality through automated testing and deployment.
  • Design implement & maintain the architecture of scalable backend services that can scale with demand & remain resilient during times of crisis.
  • Help evolve and maintain our application infrastructure using Terraform CloudFormation Kubernetes helm charts and exploring new technologies with the team that can expanded on the reliability and security of our system.
  • Mentor and guide engineering teammates empowering them to design superior services & then remain accountable for those services.
  • Mentor fellow SRE engineer for them to be able to grow in their skill and remain fully accountable for their respective services within their role.
  • Author document and maintain business critical infrastructure-as-code.

What will help you succeed:

  • 8+ Years of Experience as a Site Reliability DevOps or system engineer supporting high-availability large-scale web-based applications.
  • Experience with Terraform CloudFormation or similar
  • Experience managing and maintaining a resilient, fault-tolerant, containerized cloud infrastructure (ideally Kubernetes on AWS) where software is deployed via Cl/CD pipelines, GitOps
  • Experience with infrastructure & service monitoring and alerting
  • Ability to translate complex technical concepts into clear, actionable information
  • Comfortable managing the balance between deploying necessary infrastructure changes quickly and shipping perfect infrastructure updates
  • Flexible to jump on to calls, roll up the sleeves and take ownership as necessary during system outages and incidents, and then participate in Incident Reviews once resolved
  • Ability to scope, prioritize, and deliver on project commitments
  • Ability and internal drive to problem-solve, both creatively and pragmatically
  • Skill with mentoring and learning from other engineers, treating colleagues with respect, and guiding them through challenging tradeoffs to create scalable and reliable solutions
  • Passion for our mission, sustainability, and drive a clean-energy future

Nice-to-have:

  • Experience with common web frameworks and their deployment patterns
  • Experience with Jenkins & GitHub Actions for CI/CD pipelines and scheduling
  • Experience working with data warehouses (Redshift, BigQuery, Snowflake etc.)
  • Experience with using various data stores including PostgreSQL on RDS, Aurora, Dynamo and Elastic search.
  • Experience with application observability and alerting
  • Experience managing event-driven architectures with AWS Lambda, CloudWatch, and SQS
  • Industry certifications = AWS Solutions Architect Associate+, CNCF CKA, or relevant.

Benefits:

  • Competitive compensation based on market standards.
  • We are working on a hybrid model with remote first policy
  • Apart from Fixed Base Salary potential candidates are eligible for following benefits
  • Flexible Leave Policy
  • Office is in the heart of the city in case you need to step in for any purpose.
  • Medical Insurance (1+5 Family Members)
  • Annual performance cycle
  • Quarterly team engagement activities and rewards & recognitions
  • L&D programs to foster professional growth
  • A supportive engineering culture that values diversity, empathy, teamwork, trust, and efficiency

Eliminating carbon footprints, eliminating carbon copies.

Here at Arcadia, we cultivate diversity, celebrate individuality, and believe unique perspectives are key to our collective success in creating a clean energy future. Arcadia is committed to equal employment opportunities regardless of race, colour, religion, gender, sexual orientation, gender identity or expression, national origin, age, disability, genetic information, protected veteran status, or any status protected by applicable federal, state, or local law. While we are currently unable to consider candidates who will require visa sponsorship, we welcome applications from all qualified candidates eligible to work in India.

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request an accommodation.

Thank you