Senior Site Reliability Engineer (M/F)

Arcadia is looking for a Staff Site Reliability Engineer in Chennai, India, to lead AWS infrastructure projects and support and mentor the SRE team. The role covers end-to-end ownership of SRE projects, Kubernetes operations, CI/CD pipeline improvements, and automation. Benefits include a remote-first hybrid model, flexible leave, family health insurance, and an inclusive company culture focused on clean energy.

Arcadia · Chennai, Tamil Nadu, India · Hybrid · Full-time · UTC+05:30

Arcadia

Company overview

Name: Arcadia

Headquarters: Greenwood Village, Colorado, United States

Founded: 2014

Size: Approximately 300 employees (source: linkedin.com). Revenue figures for 2023 or later are not publicly disclosed, but the company has raised $575.5 million across 16 funding rounds, including a $50 million raise in April 2024 at a $1.5 billion valuation (source: texau.com).

What they do

Arcadia is a pioneering platform that simplifies consumer access to clean energy, focusing primarily on community solar subscriptions. Founded in 2014, the company initially aimed to connect homes and businesses to local solar farms with no upfront installation costs (source: research.contrary.com). Over the years, Arcadia has expanded its offering to include Arc, an energy management software product for businesses that aggregates data from nearly 10,000 US utilities, covering 95% of residential and commercial accounts (source: prnewswire.com). The company emphasizes AI-based analytics and utility data management, serving a diverse customer base that includes consumers, small businesses, and Fortune 500 companies (source: arcadia.com).

Projects and track record

Arcadia has a significant track record in managing community solar projects, with more than 2 GW of capacity across 1,000 projects in 16 states, serving more than 300,000 residential equivalents (source: prnewswire.com). The company is recognized as the largest community solar manager in the United States, with plans to increase its capacity to more than 3 GW through new initiatives (source: cbinsights.com). Its key customers include notable Fortune 500 companies such as Iron Mountain, Adobe, and UPS, underscoring Arcadia's ability to deliver return on investment for developers and savings for various user segments (source: esgtoday.com). The company also works with more than 300 service providers to improve its offerings and extend its geographic reach (source: prnewswire.com).

Recent developments

Over the past two years, Arcadia has made significant advances in funding and acquisitions to strengthen its market position. In April 2024, the company secured $50 million in funding at a $1.5 billion valuation, along with a $30 million credit facility from JPMorgan Chase to support its community solar and AI innovations (source: texau.com). In addition, Arcadia acquired RPD Energy, enhancing its energy procurement advisory services nationwide (source: esgtoday.com). In March 2025, the company announced a joint venture with Perch to manage 3 GW of community solar across 16 states, consolidating its leadership in the sector (source: cbinsights.com).

Working there

Arcadia offers roles across various departments, including software development, energy industry expertise, product management, research and development, legal, finance, and operations. The leadership team consists of experienced professionals, including CEO Kiran Bhatraju and other key executives (source: arcadia.com). The company culture is guided by a mission of decarbonization through technological innovation, and it emphasizes a collaborative environment that supports rapid growth (source: aws.amazon.com). Although specific benefits are not detailed, the company's unicorn status and its focus on AI suggest competitive benefits for employees (source: arcadia.com).

Last updated on Feb. 23, 2026

Arcadia is the AI-powered energy intelligence platform for businesses. We replace fragmented tools and manual workflows with one platform to pay utility bills, buy energy, and advance sustainability - across every location, at enterprise scale.

Trusted by Fortune 2000 companies, Arcadia combines unified data, AI-powered analytics, and expert advisory to help enterprise teams save money, mitigate risk, and cut carbon.

We deliver this through three comprehensive solutions:

Utility Bill Management: Automating the entire utility bill lifecycle - from data capture and validation to payment processing and auditing.
Energy Procurement Advisory: Bringing together comprehensive data, AI-powered analytics, market expertise, and a strong partner network to make sophisticated procurement options accessible to all.
Sustainability Reporting: Verified emissions data with seamless integration into leading sustainability platforms.

Tackling the world's most complex energy challenges requires diverse thinking. We're building teams of people from different backgrounds, industries, and disciplines - united by a belief that energy management should be simple, intelligent, and a genuine driver of business value.

What we're looking for

We are seeking a Staff Site Reliability Engineer (L4) to join our SRE/Platform Engineering team in India. This is a senior technical leadership role - not people management, but engineering leadership through execution, mentorship, and architectural ownership.

Our India SRE team is growing, and this role is central to that growth. As we scale, we need a technical anchor in the India timezone who can independently own multi-week SRE projects from problem statement to production, make sound architectural decisions under ambiguity, and elevate the team around them. You will be the person engineers lean on for design reviews, debugging escalations, and "how should we approach this?" conversations. You'll bring the depth and experience to drive execution autonomously in the India timezone while collaborating closely with US-based SRE leadership on roadmap priorities, incident response, and platform strategy.

This is a role for someone who doesn't wait for direction - you identify reliability gaps, propose solutions, build consensus, and ship.

Our infrastructure is primarily AWS-based, managed by Terraform and CloudFormation, and deployed using CI/CD best practices. In your application, please include a link to GitHub or another place where your code is published, though we understand that not everyone has public code online.

What you'll do

Own and deliver SRE projects end-to-end - from scoping and design through implementation, testing, rollout, and documentation
Serve as a technical anchor for the India SRE team - conduct design reviews, pair on complex debugging, and mentor engineers to develop the judgment to work through ambiguous problems independently
Design and implement infrastructure solutions across AWS (EKS, VPC, RDS, IAM, CloudWatch, CloudTrail, GuardDuty, S3, CloudFront, Lambda, SQS) using Terraform and CloudFormation, with an emphasis on making the right tradeoffs between speed, reliability, and cost
Lead Kubernetes operations including cluster upgrades, capacity planning, CNI troubleshooting, workload scaling, Helm chart packaging, and GitOps deployments - and build the runbooks and automation so these become repeatable rather than one-off heroics
Evolve CI/CD pipelines across Jenkins (Groovy scripting), GitHub Actions, AWS CodePipeline, ArgoCD, and FluxCD - with an emphasis on reducing manual deployment steps and improving rollback safety
Drive observability stack enhancements - deliver the infrastructure and architectural direction necessary for engineering teams to leverage Prometheus, Grafana, and CloudWatch effectively
Identify and execute FinOps initiatives - find zombie resources, right-size instances, enforce tagging standards, and present cost-reduction recommendations with data to back them up
Manage database reliability across MySQL and PostgreSQL including backup validation, performance tuning, replication health, failover testing, and operational runbooks
Strengthen security posture through IAM least-privilege enforcement, CSPM reviews, GuardDuty/CloudTrail monitoring, secrets management (Vault, AWS Secrets Manager, Parameter Store), and audit readiness
Troubleshoot complex cross-cutting production issues spanning networking, Kubernetes, compute, databases, and CI/CD - and then turn the fix into a runbook or automation so the same issue doesn't require the same person next time
Write the documentation the team actually needs - architecture decision records, operational runbooks, troubleshooting guides, and post-incident action items that get closed, not just filed
Collaborate daily with US-based SRE leadership on incident reviews, migration planning, roadmap execution, and platform strategy - bringing context and recommendations, not just status updates
Participate in on-call rotations and drive post-incident analysis with a focus on systemic fixes over individual blame

What will help you succeed

Must-haves

10-14 years of experience in SRE/DevOps/Cloud Engineering, with a demonstrated progression from task execution to project ownership - we're looking for evidence that you have independently scoped, designed, and delivered infrastructure projects end-to-end
Deep, hands-on expertise with AWS - EKS, IAM, RDS, EC2, VPC, CloudWatch, CloudTrail, GuardDuty, Lambda, SQS. You should be able to architect a multi-AZ, multi-account solution and explain why you made the choices you made
Strong Terraform skills with experience managing complex, multi-environment state, writing reusable modules, and reviewing others' IaC for correctness and maintainability
Advanced Kubernetes knowledge - you don't just deploy to K8s, you troubleshoot networking issues at the CNI level, tune resource requests and limits based on actual usage data, and can plan and execute cluster upgrades with minimal downtime
CI/CD pipeline design and ownership across Jenkins (Groovy), GitHub Actions, ArgoCD, or FluxCD - with a track record of improving deployment reliability and reducing manual steps
Observability stack experience with Prometheus, Grafana, Datadog, or equivalent - including defining SLOs/SLIs, building meaningful dashboards, and tuning alerting to reduce noise
Proven mentorship ability - you have helped less experienced engineers grow. This could be formal (tech lead role, code review ownership) or informal (the person everyone goes to when they're stuck). We will ask you about this in interviews
Strong written and verbal communication skills - you will interact with US-based teams daily, present proposals asynchronously, and write documentation that others can actually follow
Automation-first mindset - your instinct when you do something manually is to immediately think about how to script it. You have a track record of reducing operational toil through scripting and tooling
Incident management experience - you have led or significantly contributed to incident response and post-incident reviews in production environments, and you understand the difference between fixing the symptom and fixing the system
Ability to operate with autonomy - you don't need daily direction. Given a problem space and constraints, you can propose an approach, pressure-test it with peers, and execute

Nice-to-haves

Experience with FinOps practices - cloud cost analysis, rightsizing, tagging governance, reserved instance planning
Exposure to secrets management platforms (HashiCorp Vault, AWS Secrets Manager)
Experience with event-driven architectures using AWS Lambda, CloudWatch Events, SQS, and SNS
Exposure to AI-enabled tooling (automation assistants, MCP, RAG pipelines, LLM-based debugging)
Experience with data warehouses (Snowflake) and their operational requirements
Experience with n8n or similar workflow automation platforms
Industry certifications - AWS Solutions Architect Professional, CNCF CKA/CKS, HashiCorp Terraform Associate, or equivalent
Experience working in a company that has grown through acquisitions, with exposure to consolidating disparate infrastructure environments

Benefits

Competitive compensation based on market standards
Hybrid working model with a remote-first policy
In addition to a fixed base salary, candidates are eligible for the following benefits:
Flexible Leave Policy
Office located in the heart of the city, should you need to come in for any reason
Comprehensive coverage, including accident and life insurance
Medical Insurance (1+5 Family Members)
Flexible Benefit Plan
Awards and bonuses
Annual performance cycle
Quarterly engagement activities

A supportive engineering culture that values diversity, empathy, teamwork, trust, and efficiency

Eliminating carbon footprints, eliminating carbon copies.

Here at Arcadia, we cultivate diversity, celebrate individuality, and believe unique perspectives are key to our collective success in creating a clean energy future. Arcadia is committed to equal employment opportunities regardless of race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, disability, genetic information, protected veteran status, or any status protected by applicable federal, state, or local law.

About the role

Posted: May 27, 2026

Job type: Full-time

Organization type: Company

Last updated: June 2, 2026

Workplace type: Hybrid

Energy sector: Smart grids

Arcadia

Homepage: arcadia.com

Location: Chennai, Tamil Nadu, India

Experience level: 10-14 years

Time zones: UTC+05:30

Tags: Energy Management · Grid Integration · Cloud Computing
