hero

Skyview Ventures Portfolio Careers

We are driving Energy Independence in North America, come join us.

Production Support Engineer

Fermata Energy

Fermata Energy

Software Engineering, Product, Customer Service
Remote
Posted on Jul 23, 2024

If you're passionate about troubleshooting complex technical issues and are eager to make a positive impact in the renewable energy sector, we'd love to hear from you! Apply now to join our team at Fermata Energy and help drive innovation in the energy industry.

Fermata Energy is a pioneering energy technology company committed to revolutionizing the renewable energy sector. Our innovative solutions empower grid operators, businesses, and consumers to optimize energy usage, enhance grid stability, and accelerate the transition to sustainable energy sources.

We are seeking a talented and proactive Production Support Engineer to join our Operations team at Fermata Energy. In this critical role, you will be responsible for diagnosing and resolving production issues, ensuring the reliability and performance of our energy management systems, and supporting the operational needs of the organization. You will collaborate closely with cross-functional teams to identify root causes, implement solutions, and drive continuous improvement in our production environment.

Key Responsibilities:

  • Incident Management: Serve as the primary point of contact while engaging with a cross-functional team to prioritize and respond to production incidents including system outages, performance degradation, and service disruptions according to established SLAs, ensuring timely resolution and effective communication with stakeholders.

  • Debugging and Root Cause Analysis: Utilize a combination of debugging tools, logs, and diagnostic techniques to identify root causes, contributing factors, and potential areas of improvement. Either address the issue directly or create a ticket with all supporting evidence for another team member to address and fix software bugs, configuration errors, and infrastructure issues affecting production systems.

  • Hardware Troubleshooting: Respond to reported hardware faults by pulling logs, understanding the issue, and either addressing the issue (most commonly by a restart) or forwarding relevant information to the Systems team.

  • System Triage: Support Operations, Systems, and Software teams to prioritize, estimate, and track resources for service issue resolution.

  • Post Mortems: When relevant, conduct post mortems to reflect on past issues and discuss potential solutions to mitigate issues in the future.

  • Monitoring and Alerting: Implement and maintain monitoring tools and alerting systems to proactively detect and mitigate potential production issues, and optimize system performance and reliability.

  • Documentation and Knowledge Sharing: Document troubleshooting procedures, resolutions, and best practices to facilitate knowledge sharing and enable the Operations team to respond effectively to future incidents.

  • Continuous Improvement: Identify opportunities for process improvements, automation, and efficiency gains in production support activities, and work collaboratively with the Operations team to implement enhancements.

  • Training and Enablement: Provide training and support to internal teams on debugging techniques, troubleshooting methodologies, and production support best practices to build operational excellence across the organization.

Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent work experience).

  • Proven experience in a production support, site reliability engineering, or similar role, with a strong track record of diagnosing and resolving complex technical issues in a fast-paced environment.

  • Solid understanding of software debugging techniques, including log analysis, stack traces, and code profiling, as well as experience with debugging tools and IDEs.

  • Proficiency with Grafana stack tooling such as Grafana, Loki, Mimir and Tempo including the ability use LogQL and PromQL to query logs and metrics

  • Experience with networking protocols and TCP/IP including troubleshooting modems / configurations

  • Experience with Single Board Computers (eg Raspbrery Pi, Arduino, etc)

  • Strong analytical and problem-solving skills, with the ability to quickly understand and troubleshoot complex systems and applications.

  • Excellent communication and collaboration skills, with the ability to work effectively in a cross-functional team environment and communicate technical concepts clearly to both technical and non-technical stakeholders.

  • Familiarity with containerization technologies such as Docker and orchestration tools like Kubernetes

  • Experience with cloud infrastructure providers such as AWS, Azure, or GCP is a plus.

  • Experience with energy management systems or the renewable energy sector is a plus.

  • Experience with connectivity, telematics, and other communication systems is a plus.

  • Coding experience with languages such as Python, Java, and Scala is a plus

Fermata Energy is an Equal Opportunity Employer and complies with all applicable federal, state, and local fair employment practices laws. Fermata Energy strictly prohibits and does not tolerate discrimination against employees, applicants, or any other covered persons because of race, color, religion, creed, national origin or ancestry, ethnicity, sex (including pregnancy), gender (including gender nonconformity and status as a transgender individual), age, physical or mental disability, citizenship, past, current, or prospective service in the uniformed services, genetic information, or any other characteristic protected under applicable federal, state, or local law.

** Although all of our opportunities are remote, candidates must be based in the U.S. and have the legal right to work in the U.S. in order to be considered.