Job Details


Reliability Engineer - $40 Billion Hedge Fund

Job Info:


Category: Development
Company Description: More than 1,000 people who believe the scientific method is the best way to approach investing. Ideas backed up with information. And improved by iteration.
Location: Houston, TX
Job Number: 9273

Job Description:


Our client operates at scale. Their data analysis, modeling and trading systems all operate in environments where growth is continuous and stable operation at scale is critical. Software engineers within our client's Reliability Engineering Technology team are charged with developing the tools which make scalable distributed systems possible. These tools include test frameworks and infrastructure; logging, monitoring, and metrics collection; dashboarding and alerting; and coordination and deployment for distributed systems. As a member of this group of versatile engineers, your remit will include:

  • Acting as the technology arm of Reliability Engineering – our client's core DevOps organization;
  • Developing or improving foundational technology used by our client’s engineering teams to build distributed services;
  • Improving all aspects of software reliability, including better monitoring, alerting and documentation;
  • Engaging with our software engineering teams on improvements to our tools, processes and software;
  • Support of some core services used by our client’s engineering teams;
Requirements include:
  • A bachelor’s degree in computer science or another highly technical, scientific discipline.
  • Ability to program (structured and OO) with one or more high level languages (such as Python, Java, C/C++).
  • Familiarity with open source tools used for deployment, logging and monitoring (e.g. Ansible, Elastic Search, InfluxDB, Prometheus)
  • Familiarity with resource management frameworks such as Mesos, Kubernetes and Yarn
  • A proven track record of automation and an algorithmic approach to solving problems.

Additional skills preferred:

  • In-depth knowledge and experience in at least one of: host based networking, linux/unix administration, systems programming, distributed systems, databases, cloud computing, and a desire to learn more.
  • A proactive approach to spotting problems, areas for improvement, performance bottlenecks, etc.
  • An understanding of the operational concerns in a demanding environment; ideally, but not necessarily, finance.
  • The ability to understand the inherent trade-offs between various software architectures as it relates to performance, resiliency/fault tolerance, load balancing, data consistency.



All qualified candidates are encouraged to apply by submitting their resume as an MS word document including a cover letter with a summary of relevant qualifications, highlighting clearly any special or relevant experience.


Please Note: All inquiries will be treated with the utmost confidentiality. Your resume will not be submitted to any client company without your prior knowledge and consent.


Contact Recruiter
maureen.lei@andiamogo.com
Senior Technical Recruiter
Andiamo Partners | 90 Broad Street, Suite 1501, New York, NY 10004


Share Share this Job