View all jobs

Senior or Lead Site Reliability Engineer

Atrilogy Solutions Group’s direct client is searching for a Site Reliability Engineer (100% Remote) to join their team for a permanent position.

Position: Site Reliability Engineer
Location: Remote, Remote, Remote
Duration: Full Time Role

Perform software development, refine requirements, write, test, debug, maintain, and release test software that instructs a computer to accomplish certain tasks, such as saving information, performing calculations. The Software Engineer may also be responsible for managing software development infrastructure and providing the appropriate level of functional and non-functional documentation to meeting the product and engineering requirements. As necessary, this position may be called upon to assist in performing on-site client work related to consulting projects or provide technical support as required.

WHAT WILL YOU DO?  Improve service reliability through root cause analysis, blameless postmortems, and using code to prevent or respond to problem recurrence. Be the central point of contact on the health of systems to increase product reliability and organizational efficiency. Be a part of an on-call rotation for responding to customer-facing emergencies. Continuous improve production incident response and triage practices Document the details of incident, root cause, resolution and solution Participate in planning, standup, and retrospective meetings. Timely respond to incidents and to customer inquiries Effective collaboration with internal and external customers Participate in reviews and operational readiness for services and infrastructures Maintain deep technical and business knowledge of system architectures, ensuring continuous upgrade and integration of new capabilities. Perform deep dives into both systemic and latent reliability issues Perform service failure analysis across the entire application stack: front-end, back-end, cloud services, databases. Promote modern and standard methodology for everything from monitoring to troubleshooting complex code issues. Support releases activities and production environment testing Design mechanisms for alerts and responses to identify and address reliability risks. Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, planning, and reviews Maintain services once they are live by measuring and monitoring availability, latency and overall system health. Design and run performance, capacity, and monitoring tests. Create educational material such as cloud native sample apps and starter code, as well as contribute to holding cloud native educational events like hackathons and live coding sessions. Create educational documentation on how-to's and best practices, and blog about use-cases and architectures that relate to cloud platforms Liaise with the team managing our public cloud environments, including setup, management, and troubleshooting



Bachelor's Degree computer science
Minimum 7 years’ experience working within Software Product Development. Minimum 2 year experience working in a software design or architect capacity.
Minimum 3 year working with cloud software deployment.
Minimum 3 year experience working in DevOps environments.

Skills and Knowledge

Strong communication skills, ownership, and drive. 
Strong passion in diagnosis software and systems in a distributed, internet-scale Linux environment. 
Significant expertise in enterprise Java platform and modern technologies like Spring boot, Node.js, and open source technologies 
Ability to deep dive in existing infrastructure and software to assist in solving problems Solid understanding of systems and application design, including the operational trade-offs of various designs. 
Development or support experiences in these areas: application and data security, batch processing, finance platforms Ability to deep dive in existing infrastructure and software Strong cloud experience. 
Practical knowledge of various aspects of service design like messaging protocols & behavior, caching strategies and software design practices. 
Demonstrable knowledge of TCP/IP, HTTP, web application security, and experience supporting multi-tier web application architectures. 
Ability to prioritize tasks and work independently. Be adaptable and able to focus on the simplest, most efficient & reliable solutions. 
BS/MS degree in Computer Science, Engineering, or equivalent experience Java 8+, JavaScript, Linux proficiency Spring boot, NodeJS, React or Angular MySQL, Postgres Dynatrace, graylog or equivalent Data Processing and ETL experience 
Full SDLC experience in enterprise development environment 
Strong troubleshooting capabilities Test automation and continuous integration AWS technology experience is a must

Please let me know if you have any questions.
For immediate consideration please submit your resume in Word format, along with daytime contact information.  LOCAL CANDIDATES ONLY PLEASE unless you are willing to relocate yourself at your own expense.  Client is unable to provide H-1B Visa sponsorship at this time. All submittals will be treated confidentially.  Selected candidate may be asked to complete a comprehensive background, credit and/or drug screening.  Principals only, no third parties please.

Atrilogy Solutions Group, Inc. (est. 2000), in partnership with Peak17 Consulting (est. 2008), provides organizations of all sizes with high-quality, cost effective information technology (IT) staffing services. 
Atrilogy has been recognized by Inc. magazine as one of the nation’s fastest-growing, privately held companies. Headquartered in Irvine, California, Atrilogy also has offices in Denver, Phoenix, & Atlanta with satellite offices in Boston, Jersey City, Las Vegas, and Delhi, India.
Clients turn to Atrilogy for expertise in:
  • IT staffing and placement such as Project Managers, Agile/Scrum Masters, Business Analysts, DBAs, Software Engineers, Mobile Developers (iOS, Android), DevOps, Automation, QA, Systems & Network Engineers, Cyber Security / Information Security Specialists, ERP, CRM, Business Intelligence, Data Warehousing, Big Data and Creative (UI/UX, Web Design)
 Clients turn to Peak17 for expertise in:
  • Operational staffing and placement of Accounting/Finance, Human Resources, and Marketing professionals, as well as Information Technology resources.

Atrilogy Solutions Group and Peak17 Consulting are Equal Opportunity Employers. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, gender expression, national origin, protected veteran status, or any other basis protected by applicable law, and will not be discriminated against on the basis of disability.
In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification document form upon hire.

Powered by