BroomfieldRecruiter Since 2001
the smart solution for Broomfield jobs

Senior Site Reliability Engineer (JoinOCI-SDE)

Company: Oracle
Location: Broomfield
Posted on: January 10, 2021

Job Description:

The Oracle IT (OIT) group in the Oracle Cloud Infrastructure (OCI) organization is seeking a motivated Senior Site Reliability Engineer who thrives in a fast-paced, rapidly evolving technology environment. This individual will be a member of the SRE Infrastructure services team, focused on driving quality standards across all of OIT. As part of the Operational Engagement programs, you will be instrumental in fostering a culture of SRE for horizontal activities and DevOps for products and tools across our global operations teams.

The team you work in will have diverse expertise in systems, networking, and software development to provide the stability, performance, and reliability our customers need. We work with multiple service development teams, identifying cross-team issues that create risk for operations across the organization and resolving those issues with a mixture of engineering, troubleshooting expertise, and general operational guidance. Your role also requires communication and organizational skills: you are an interface between DevOps Tools and the application teams that implement OCI services. You will deliver solutions that directly contribute to our internal customers' success. Alongside software and tools development, you will be required to automate systems and networking tasks running on virtualized and non-virtualized platforms in the cloud.
Other duties include researching and proofing OCI cloud services and their features to improve operations, and authoring technical documentation that is beneficial to the company and the team.

What will you do
* Articulate technical characteristics of services and technology areas and guide development teams to engineer and add capabilities to internal OIT tools.
* Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs).
* Utilize a deep understanding of service topology and the dependencies required to troubleshoot issues and define mitigations.
* Understand and explain the effect of product architecture decisions on systems.
* Serve as part of a 24x7 on-call rotation in support of the infrastructure life cycle.
* Bring professional curiosity and a desire to develop a deep understanding of services and technologies.
* Use your experience and wisdom from building and running systems and infrastructures as a multiplier to drive operational improvements into OIT, its Service Teams, and its Services.
* Use your excellent written and oral communication skills to ask pertinent questions and to assess, aggregate, and report the responses.
* Quickly grasp and analyze new or new-to-you systems that are complex and rapidly changing.
* Educate yourself and others on anything that helps Service Teams more quickly and easily build, test, deploy, and run their Services to be more reliable.
* Identify problems and opportunities for improvement that are common across many teams and services.
* Optimize applications for maximum speed and scalability.
* Collaborate with other team members and stakeholders.

Qualifications

Mandatory Qualifications
* 10+ years of experience in compute, network, storage, and database troubleshooting to improve capacity, reliability, scalability, and availability while working as a site reliability engineer
* Bachelor's or Master's degree in Computer Science or equivalent related field experience
* Experience with Python, including object-oriented programming
* Experience working with fault-tolerant, highly available, high-throughput, distributed, scalable systems
* Aptitude to be a good team player and the desire to learn and implement new cloud technologies as needed
* Excellent organizational, verbal, and written communication skills
* Good understanding of Agile software development principles, including using common tools such as JIRA

Preferred
8+ years of experience in four or more of the following:
* Experience and knowledge of various database internals, benchmarks, and testing in the cloud, including migration expertise
* Developing and operating large-scale distributed services and applications
* System administration, including Linux internals, TCP/IP, DNS, load-balancing technologies, and Windows internals
* Cloud network experience
* Container administration and development utilizing Kubernetes, Docker, Mesos, or similar
* Infrastructure automation through Terraform, Chef, Ansible, Puppet, Packer, or similar
* Knowledge of cloud compute technologies and network monitoring
* OS image build for Linux and Windows, and patch automation using Python and PowerShell
* Experience with cloud orchestration frameworks, and with development and SRE support of these systems
* Experience with CI/CD pipelines, including VCS (Git, SVN, etc.), GitLab Runners, Jenkins, and Rundeck
* Oracle Database expertise in ATP and ADW, and programming in SQL and PL/SQL
* Working with or supporting production, test, and development environments for medium to large user environments
* Installing and configuring application servers and database servers
* Experience in developing scripts to automate software deployments and installations
* Experience in a 24x7 high-availability production environment
* Ability to come up with the best solution by capturing the big picture instead of focusing on minor details
* Root cause analysis

Certifications Preferred (if any)
* Cloud Certifications - OCI Certified, AWS Certified, Kubernetes certified
* Python Certifications
* Network Certifications - CCNA
* OS Certifications - OEL certified, RHCE certified
* Security Certifications - Cloud security certs

Education (Preferred Degree)
* M.S. / B.S. in Computer Science, Computer Engineering, Software Engineering, or related areas is preferred

Additional Competencies

1: Bias for Action
Evaluates, acts, and communicates within SLA time. Is decisive. Makes timely, practical, effective decisions. Takes initiative without being asked. Plans efficiently while avoiding analysis paralysis. Knows how to take smart risks. Demonstrates strong follow-through and consistently keeps commitments to customers and employees. Takes ownership and responsibility for priority customer issues and, where and when required, reviews urgent and critical incidents for quality.

2: Prioritization
Able to prioritize the assignments at hand even in loosely structured situations. Effectively handles multiple projects or tasks at the same time and completes them within a set time frame.

3: Self development and teaching
Understands personal strengths and development needs. Initiates self-development actions. Seeks and shares job-relevant learning, developmental experiences, and feedback to enhance performance. Encourages others to take personal responsibility for continual learning and skill growth. Shares knowledge with others.

4: Dealing with ambiguity
Able to function well in loosely structured situations. Works effectively in situations involving uncertainty or lack of information. Effectively handles multiple projects or tasks at the same time. Is open to and responds flexibly to change.

5: Teamwork and willingness to roll up sleeves
Fosters cross-functional and cross-business teamwork. Builds and promotes team morale. Works efficiently and effectively on teams to meet customers' needs. Contributes outside the scope of the job.
Meets all team commitments. Shows consistent effort, intense commitment, and willingness to go above and beyond when needed. Willing to do low-profile, non-challenging work to get the project done.

Special Requirements:
Successful candidates might be required to perform on-call duty on a rotational basis.

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.

Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission-critical stack, with a focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate a clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and its dependencies to troubleshoot issues and define mitigations.
Understand and explain the effect of product architecture decisions on distributed systems. Bring professional curiosity and a desire to develop a deep understanding of services and technologies.

A BS or MS in Computer Science, or equivalent. Knowledge of server hardware and software configuration, networking, standard internet services, scripting languages, cloud computing patterns, and technology security and compliance. Experience running large-scale customer-facing web services. Understanding of load-balancing technologies and experience with development in programming languages, databases and big data stores, and container technologies. Work involves defining and documenting the technical architecture of complex and highly scalable products. A minimum of 5 years of experience running large-scale customer-facing web services.

Oracle is an Affirmative Action-Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, protected veteran status, age, or any other characteristic protected by law.

Keywords: Oracle, Broomfield, Senior Site Reliability Engineer (JoinOCI-SDE), Engineering, Broomfield, Colorado

Other Engineering Jobs

SCADA Engineer
Description: SCADA Engineer, Denver, CO. System One is currently seeking a SCADA Engineer for a 12-month contract position located in Denver, CO.
Company: System One
Location: Denver
Posted on: 01/19/2021

Test and Evaluation Engineer
Description: SAIC is seeking a Test and Evaluation (T&E) Engineer to join a dynamic, engaging team that brings (more...)
Company: SAIC Corporation
Location: Colorado Springs
Posted on: 01/19/2021

Senior Electrical Engineer
Description: In this role the ideal candidate will research, design, develop, test, certify, deploy and improve cutting edge products and services. These cover a very wide range from advanced mission payloads and (more...)
Company: Blackstone Talent Group
Location: Louisville
Posted on: 01/19/2021

Key Technician - Denver, CO
Description: Reporting to the Regional Manager. Responsible for soliciting business from retail automotive companies, companies with privately held fleets, financial institutions and other companies that are potential (more...)
Company: KAR Global
Location: Denver
Posted on: 01/19/2021

Auto Body Technician
Description: Job Summary: Experienced Auto Body Technician needed to repair vehicles thoroughly, safely, and profitably in a manner consistent with Caliber S.O.P., insurance partner and industry guidelines/standards. Auto (more...)
Company: Caliber Collision
Location: Henderson
Posted on: 01/19/2021

Civil Engineer
Description: Scope: Assists Project Managers with detailed engineering analysis in the preparation of engineering reports, permit applications and construction plan preparation. They direct and train Civil Design Technicians (more...)
Company: IMEG Corp
Location: Denver
Posted on: 01/19/2021

Data Science Engineer
Description: BioIntelliSense is ushering in a new era of continuous health monitoring and clinical intelligence for Remote Patient Monitoring (RPM) and Screening (COVID-19/Infection). Our medical-grade (more...)
Company: BioIntelliSense, Inc.
Location: Golden
Posted on: 01/19/2021

Sustainment Cyber Security Engineer
Description: Security Engineer must demonstrate technical knowledge of data systems and security procedures, as well as a familiarity with systems hardware and software. They require good communication skills and the (more...)
Company: Jacobs
Location: Colorado Springs
Posted on: 01/19/2021

Data Engineer
Description: Olive's AI workforce is built to fix our broken healthcare system by addressing healthcare's most burdensome issues, delivering hospitals and health systems increased revenue, reduced costs, (more...)
Company: Olive
Location: Denver
Posted on: 01/19/2021

Full Stack Engineer
Description: Come join a Series A B2B marketplace startup that is 100% remote-friendly! This Jobot Job is hosted by Brandon Bays. Are you a fit? Easy Apply now by clicking the Apply button and sending us your resume. (more...)
Company: Jobot
Location: Denver
Posted on: 01/19/2021
