7,788 Devops Engineers jobs in the United States
Devops Engineers
Posted 10 days ago
Job Viewed
Job Description
Hi Professionals,
I am sending requirement, kindly get back to me if the job description suits you.
Job Title: Devops Engineers
Job Location: ustin, TX
Experience: 8-10 Years.
Contract Type: C2C
Mode of Interview: 1) Telephonic
2) Face-2-Face
Oscar| Infowaygroup.com | US IT Recruiter
Cell :
Info Way Solutions LLC | 46520 Fremont Blvd, Suite 614 | Fremont, CA - 94538
Lead Infrastructure DevOps Engineers - 24 months contract
Posted 10 days ago
Job Viewed
Job Description
Job DescriptionJob Description
This position with a healthcare client, our partner had a lot of success with in the past.
MUST be local to the Bay Area (commuting distance to Oakland) – they are working remote right now but these contracts are multiple years so they will only consider candidates that are local.
No coast-to-coast relocation candidates.
Prefer USC/GC
Sr. DevOps Lead Engineer - Oakland, CA - 24+ months
Our client has been engaged to find two Lead Infrastructure DevOps Engineers to join the Infrastructure Team of a dynamic Healthcare organization’s IT Group. The Lead Infrastructure DevOps Engineers’ primary function will be to advance the Infrastructure Team from a traditional infrastructure methodology to an Infrastructure as Code approach. You will be responsible for maintaining and expanding the orchestration platform for containers using Kubernetes, helping and supporting applications teams integrate CI/CD with Kubernetes, providing an automated build and configuration process for BareMetal that is hardware agnostic, and helping lead the infrastructure and applications teams through the DevOps journey. The role also requires on-call rotation when necessary.
Scope of Responsibilities:
- Build out additional automation which includes the Windows/Linux OS platforms, Storage platforms and DB platforms.
- Roadmap, architect, and collaborate with development teams to transition to an automated CI/CD pipeline.
- Assist with the design and building of reliable, fault tolerant private cloud infrastructure following industry best practices.
- Leverage cloud technologies and best practices on premise.
- Thought leader across multiple teams and technologies to drive change into teams to move towards and infrastructure as code approach.
- Using automation to reduce operational workload to support teams.
- Fully Automated Configuration Management using Ansible or similar tools.
- Incorporate security best practices within the CI/CD pipeline using process or tools.
- Operational support of core infrastructure services which includes the ability to transition to traditional infrastructure, server, storage, and the network.
Minimum Qualifications:
- Minimum three years of experience working in a lead role in a large, containerized environment using Kubernetes as the main platform for container orchestration.
- Minimum four years of experience working on Linux as a Systems Administrator, specifically Redhat.
- Bachelor's degree in Computer Science, Engineering, Social Science, Education, Business, Healthcare or related field and a minimum of three years working in IT or operations. Additional equivalent work experience may be substituted for the degree requirement.
Qualifications:
- Three years of experience writing documentation or standard operating procedures related to system administration.
- Two years of experience with open source technologies such as Grafana, Prometheus, Mongo, Postgres.
- Two years of experience with Ansible, automating infrastructure components; i.e., Servers, Virtual Machines and Networks.
- Two years of experience using NGINX and Calico.
- Two years of experience building and maintaining docker images.
- One year working with ServiceNow for managing incidents and service requests.
Cloud Site Reliability Engineer
Posted today
Job Viewed
Job Description
We are seeking a highly skilled Site Reliability Engineer with 3 years of experience to join our dynamic team. The ideal candidate will have a strong background in cloud technologies, with a focus on designing, implementing, and managing cloud-based solutions. As a Site Reliability Engineer, you will play a key role in ensuring the availability, performance, and security of our cloud infrastructure. In this role you will: * Lead the day-to-day technical operations, providing the highest levels of availability, reliability, and scalability of the services. * Implement best practices for cloud security, including identity and access management, encryption, and network security. * Provide technical expertise to handle customer escalations and ensure stability in customer environments. * Conduct performance analysis and lead monitoring initiatives on multiple hosted products/platforms. * Maintain operational run book procedures for all production systems and document the knowledge base. * Administer incident management activities (detection, recording, classification, and closure) and provide timely escalations and notifications as required by procedure. * Participate in on-call rotation to respond to cloud-related incidents and emergencies. * Troubleshoot and resolve complex technical issues in a timely manner. * Monitor and optimize cloud infrastructure for performance, cost, and security. * Collaborate with cross-functional teams to troubleshoot and resolve complex cloud-related issues. * Mentor junior team members and provide technical guidance and support. You've got what it takes if you have: * U.S. citizenship required * Minimum bachelor's degree in computer science, engineering, or a related field, or equivalent experience. * 3+ years of experience in cloud operations. * Comprehensive understanding of cloud computing principles and architectures. * Extensive experience in Linux/Unix environments. * Proficiency in containerization technologies like Docker and Kubernetes. * Strong scripting skills in Python or Bash. * Proficient in debugging and optimizing Java-based applications. * Hands-on experience in deploying, optimizing, and troubleshooting applications on Tomcat and JBoss application servers. * Hands-on experience in managing and optimizing Memcached, Nginx, ActiveMQ, Elasticsearch, and Redis applications. * Experience with monitoring and logging tools such as Newrelic and the ELK stack. * Sound knowledge of networking concepts, including TCP/IP, DNS, and VPN. * Proficiency in automation and configuration management tools like Ansible, Jenkins, and Bitbucket. * Thorough understanding of monitoring and alerting tools such as Nagios, New Relic, Grafana, and CheckMk. * Experience with distributed storage technologies such as NFS, Netapp, and Amazon S3, as well as dynamic resource management frameworks (e.g., Kubernetes). * Experience working in Datacenter and AWS cloud platforms. * Strong communication and collaboration skills. * Excellent troubleshooting and problem-solving skills.
Cloud Site Reliability Engineer
Posted 6 days ago
Job Viewed
Job Description
The Site Reliability Engineer designs, implements, and manages scalable, secure, and highly available cloud solutions. This role requires a strong background in cloud environments, DevOps practices, and a commitment to improving system reliability through automation and performance tuning. The ideal candidate will bring expertise in cloud platforms, mentoring skills, and a proven ability to support operations in fast-paced, innovative settings.
The site reliability engineer will help to implement and maintain the cloud environments, identify areas for improvement, bring a big picture vision, make recommendations and implement solutions. The ideal candidate for this role has a passion for automation, always keeps security in mind, and can implement best practices to further our mission. A background in computer science fundamentals allows the site reliability engineer to apply coding principles and best practices to Intermountain's systems, infrastructure and automation initiatives, embodying an 'infrastructure as code' approach. This role needs to be proficient in current testing methodologies and believe in an effective and efficient engineering approaches.
The Site Reliability Engineer is expected to collaborate with technical staff, management, and business operations staff throughout all phases of cloud deployment life cycle. In order to deliver durable solutions to complex businesses problems according to agreed upon timelines and budgets to support the mission, vision, and values of Intermountain Healthcare. The SRE role is a high-level contributor. Responsible for implementing and/or integrating new products, processes, methodologies, frameworks, and technologies in the cloud space. Works independently with minimal oversight and direction. Provides guidance, input, and instruction to lower-level technical professionals. Typically works on routine to moderately complex projects and may work under the supervision and mentorship of a higher-level engineer.
Focuses on implementing and maintaining cloud environments, identifying areas for improvement, and applying best practices to infrastructure and automation initiatives. Collaborates with technical staff, management, and business operations to deliver solutions within timelines and budgets.
Creates low-level solutions architecture diagrams, technical diagrams, and software development lifecycle diagrams that illustrate the CI/CD solutions. Communicates in an effective and professional way with customers both inside and outside of the company. Deploying new tools to support our systems and services in an automated fashion.
**Essential Functions**
+ Design, build, and maintain cloud-based solutions across multiple platforms such as Microsoft Azure, AWS, and Google Cloud Platform.
+ Develop and manage scalable, secure, and high-performing infrastructure to support diverse applications, including service-oriented architectures and client-server systems.
+ Implement and refine CI/CD pipelines using tools such as Bamboo, Chef, or Azure DevOps to enhance deployment speed and reliability.
+ Create robust monitoring, alerting, and automation solutions to improve system reliability and reduce manual intervention.
+ Collaborate with cross-functional teams to support and troubleshoot production systems, ensuring maximum uptime and performance.
+ Drive technical improvements and reduce technical debt by applying advanced knowledge of system performance, scalability, and architecture.
+ Support data privacy compliance by understanding and implementing relevant data privacy practices and laws.
+ Develop code automation solutions, with experience coding in at least one language such as Bash/Shell Scripting.
+ Work as a subject matter expert and demonstrate an understanding of cloud architecture, SaaS, and PaaS solutions.
+ Consult with business units and develop solutions for their cloud needs.
Develops code automation solutions (experience coding in at least one of the following languages):
+ Bash / Shell Scripting
Works as a subject matter expert and demonstrates an understanding of cloud architecture, SaaS and PaaS solutions (experience in at least one of the following):
+ Microsoft Azure
+ Amazon Web Services
+ Google Cloud Platform
**Skills / Qualifications**
+ Azure certifications are highly desirable.
+ Bachelor's degree in computer science, information systems, or technology related discipline.
+ Proven ability to prioritize tasks and excel under high-pressure conditions.
+ Advanced understanding of performance, scalability, and system architecture principles.
+ At least four years of experience in healthcare-related software development or system administration.
+ Experience with Docker, Kubernetes, or AKS.
+ Familiarity with CI/CD tools such as GitHub Bamboo, Chef, or Azure DevOps.
+ Strong analytical, problem-solving, and communication skills.
+ Self-motivated with keen attention to detail and customer-focused attitude.
+ Experience working in product-driven environments.
+ Ability to effectively prioritize and execute tasks in a high-pressure environment is crucial.
+ Advanced knowledge of performance, scalability and system architecture with an eye toward avoiding and reducing technical debt.
+ Proven analytical and problem-solving abilities.
+ Able to learn, understand, and apply new technologies.
+ Strong written and verbal communication skills.
+ Strong interpersonal and customer service skills.
+ Highly self-motivated and directed.
+ Keen attention to detail.
+ Proven experience working in product driven environment.
+ Knowledge of applicable data privacy practices and laws.
+ Ability to mentor other caregivers
+ Be a technical lead on projects and initiatives.
+ Understand broader cloud environments and the bigger strategic vision
+ Advanced troubleshooting and problem solving skills
Experience developing code in at least two of the following:
+ Bash / Shell Scripting
Experience in at least one of the following:
+ Microsoft Azure
+ Amazon Web Services
+ Google Cloud Platform
**Physical Requirements:**
**Physical Requirements**
+ Interact with others requiring the employee to communicate information.
+ Operate computers and other IT equipment requiring the ability to move fingers and hands.
+ See and read computer monitors and documents.
+ Remain sitting or standing for long periods of time to perform work on a computer, telephone, or other equipment.
**Location:**
Lake Park Building
**Work City:**
West Valley City
**Work State:**
Utah
**Scheduled Weekly Hours:**
40
The hourly range for this position is listed below. Actual hourly rate dependent upon experience.
$48.76 - $76.76
We care about your well-being - mind, body, and spirit - which is why we provide our caregivers a generous benefits package that covers a wide range of programs to foster a sustainable culture of wellness that encompasses living healthy, happy, secure, connected, and engaged.
Learn more about our comprehensive benefits package here ( .
Intermountain Health is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability or protected veteran status.
At Intermountain Health, we use the artificial intelligence ("AI") platform, HiredScore to improve your job application experience. HiredScore helps match your skills and experiences to the best jobs for you. While HiredScore assists in reviewing applications, all final decisions are made by Intermountain personnel to ensure fairness. We protect your privacy and follow strict data protection rules. Your information is safe and used only for recruitment. Thank you for considering a career with us and experiencing our AI-enhanced recruitment process.
All positions subject to close without notice.
Cloud Site Reliability Engineer

Posted 8 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772
Cloud Site Reliability Engineer

Posted 8 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772
Cloud Site Reliability Engineer

Posted 8 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772
Be The First To Know
About the latest Devops engineers Jobs in United States !
Cloud Site Reliability Engineer

Posted 8 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772
Cloud Site Reliability Engineer

Posted 8 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772
Cloud Site Reliability Engineer

Posted 8 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772