7,634 Devops Engineers jobs in the United States
Lead Infrastructure DevOps Engineers - 24 months contract
Posted today
Job Viewed
Job Description
Job Description
This position with a healthcare client, our partner had a lot of success with in the past.
MUST be local to the Bay Area (commuting distance to Oakland) – they are working remote right now but these contracts are multiple years so they will only consider candidates that are local.
No coast-to-coast relocation candidates.
Prefer USC/GC
Sr. DevOps Lead Engineer - Oakland, CA - 24+ months
Our client has been engaged to find two Lead Infrastructure DevOps Engineers to join the Infrastructure Team of a dynamic Healthcare organization’s IT Group. The Lead Infrastructure DevOps Engineers’ primary function will be to advance the Infrastructure Team from a traditional infrastructure methodology to an Infrastructure as Code approach. You will be responsible for maintaining and expanding the orchestration platform for containers using Kubernetes, helping and supporting applications teams integrate CI/CD with Kubernetes, providing an automated build and configuration process for BareMetal that is hardware agnostic, and helping lead the infrastructure and applications teams through the DevOps journey. The role also requires on-call rotation when necessary.
Scope of Responsibilities:
- Build out additional automation which includes the Windows/Linux OS platforms, Storage platforms and DB platforms.
- Roadmap, architect, and collaborate with development teams to transition to an automated CI/CD pipeline.
- Assist with the design and building of reliable, fault tolerant private cloud infrastructure following industry best practices.
- Leverage cloud technologies and best practices on premise.
- Thought leader across multiple teams and technologies to drive change into teams to move towards and infrastructure as code approach.
- Using automation to reduce operational workload to support teams.
- Fully Automated Configuration Management using Ansible or similar tools.
- Incorporate security best practices within the CI/CD pipeline using process or tools.
- Operational support of core infrastructure services which includes the ability to transition to traditional infrastructure, server, storage, and the network.
Minimum Qualifications:
- Minimum three years of experience working in a lead role in a large, containerized environment using Kubernetes as the main platform for container orchestration.
- Minimum four years of experience working on Linux as a Systems Administrator, specifically Redhat.
- Bachelor's degree in Computer Science, Engineering, Social Science, Education, Business, Healthcare or related field and a minimum of three years working in IT or operations. Additional equivalent work experience may be substituted for the degree requirement.
Preferred Qualifications:
- Three years of experience writing documentation or standard operating procedures related to system administration.
- Two years of experience with open source technologies such as Grafana, Prometheus, Mongo, Postgres.
- Two years of experience with Ansible, automating infrastructure components; i.e., Servers, Virtual Machines and Networks.
- Two years of experience using NGINX and Calico.
- Two years of experience building and maintaining docker images.
- One year working with ServiceNow for managing incidents and service requests.
Cloud Site Reliability Engineer
Posted 3 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772
Cloud Site Reliability Engineer
Posted 5 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772
Cloud Site Reliability Engineer
Posted 6 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772
Cloud Site Reliability Engineer
Posted 5 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772
Cloud Site Reliability Engineer
Posted 5 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772
Cloud Site Reliability Engineer
Posted 5 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772
Be The First To Know
About the latest Devops engineers Jobs in United States !
Cloud Site Reliability Engineer
Posted 5 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772
Cloud Site Reliability Engineer
Posted 6 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772
Cloud Site Reliability Engineer
Posted 5 days ago
Job Viewed
Job Description
Ford is seeking an experienced and passionate Site Reliability Engineer (SRE) to join our team in developing, enhancing, and expanding our global monitoring and observability platform. You'll blend software and systems engineering to ensure the uptime, scalability, and maintainability of our critical cloud services. You'll be at the intersection of SRE and Software Development, building and driving the adoption of our global monitoring capabilities.
If you're passionate about using your IT expertise and analytical skills to shape the future of transportation, this is your opportunity to make a real impact. Join us and be part of a team that's building the future of mobility!
+ Write, configure, and deploy code that improves service reliability for existing or new systems; set standard for others with respect to code quality.
+ Provide helpful and actionable feedback and review for code or production changes.
+ Drive repair/optimization of complex systems with consideration towards a wide range of contributing factors.
+ Lead debugging, troubleshooting, and analysis of service architecture and design.
+ Participate in on-call rotation.
+ Write documentation: design, system analysis, runbooks, playbooks. Provide design feedback and uplevel design skills of others.
+ Implement and manage SRE monitoring application backends using Golang, Postgres, and OpenTelemetry. Develop tooling using Terraform and other IaC tools to ensure visibility and proactive issue detection across our platforms.
+ Work within GCP infrastructure, optimizing performance and cost, and scaling resources to meet demand.
+ Collaborate with development teams to enhance system reliability and performance, applying a platform engineering mindset to system administration tasks.
+ Develop and maintain automated solutions for operational aspects such as on-call monitoring, performance tuning, and disaster recovery.
+ Troubleshoot and resolve issues in our dev, test, and production environments.
+ Participate in postmortem analysis and create preventative measures for future incidents.
+ Implement and maintain security best practices across our infrastructure, ensuring compliance with industry standards and internal policies. Participate in security audits and vulnerability assessments.
+ Participate in capacity planning and forecasting efforts to ensure our systems can handle future growth and demand. Analyze trends and make recommendations for resource allocation.
+ Identify and address performance bottlenecks through code profiling, system analysis, and configuration tuning. Implement and monitor performance metrics to proactively identify and resolve issues.
+ Develop, maintain, and test disaster recovery plans and procedures to ensure business continuity in the event of a major outage or disaster. Participate in regular disaster recovery exercises.
+ Contribute to internal knowledge bases and documentation.
+ Bachelor's degree in Computer Science, Engineering, Mathematics or equivalent experience.
+ 3+ years of experience as an SRE, Software Engineer, DevOps Engineer or similar role.
+ 2+ years experience in cloud native software application development
+ Solid programming skills in Golang and scripting languages, with a good understanding of software development best practices.
+ Proficient with IaC (Infrastructure as Code) like Terraform
+ Proficient with monitoring and observability tools, particularly OpenTelemetry, Dynatrace or other tools.
+ Proficient with cloud services, with a strong preference for Kubernetes and Google Cloud Platform (GCP) experience.
+ Experience with relational and document databases.
+ Ability to debug, optimize code, and automate routine tasks.
+ Strong problem-solving skills and the ability to work under pressure in a fast-paced environment.
+ Excellent verbal and written communication skills.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder.or all of the above? No matter what you choose, we offer a work life that works for you, including:
- Immediate medical, dental, and prescription drug coverage
- Flexible family care, parental leave, new parent ramp-up programs, subsidized back-up child care and more
- Vehicle discount program for employees and family members, and management leases
- Tuition assistance
- Established and active employee resource groups
- Paid time off for individual and team community service
- A generous schedule of paid holidays, including the week between Christmas and New Year's Day
- Paid time off and the option to purchase additional vacation time.
For a detailed look at our benefits, click here: Benefit Summary ( role is remote unless you are located within 50 mile radius of a Ford Hub, which you will be required to commute on site 4x a week_**
**_*Visa Sponsorship is NOT provided for this specific role_** ***
**_*Relocation assistance IS NOT provided for this specific role*_**
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer committed to a culturally diverse workforce. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, If you need a reasonable accommodation for the online application process due to a disability, please call .
#LI-Remote
#LI-DS2
**Requisition ID** : 48772