2,315 Safeguards jobs in the United States
Software Engineer, Safeguards
Posted 4 days ago
Job Description
Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About the role:
We are looking for software engineers to help build safety and oversight mechanisms for our AI systems. As a trust and safety software engineer, you will work to monitor models, prevent misuse, and ensure user well-being. This role will focus on building systems to detect unwanted model behaviors and prevent disallowed use of models. You will apply your technical skills to uphold our principles of safety, transparency, and oversight while enforcing our terms of service and acceptable use policies.
Responsibilities:
- Develop monitoring systems to detect unwanted behaviors from our API partners and potentially take automated enforcement actions; surface these in internal dashboards to analysts for manual review
- Build abuse detection mechanisms and infrastructure
- Surface abuse patterns to our research teams to harden models at the training stage
- Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms that work at scale
- Analyze user reports of inappropriate content or accounts
- Bachelor's degree in Computer Science, Software Engineering or comparable experience
- 3-10+ years of experience in a software engineering position, preferably with a focus on integrity, spam, fraud, or abuse detection.
- Proficiency in SQL, Python, and data analysis tools.
- Strong communication skills and ability to explain complex technical concepts to non-technical stakeholders
- Have experience building trust and safety mechanisms for AI/ML systems, such as fraud detection models or security monitoring tools or the infrastructure to support these systems at scale
- Have experience with machine learning frameworks like Scikit-Learn, Tensorflow, or Pytorch, and experience building machine learning models
- Have experience with prompt engineering, jailbreak attacks, and other adversarial inputs
- Have worked closely with operational teams to build custom internal tooling
Deadline to apply: None. Applications will be reviewed on a rolling basis.
The expected salary range for this position is:
Annual Salary:
$300,000-$405,000 USD
Logistics
Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience.
Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.
We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.
How we're different
We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact - advancing our long-term goals of steerable, trustworthy AI - rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.
The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.
Come work with us!
Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.
Data Scientist, Safeguards
Posted 4 days ago
Job Description
About the role:
As an early member of our T&S Data Science and Analytics team, you will play an instrumental role in our company's mission of building safe and beneficial artificial intelligence by building and scaling a data-driven culture from the ground up. In this unique company, technology, and moment in history, your work will be critical to informing our product and commercial strategy as we deploy safe, frontier AI at scale to the world.
You will work closely with product, engineering, policy & enforcement to define and measure key company success metrics, analyze user behavior to identify new enforcement opportunities and build a culture of developing and testing hypotheses through experimentation. You've worked in cultures of excellence in the past, and are eager to apply that experience to building robust and scalable systems and processes as our company goes through a phase of rapid growth.
Responsibilities:
- Deep dive into user behavior data to provide insights on safety concerns
- Define core metrics that measure the team's success. Set goals, build forecasts, monitor performance, and develop actionable reporting
- Identify and size opportunities to improve the product, influencing product roadmap through your insights and recommendations
- Develop hypotheses on product changes, design controlled experiments, analyze the results, and make recommendations based on impact to key metrics
- Build a data-driven culture from the ground up by establishing foundational data best practices and making data more accessible across the company
- 8+ years of experience in data science or analytics roles, preferably in an infrastructure or operations context.
- 5+ years of experience deeply embedding in Product teams
- A passion for the company's mission of building helpful, honest, and harmless AI.
- Expertise in Python, SQL, and data visualization tools.
- A bias for action and urgency, not letting perfect be the enemy of the effective.
- A strong disposition to thrive in ambiguity, taking initiative to create clarity and forward progress.
- A deep curiosity and energy for pulling the thread on hard questions.
- Experience in turning open questions and data into concise and insightful analysis.
- Highly effective written communication and presentation skills.
Deadline to apply: None. Applications will be reviewed on a rolling basis.
The expected salary range for this position is:
Annual Salary:
$230,000-$275,000 USD
Logistics
Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience.
Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.
Software Engineer, Safeguards Research
Posted 8 days ago
Job Description
About the role:
The Safeguards Research Team conducts critical safety research and engineering to ensure AI systems can be deployed safely. As a Software Engineer on our team, you'll develop robust end-to-end pipelines and tooling that directly supports our safety research initiatives. You'll work on building scalable infrastructure for evaluating model behaviors, automating safety assessments, and implementing efficient data processing systems to help us understand and mitigate risks in advanced AI systems.
You take a pragmatic approach to software development, preferring simple, effective solutions over complex ones. You're adept at working with language models through effective prompting techniques and can translate research concepts into production-quality code. You'll collaborate closely with researchers to build tools that address both immediate safety challenges and support longer-term research initiatives. Deep machine learning knowledge is not required for this role. The main requirement is a desire to understand researcher workflows and improve research productivity.
Representative projects:
- Design and implement end-to-end pipelines for efficient evaluation of model safety features and vulnerabilities
- Develop tooling to automate the generation and analysis of jailbreak attempts
- Build data processing systems that can handle large-scale model outputs
- Optimize Python code for memory efficiency and parallelization to improve research workflow performance
- Create flexible interfaces for researchers to interact with models and experimental setups
- Implement monitoring systems to track model behavior across different safety dimensions
You may be a good fit if you:
- Have strong software engineering experience, particularly with Python
- Are excited about learning foundational ML knowledge to make researchers more effective (deep ML knowledge is not required)
- Are experienced with prompting and working with language models
- Excel at building practical, scalable data pipelines and tooling
- Prefer implementing simple solutions that work reliably over complex ones
- Have experience optimizing code for performance and resource efficiency
- Are comfortable working in a fast-paced, collaborative research environment
- Care deeply about the impacts of AI
Strong candidates may also:
- Have experience building systems that integrate with large language models
- Have worked on distributed computing systems or parallel processing
- Have implemented data processing pipelines at scale
- Have contributed to open-source machine learning or AI safety tools
- Have experience with cloud infrastructure and containerization
The expected salary range for this position is:
$300,000-$405,000 USD
Logistics
Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience.
Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Note: Currently, the team has a preference for candidates who are able to be based in the Bay Area. However, we remain open to any candidate who can travel 25% to the Bay Area.
Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.
Analyst, Safeguards (Cyber Harms)
Posted today
Job Description
About the role:
As a Safeguards Analyst focusing on Cyber Harms, you will play a critical role in protecting our platform and users from cyber security risks through consistent policy enforcement and trend analysis.
Important Context: In this position, you may be exposed to and engage with explicit content spanning a range of topics, including those of a sexual, violent, or psychologically disturbing nature. There is also an on-call responsibility across the Policy and Enforcement teams.
Responsibilities:
- Enforce trust and safety policies with a specific focus on detecting and mitigating potential cyber security risks and harmful use of AI systems
- Monitor and analyze platform activity to identify emerging cyber threat patterns and trends that may require policy updates or enforcement actions
- Work with engineers to develop and iterate on safety systems that govern responsible use of our models for emerging capabilities and use cases related to cyber threats
- Conduct thorough investigations of potential policy violations related to cyber harms, gathering and documenting evidence to support enforcement decisions, and working to escalate cases with investigations and/or Security to identify coordinated activity
- Collaborate with the Policy team to provide feedback on policy gaps and ambiguities based on real enforcement scenarios involving cyber threats
- Support the development and refinement of detection methods for cyber-related abuse through data analysis and pattern recognition
- Work closely with cross-functional teams to ensure consistent application of policies across different use cases and scenarios
- Maintain detailed documentation of investigation findings and enforcement actions
- Participate in regular policy reviews and provide insights from an enforcement perspective
- Operationalize review workflows and determine prioritization of reviews
- Handle user appeals and communications related to enforcement actions with professionalism and clarity
You may be a good fit if you have:
- 2+ years of experience in cybersecurity, or related field
- Strong understanding of cybersecurity concepts, web security, and common attack patterns
- Experience in offensive cybersecurity, CTFs, or penetration testing (OSCP Certification is not required, but valued)
- Ability to utilize Python and/or other data analysis tools and interact with large databases
- Demonstrated ability to analyze complex situations and make well-reasoned decisions under pressure
- Strong attention to detail and ability to maintain accurate documentation
- Excellent written and verbal communication skills
- Ability to work independently while maintaining strong collaboration with team members
- Bachelor's degree in Computer Science, Information Security, or related field (or equivalent practical experience)
Strong candidates may:
- Have a deep interest in AI safety and responsible technology development
- Have a background in ethical hacking/pen-testing/malware analysis
- Can balance competing priorities and handle time-sensitive issues effectively
- Are comfortable working in ambiguous situations and can make sound judgments based on available information
- Demonstrate strong analytical thinking and problem-solving skills
- Are proactive in identifying emerging threats and suggesting improvements to existing processes
- Have experience with or interest in content moderation and policy enforcement at scale
- Can effectively communicate technical concepts to both technical and non-technical stakeholders
The expected salary range for this position is:
Annual Salary:
$170,000-$325,000 USD
Logistics
Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience.
Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.
Analyst, Safeguards (Cyber Harms)
Posted today
Job Description
About Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About the role:
As a Safeguards Analyst focusing on Cyber Harms, you will play a critical role in protecting our platform and users from cyber security risks through consistent policy enforcement and trend analysis.
Important Context: In this position, you may be exposed to and engage with explicit content spanning a range of topics, including those of a sexual, violent, or psychologically disturbing nature. There is also an on-call responsibility across the Policy and Enforcement teams.
Responsibilities:
- Enforce trust and safety policies with a specific focus on detecting and mitigating potential cyber security risks and harmful use of AI systems
- Monitor and analyze platform activity to identify emerging cyber threat patterns and trends that may require policy updates or enforcement actions
- Work with engineers to develop and iterate on safety systems that govern responsible use of our models for emerging capabilities and use cases related to cyber threats
- Conduct thorough investigations of potential policy violations related to cyber harms, gathering and documenting evidence to support enforcement decisions, and working to escalate cases with investigations and/or Security to identify coordinated activity
- Collaborate with the Policy team to provide feedback on policy gaps and ambiguities based on real enforcement scenarios involving cyber threats
- Support the development and refinement of detection methods for cyber-related abuse through data analysis and pattern recognition
- Work closely with cross-functional teams to ensure consistent application of policies across different use cases and scenarios
- Maintain detailed documentation of investigation findings and enforcement actions
- Participate in regular policy reviews and provide insights from an enforcement perspective
- Operationalize review workflows and determine prioritization of reviews
- Handle user appeals and communications related to enforcement actions with professionalism and clarity
You may be a good fit if you have:
- 2+ years of experience in cybersecurity or a related field
- Strong understanding of cybersecurity concepts, web security, and common attack patterns
- Experience in offensive cybersecurity, CTFs, or penetration testing (OSCP Certification is not required, but valued)
- Ability to utilize Python and/or other data analysis tools and interact with large databases
- Demonstrated ability to analyze complex situations and make well-reasoned decisions under pressure
- Strong attention to detail and ability to maintain accurate documentation
- Excellent written and verbal communication skills
- Ability to work independently while maintaining strong collaboration with team members
- Bachelor's degree in Computer Science, Information Security, or related field (or equivalent practical experience)
Strong candidates may:
- Have a deep interest in AI safety and responsible technology development
- Have a background in ethical hacking/pen-testing/malware analysis
- Be able to balance competing priorities and handle time-sensitive issues effectively
- Be comfortable working in ambiguous situations and able to make sound judgments based on available information
- Demonstrate strong analytical thinking and problem-solving skills
- Be proactive in identifying emerging threats and suggesting improvements to existing processes
- Have experience with or interest in content moderation and policy enforcement at scale
- Be able to communicate technical concepts effectively to both technical and non-technical stakeholders
The expected salary range for this position is:
Annual Salary:
$170,000 - $325,000 USD
Logistics
Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience.
Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.
We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.
How we're different
We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact (advancing our long-term goals of steerable, trustworthy AI) rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.
The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.
Come work with us!
Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.
Sr Safeguards & Security Spec

Posted today
Job Description
Mission Support and Test Services, LLC (MSTS) manages and operates the Nevada National Security Site (NNSS) for the U.S. National Nuclear Security Administration (NNSA). Our MISSION is to help ensure the security of the United States and its allies by providing high-hazard experimentation and incident response capabilities through operations, engineering, education, field, and integration services and by acting as environmental stewards to the Site's Cold War legacy. Our VISION is to be the user site of choice for large-scale, high-hazard, national security experimentation, with premier facilities and capabilities below ground, on the ground, and in the air. (See NNSS.gov for our unique capabilities.) Our 2,750+ professional, craft, and support employees are called upon to innovate, collaborate, and deliver on some of the more difficult nuclear security challenges facing the world today.
+ MSTS offers our full-time employees highly competitive salaries and benefits packages including medical, dental, and vision; both a pension and a 401k; paid time off and 96 hours of paid holidays; relocation (if located more than 75 miles from work location); tuition assistance and reimbursement; and more.
+ MSTS is a limited liability company consisting of Honeywell International Inc. (Honeywell), Jacobs Engineering Group Inc. (Jacobs), and HII Nuclear Inc.
**Responsibilities**
The qualified candidate will report to the Facility Manager and will perform the duties of the Facility Security Officer (FSO) for the Livermore Operations (LO) Facility located in Livermore, CA. Security areas of responsibility include (but are not limited to): physical security of facilities, protection of government property, lock and key control, controlled and prohibited articles, defining and maintaining facility security areas, access authorizations and security clearances, incoming and outgoing visitor control, security badges, creating, handling, storing, processing, and transporting classified matter, secure communications, security training, awareness and employee briefings, OPSEC, communications security (COMSEC), incidents of security concern, and assisting with the local implementation of the Insider Threat and Counterintelligence (CI) programs.
**Key Responsibilities**
+ Ensure that the security program at LO conforms and complies with the requirements and regulations in the applicable Department of Energy (DOE) Orders, Mission Support & Test Services (MSTS) Company Directives, and other relevant Nevada National Security Site (NNSS) procedures.
+ Security areas of responsibility include, but are not limited to, physical security of facilities, protection of government property, lock and key control, controlled and prohibited articles, defining and maintaining facility security areas, access authorizations and security clearances, incoming and outgoing visitor control, security badges, creating, handling, storing, processing and transporting classified matter, secure communications, security training, awareness and employee briefings, OPSEC, investigating incidents of security concern, and assisting with the local implementation of the Insider Threat and Counterintelligence (CI) programs.
+ Set up subcontracts and provide oversight of subcontractors providing security services in the areas of alarms, cameras, card readers, locks, and other needed security services.
+ Implement and follow company policies, procedures, and directives.
+ Create an environment where employees feel safe to raise issues, empowered to address issues, and supported to resolve issues.
+ Demonstrate environment, safety, health, and quality leadership and consistently enforce environment, safety, health, and quality policies and procedures. Implement applicable environment, safety, health, and quality requirements; emphasize the safety of each employee, and the protection of equipment and property in area of responsibility.
+ Take immediate action to correct reported or observed unacceptable environment, safety, health and quality conditions and/or behaviors.
+ Assure that appropriate procedures, training, equipment, warnings, and tools are provided to employees to permit work to be performed safely.
+ Promote and actively participate in the MSTS safety concept. Support and encourage employee participation in MSTS environment, safety, health, and quality initiatives.
**Qualifications**
**Due to the nature of our work, US Citizenship is required for all positions.**
+ Bachelor's degree in field related to the position or equivalent training and experience plus a minimum of 5 years of progressive related experience.
+ Bachelor's degree in security, business management or field related to the position is preferred.
+ Knowledge of security policies, procedures, and technical terminology associated with security functions.
+ Demonstrates leadership qualities with emphasis on continuous improvement and team building.
+ Skill to develop and analyze information for studies and reports.
+ Able to work independently on safeguards and security program objectives and strategies and formulate strategies for improving operations/processes.
+ Must possess interpersonal communication skills of an influencing and motivating nature to interface effectively with all levels of management, DOE security personnel, and outside agencies.
+ Able to develop and maintain relationships with all levels of employees throughout the company, customers, outside agencies, and various levels of personnel within parent organizations, DOE/HQ, DOE/NNSA, and other contractors including LANL, LLNL, and Sandia as needed to facilitate meeting safeguards and security program objectives while screening and maintaining confidentiality.
+ Able to prioritize and schedule multiple activities in the most efficient manner and meet required deadlines.
+ Able to use software applications needed in the position, including word processing software, spreadsheet software, presentation software, and database software.
+ Working knowledge of LENEL and Milestone applications preferred (not required).
+ Attention to detail and accuracy are required to ensure that policy decisions, procedures, and operations are compliant with MSTS and DOE regulations, procedures, and federal and state laws.
+ Must possess planning/organizing skills and initiative; employ independent judgment; and apply knowledge and experience to ensure that requirements are completed efficiently and on time.
+ Current Q or TS clearance is preferred.
+ The primary work location will be at the Livermore Operations Facility located in Livermore, CA.
+ Flexible work schedule can be negotiated with the manager; employees can work 5/8, 9/80 or 4/10 workweeks.
+ Pre-placement physical examination, which includes a drug screen, is required. MSTS maintains a substance abuse policy that includes random drug testing.
+ Must possess a valid driver's license.
MSTS is required by DOE directive to conduct a pre-employment drug test and background review that includes checks of personal references, credit, law enforcement records, and employment/education verifications. Applicants offered employment with MSTS are also subject to a federal background investigation to meet the requirements for access to classified information or matter if the duties of the position require a DOE security clearance. Substance abuse or illegal drug use, falsification of information, criminal activity, serious misconduct or other indicators of untrustworthiness can cause a clearance to be denied or terminated by DOE, resulting in the inability to perform the duties assigned and subsequent termination of employment. In addition, applicants for employment must be able to obtain and maintain a DOE Q-level security clearance, which requires U.S. citizenship and a minimum age of 18. Reference DOE Order 472.2, "Personnel Security." If you hold more than one citizenship (i.e., of the U.S. and another country), your ability to obtain a security clearance may be impacted.
**Department of Energy Q Clearance** (position will be cleared to this level). Reviews and tests for the absence of any illegal drug as defined in 10 CFR Part 707.4, "Workplace Substance Abuse Programs at DOE Sites," will be conducted. The selected applicant will be subject to a Federal background investigation, will be required to participate in subsequent reinvestigations, and must meet the eligibility requirements for access to classified matter. Successful completion of a counterintelligence evaluation, which may include a counterintelligence-scope polygraph examination, may also be required. Reference 10 CFR Part 709, "Counterintelligence Evaluation Program."
MSTS is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, national origin, disability, veteran status or other characteristics protected by law. MSTS is a background screening, drug-free workplace.
Annual salary range for this position is: **$78,832.00 - $118,248.00.**
Starting salary is determined based on the position's market value, the individual candidate's education and experience, and internal equity.
Prin Safeguards & Sec Spec

Posted today
Job Description
Mission Support and Test Services, LLC (MSTS) manages and operates the Nevada National Security Site (NNSS) for the U.S. National Nuclear Security Administration (NNSA). Our MISSION is to help ensure the security of the United States and its allies by providing high-hazard experimentation and incident response capabilities through operations, engineering, education, field, and integration services and by acting as environmental stewards to the Site's Cold War legacy. Our VISION is to be the user site of choice for large-scale, high-hazard, national security experimentation, with premier facilities and capabilities below ground, on the ground, and in the air. (See NNSS.gov for our unique capabilities.) Our 2,750+ professional, craft, and support employees are called upon to innovate, collaborate, and deliver on some of the more difficult nuclear security challenges facing the world today.
+ MSTS offers our full-time employees highly competitive salaries and benefits packages including medical, dental, and vision; both a pension and a 401k; paid time off and 96 hours of paid holidays; relocation (if located more than 75 miles from work location); tuition assistance and reimbursement; and more.
+ MSTS is a limited liability company consisting of Honeywell International Inc. (Honeywell), Jacobs Engineering Group Inc. (Jacobs), and HII Nuclear Inc.
**Responsibilities**
The qualified candidate will report to the Livermore Operations (LO) Facility Manager and will perform the duties of the Facility Security Officer (FSO) for the LO Facility located in Livermore, CA. Security areas of responsibility include (but are not limited to): physical security of facilities, protection of government property, lock and key control, controlled and prohibited articles, defining and maintaining facility security areas, access authorizations and security clearances, incoming and outgoing visitor control, security badges, creating, handling, storing, processing, and transporting classified matter, secure communications, security training, awareness and employee briefings, OPSEC, communications security (COMSEC), incidents of security concern, and assisting with the local implementation of the Insider Threat and Counterintelligence (CI) programs.
**Responsibilities**
+ Ensure that the security program at LO conforms and complies with the requirements and regulations in the applicable Department of Energy (DOE) Orders, Mission Support & Test Services (MSTS) Company Directives, and other relevant Nevada National Security Site (NNSS) procedures.
+ Security areas of responsibility include, but are not limited to, physical security of facilities, protection of government property, lock and key control, controlled and prohibited articles, defining and maintaining facility security areas, access authorizations and security clearances, incoming and outgoing visitor control, security badges, creating, handling, storing, processing and transporting classified matter, secure communications, security training, awareness and employee briefings, OPSEC, investigating incidents of security concern, and assisting with the local implementation of the Insider Threat and Counterintelligence (CI) programs.
+ Set up subcontracts and provide oversight of subcontractors providing security services in the areas of alarms, cameras, card readers, locks, and other needed security services.
+ Implement and follow company policies, procedures, and directives.
+ Create an environment where employees feel safe to raise issues, empowered to address issues, and supported to resolve issues.
+ Demonstrate environment, safety, health, and quality leadership and consistently enforce environment, safety, health, and quality policies and procedures. Implement applicable environment, safety, health, and quality requirements; emphasize the safety of each employee, and the protection of equipment and property in area of responsibility.
+ Take immediate action to correct reported or observed unacceptable environment, safety, health and quality conditions and/or behaviors.
+ Assure that appropriate procedures, training, equipment, warnings, and tools are provided to employees to permit work to be performed safely.
+ Promote and actively participate in the MSTS safety concept. Support and encourage employee participation in MSTS environment, safety, health, and quality initiatives.
**Qualifications**
+ Bachelor's degree or equivalent training and experience, plus a minimum of 8 years of related and progressively responsible experience.
+ Bachelor's degree in security, business management or field related to the position is preferred.
+ Knowledge of security policies, procedures, and technical terminology associated with security functions.
+ Demonstrates leadership qualities with emphasis on continuous improvement and team building.
+ Skill to develop and analyze information for studies and reports.
+ Able to work independently on safeguards and security program objectives and strategies and formulate strategies for improving operations/processes.
+ Must possess interpersonal communication skills of an influencing and motivating nature to interface effectively with all levels of management, DOE security personnel, and outside agencies.
+ Able to develop and maintain relationships with all levels of employees throughout the company, customers, outside agencies, and various levels of personnel within parent organizations, DOE/HQ, DOE/NNSA, and other contractors including LANL, LLNL, and Sandia as needed to facilitate meeting safeguards and security program objectives while screening and maintaining confidentiality.
+ Able to prioritize and schedule multiple activities in the most efficient manner and meet required deadlines.
+ Able to use software applications needed in the position, including word processing software, spreadsheet software, presentation software, and database software.
+ Working knowledge of LENEL and Milestone applications preferred (not required).
+ Attention to detail and accuracy are required to ensure that policy decisions, procedures, and operations are compliant with MSTS and DOE regulations, procedures, and federal and state laws.
+ Must possess planning/organizing skills and initiative; employ independent judgment; and apply knowledge and experience to ensure that requirements are completed efficiently and on time.
+ Current Q or TS clearance is preferred.
+ The primary work location will be at the Livermore Operations Facility located in Livermore, CA.
+ Flexible work schedule can be negotiated with the manager; employees can work 5/8, 9/80 or 4/10 workweeks.
+ Pre-placement physical examination, which includes a drug screen, is required. MSTS maintains a substance abuse policy that includes random drug testing.
+ Must possess a valid driver's license.
MSTS is required by DOE directive to conduct a pre-employment drug test and background review that includes checks of personal references, credit, law enforcement records, and employment/education verifications. Applicants offered employment with MSTS are also subject to a federal background investigation to meet the requirements for access to classified information or matter if the duties of the position require a DOE security clearance. Substance abuse or illegal drug use, falsification of information, criminal activity, serious misconduct or other indicators of untrustworthiness can cause a clearance to be denied or terminated by DOE, resulting in the inability to perform the duties assigned and subsequent termination of employment. In addition, applicants for employment must be able to obtain and maintain a DOE Q-level security clearance, which requires U.S. citizenship and a minimum age of 18. Reference DOE Order 472.2, "Personnel Security." If you hold more than one citizenship (i.e., of the U.S. and another country), your ability to obtain a security clearance may be impacted.
**Department of Energy Q Clearance** (position will be cleared to this level). Reviews and tests for the absence of any illegal drug as defined in 10 CFR Part 707.4, "Workplace Substance Abuse Programs at DOE Sites," will be conducted. The selected applicant will be subject to a Federal background investigation, will be required to participate in subsequent reinvestigations, and must meet the eligibility requirements for access to classified matter. Successful completion of a counterintelligence evaluation, which may include a counterintelligence-scope polygraph examination, may also be required. Reference 10 CFR Part 709, "Counterintelligence Evaluation Program."
MSTS is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, national origin, disability, veteran status or other characteristics protected by law. MSTS is a background screening, drug-free workplace.
Annual salary range for this position is: **$99,790.57 - $152,180.63.**
Starting salary is determined based on the position's market value, the individual candidate's education and experience, and internal equity.
Data Scientist, Safeguards (Seattle)
Posted today
Job Description
About Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About the role:
As an early member of our T&S Data Science and Analytics team, you will play an instrumental role in our company’s mission of building safe and beneficial artificial intelligence by building and scaling a data-driven culture from the ground up. In this unique company, technology, and moment in history, your work will be critical to informing our product and commercial strategy as we deploy safe, frontier AI at scale to the world.
You will work closely with product, engineering, policy & enforcement to define and measure key company success metrics, analyze user behavior to identify new enforcement opportunities and build a culture of developing and testing hypotheses through experimentation. You’ve worked in cultures of excellence in the past, and are eager to apply that experience to building robust and scalable systems and processes as our company goes through a phase of rapid growth.
Responsibilities:
- Deep dive into user behavior data to provide insights on safety concerns
- Define core metrics that measure the team's success. Set goals, build forecasts, monitor performance, and develop actionable reporting
- Identify and size opportunities to improve the product, influencing product roadmap through your insights and recommendations
- Develop hypotheses on product changes, design controlled experiments, analyze the results, and make recommendations based on impact to key metrics
- Build a data-driven culture from the ground up by establishing foundational data best practices and making data more accessible across the company
- 8+ years of experience in data science or analytics roles, preferably in an infrastructure or operations context.
- 5+ years of experience deeply embedding in Product teams
- A passion for the company's mission of building helpful, honest, and harmless AI.
- Expertise in Python, SQL, and data visualization tools.
- A bias for action and urgency, not letting perfect be the enemy of the effective.
- A strong disposition to thrive in ambiguity, taking initiative to create clarity and forward progress.
- A deep curiosity and energy for pulling the thread on hard questions.
- Experience in turning open questions and data into concise and insightful analysis.
- Highly effective written communication and presentation skills.
Deadline to apply: None. Applications will be reviewed on a rolling basis.
The expected salary range for this position is:
Annual Salary:
$220,000 — $315,000 USD
Logistics
Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience.
Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.
We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.
How we're different
We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact (advancing our long-term goals of steerable, trustworthy AI) rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.
The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.
Come work with us!
Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.
EPD Project Engineer - Safeguards & Security
Posted today
Job Description
At Fluor, we are proud to design and build projects and careers. We are committed to fostering a welcoming and collaborative work environment that encourages big-picture thinking, brings out the best in our employees, and helps us develop innovative solutions that contribute to building a better world together. If this sounds like a culture you would like to work in, you’re invited to apply for this role.
Fluor is a leading government contractor with a proven track record of delivering high‑value technical solutions around the world to U.S. government agencies such as the DOE, NNSA, the Department of Defense and the Intelligence Community.
Job Description
The ideal candidate for this position will be responsible for planning and performing work requiring sound technical/business judgment in the evaluation, organization, and execution of project management assignments worldwide. This role has the overall objective of managing and/or coordinating project activities that are in compliance with the contract and ensure the safety, quality, value, timeliness, and Fluor profitability of the completed project. At this level, this position may assume Project Engineer, Area Project Manager or Engineering management job assignment responsibilities on a medium size project, multiple small size projects or complex segment of a larger project, in compliance with the project needs or per directions provided by the Project Director, Project Manager or Engineering Manager.
• Perform essential project engineering functions involving monitoring of progress, preparation of procedures, documentation of communications and meetings, and identification/evaluation of project issues and problems
• Coordinate and/or manage the efforts of technical disciplines, vendors and licensors to ensure integrated and completed designs that meet project requirements and contractual obligations
• Support creation and coordination of overall project plans and schedules, and monitoring activities, progress, and milestones against the plans
• Support creation and coordination of project effort hours, and cost estimates and budgets, and monitoring progress and cost performance against these
• Coordinate the preparation, delivery and coordination of project deliverables, design documents, and bid packages
• Other duties as assigned
Basic Job Requirements
• Accredited four (4) year degree or global equivalent in engineering field of study and seven (7) years of work-related experience; a recognized professional certification or registration in the applicable field, if required; some locations may have additional or different qualifications in order to comply with local requirements
• Ability to communicate effectively with audiences that include but are not limited to management, coworkers, clients, vendors, contractors, and visitors
• Job related technical knowledge necessary to complete the job
• Ability to learn and apply knowledge of applicable local, state/province, and federal/national statutes and guidelines
• Ability to attend to detail and work in a time-conscious and time-effective manner
Other Job Requirements
• Proof of US Citizenship
• May support or participate in presentations to larger project audiences
• Participate in Fluor University courses for continued learning experiences
• Utilize knowledge management communities to capture, support and leverage relevant knowledge to enhance project execution
• Participate in vendor trade shows and become familiar with new technologies and industry business direction
Preferred Qualifications
• Must have Engineering/EPC design experience.
• Seven (7) years of experience in the engineering, procurement, fabrication, and construction/construction management (EPFC/CM) industry, including a minimum of one (1) successfully completed construction and/or commissioning field assignment
• Experience executing and managing risk assessments initiatives
• Experience in international locations and diverse cultural environments is recommended
• Experience in the performance of functional tasks on projects with a well developed understanding of procedures and interfaces
• Detailed knowledge of Fluor’s software tools and databases preferred
• Proficient at initiating and growing solid relationships with the client, vendors and suppliers while meeting the company business needs and goals
• Adaptable and able to maintain effectiveness in changing circumstances
• Ability to set and maintain high standards of performance with responsibility and accountability for successfully completing assignments and tasks
• Proactive in taking prompt and appropriate action to ensure objectives are accomplished and apply necessary follow-up to monitor progress and results of project tasks and assignments
• Analytical approach to problem solving and identifying potential solutions
• Technical and business writing skills
• Basic computer and software skills to include the use of word processing, email, spreadsheets, electronic presentations, and project management tools
• Certification in project management suggested, for example Project Management Professional (PMP)
We are an equal opportunity employer. All qualified individuals will receive consideration for employment without regard to race, color, age, sex, sexual orientation, gender identity, religion, national origin, disability, veteran status, genetic information, or any other criteria protected by governing law.
Benefits Statement: Fluor is proud to offer a comprehensive benefits package designed to promote employee health, wellness, and financial security. Our offerings include medical, dental and vision plans, EAP, disability coverage, life insurance, AD&D, voluntary benefit plans, 401(k) with a company match, paid time off (personal, bereavement, sick, holidays) for salaried employees, paid sick leave per state requirement for craft employees, parental leave, and training and development courses.
Market Rate Statement: The market rate for the role is typically at the mid-point of the salary range; however, variations in final salary are determined by additional factors such as the candidate’s qualifications, relevant years of experience, geographic location, internal pay equity, and prevailing market conditions for the specific role.
Notice to Candidates: Background checks are carried out as part of any conditional offer made, including (but not limited to & role dependent) education, professional registration, employment, references, passport verifications and Global Watchlist screening.
To be Considered Candidates: Must be authorized to work in the country where the position is located.
Salary Range: $118,500.00 - $213,500.00
Machine Learning Engineer, Safeguards Research
Posted today
Job Description
About Anthropic
Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About the role
The Safeguards Research Team conducts critical safety research and engineering to ensure AI systems can be deployed safely. As a Machine Learning Engineer on our team, you'll bridge the gap between research and engineering, developing robust end-to-end pipelines and ML systems that directly support our safety research initiatives. You'll work on building scalable infrastructure for evaluating safety systems, implementing efficient training pipelines for safeguards, and creating automated systems to help us understand and mitigate risks in advanced AI systems.
You bring both ML fundamentals and strong engineering practices to the team. You're comfortable training and fine-tuning models, have intuitions about hyperparameter optimization, and can implement efficient data processing pipelines. You take a pragmatic approach to ML engineering, preferring simple, effective solutions over complex ones. You'll collaborate closely with researchers to translate experimental concepts into production-quality ML systems that address both immediate safety challenges and support longer-term research initiatives.
While deep theoretical ML knowledge is beneficial, we value practical ML experience and the ability to implement reliable systems that improve research productivity.
Representative projects:
- Design and implement ML pipelines for training and evaluating safety classifiers and detection models
- Develop systems to fine-tune language models for specific safety evaluation tasks
- Build infrastructure for hyperparameter optimization and model selection across safety experiments
- Create efficient data processing pipelines that can handle large-scale model outputs and training datasets
- Develop tooling to automate the generation, analysis, and classification of jailbreak attempts
- Build evaluation frameworks that can systematically test model behaviors across safety dimensions
- Create flexible interfaces for researchers to experiment with different model architectures and training configurations
You may be a good fit if you:
- Have hands-on experience training and fine-tuning basic ML models
- Understand fundamental ML concepts like overfitting and regularization
- Have practical experience with improving and evaluating ML models
- Are proficient with ML frameworks (e.g., PyTorch, TensorFlow, JAX) and can implement custom training loops
- Have strong software engineering skills, particularly with Python
- Excel at building scalable data pipelines and ML infrastructure
- Are experienced with prompting and working with large language models
- Prefer implementing simple, reliable solutions over complex ones
- Are comfortable working in a fast-paced, collaborative research environment
- Care deeply about the impacts of AI
- Have implemented custom loss functions and evaluation metrics
- Have experience with experiment and evaluation tracking tools
- Have built systems that integrate training, evaluation, and deployment pipelines
- Have contributed to open-source machine learning or AI safety tools
Annual Salary: $320,000 - $30,000 USD
Logistics
Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience.
Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
Note: Currently, the team has a strong preference for candidates who are able to be based in the Bay Area. However, we remain open to any candidate who can travel to the Bay Area 25% of the time.
Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.
We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.
How we're different
We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact - advancing our long-term goals of steerable, trustworthy AI - rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.
Come work with us!
Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.
The expected salary range for this position is:
Annual Salary: 315,000 - 340,000 USD
The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.
Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process