34,431 Distributed Systems jobs in the United States
Software Engineer, Distributed Systems
Posted 3 days ago
Job Viewed
Job Description
About the Team
The Compute Runtime team builds the low level framework components to power our ML training systems. We work on building robust, scalable, high performance components to support our distributed training workloads. Our priorities are to maximize the productivity of our researchers and our hardware, with the goal of accelerating progress towards AGI.
About the Role
As a Distributed Systems engineer, you will work to deliver powerful APIs orchestrating thousands of computers moving and persisting vast amounts of data. This requires both providing easy to use, introspectable systems that can promote a fast debugging and development cycle, while also enabling that experience to scale to our newest supercomputers maintaining stability and performance throughout.
We're looking for people who love optimizing an end to end system, understanding high performance I/O to maximize local performance and distributed across our supercomputers. We want someone excited by the rapid pace of responding to the dynamic and evolving needs of our training systems architectures.
This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.
In this role, you will:
Work across our Python and Rust stack
Profile and optimize and help design for scale our compute and data capabilities
Work on deploying our training framework to our latest supercomputers rapidly responding to the changing shapes and needs of the ML systems.
You might thrive in this role if you:
Have worked on large distributed systems
Love figuring out how systems work and continuously come up with ideas for how to make them faster while minimizing complexity and maintenance burden
Have strong software engineering skills and are proficient in Python and Rust or equivalent.
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
For additional information, please see OpenAI's Affirmative Action and Equal Employment Opportunity Policy Statement.
Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.
To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
OpenAI Global Applicant Privacy Policy
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
Compensation Range: $250K - $460K
Software Engineer, Distributed Systems
Posted 4 days ago
Job Viewed
Job Description
Mixpanel is an event analytics platform for builders who need answers from their data at their fingertips-no SQL required. When everyone in the organization can see and learn from the impact of their work on product, marketing, and company revenue metrics, they are poised to make better decisions.
Over 9,000 paid customers, including companies like Netflix, Pinterest, Sweetgreen, and Samsara, use Mixpanel to understand their customers and measure progress. Our commitment is to provide the most comprehensive and reliable analytics platform accessible and trusted by all.
We are actively recruiting for multiple Software Engineers across different levels for our org!
About the Role
Mixpanel is powered by a custom distributed database. This system ingests more than 1 Trillion user-generated events every month while ensuring end-to-end latencies of under a minute and queries typically scan more than 1 Quadrillion events over the span of a month. Over the last year, our inbound traffic has doubled. As our existing customers grow in volume and we add new ones, we expect this growth in traffic to continue. The Distributed Systems engineering teams are responsible for adding new capabilities and ensuring the smooth operation of the underlying systems.
Responsibilities
Mixpanel's infrastructure runs on Google Cloud Platform. We rely on Kubernetes and Docker for orchestration and containerization of our services. We primarily use Golang for writing services and all internal communication happens via GRPC. We use a combination of C and C++ wherever Golang doesn't meet our performance goals.
As an engineer on the Distributed Systems teams, you'll be responsible for:
- Working with other engineers to build distributed systems that can handle data at scale
- Debugging production issues across multiple services and all levels of our infrastructure stack
- Ensuring reliability and uptime of the services you're responsible for
- Keeping an eye on how much your service costs every month and removing inefficiencies wherever possible
- Improving engineering standards and holding a high bar for code quality and simplicity
- Pushing the boundaries on how our customers analyze their product data
- Most of the systems in our stack provide at least once semantics. As a result, we risk duplicating events that flow through them. To overcome this limitation, we added support for event deduplication that can work at our scale. Typical approaches for deduplication don't perform well on large amounts of data, so we had to do something highly custom for our stack. We wrote about this on our engineering blog here.
- Back in 2019, we migrated our ingestion API service from Python to Golang for better performance and type safety. We had to do this while ensuring that both systems handle data the same way. Because we had to compare, both, HTTP responses and transformed payloads, nothing out of the box worked for us. This blog post talks about how we did the actual migration without any customer visible downtime.
- In 2021, as our traffic grew almost 100%, the cost of storing data became untenable. Our engineers worked on an incremental way to eventually realize almost $3000 in savings per month.
We're Looking For Someone Who Has
We have openings across multiple Distributed Systems teams. We're looking for engineers who have:
- A strong grasp of computer science fundamentals when it comes to dealing with distributed systems and networks. You'll routinely run into issues where "one in a million" chances actually happen in production
- A knack for problem-solving and thinking from first principles. You don't shy away from any problem, no matter the scale or impact
- A bias towards shipping early and iterating. We believe in making small incremental changes to existing systems instead of large multi-quarter undertakings
- Engineering Life Page
- Tracking events at millisecond granularity
- Ensuring Data Consistency Across Replicas
- Saving $000 a month by improving Garbage Collection
- Strategies For Effective Data Compaction
- Monitoring Apache Kafka with JMX Exporter and Kafka Exporter
- Resharding petabytes of data to improve performance for our largest customers
Compensation
The amount listed below is the total target cash compensation (TTCC) and includes base compensation and variable compensation in the form of either a company bonus or commissions. Variable compensation type is determined by your role and level. In addition to the cash compensation provided, this position is also eligible for equity consideration and other benefits including medical, vision, and dental insurance coverage. You can view our benefits offerings here.
Our salary ranges are determined by role and level and are benchmarked to the SF Bay Area Technology data cut released by Radford, a global compensation database. The range displayed represents the minimum and maximum TTCC for new hire salaries for the position across all of our US locations. To stay on top of market conditions, we refresh our salary ranges twice a year so these ranges may change in the future. Within the range, individual pay is determined by experience, job-related skills, qualifications, and other factors. If you have questions about the specific range, your recruiter can share this information.
Mixpanel Compensation Range
191,000- 233,000 USD
Benefits and Perks
- Comprehensive Medical, Vision, and Dental Care
- Mental Wellness Benefit
- Generous Vacation Policy & Additional Company Holidays
- Enhanced Parental Leave
- Volunteer Time Off
- Additional US Benefits: Pre-Tax Benefits including 401(K), Wellness Benefit, Holiday Break
Culture Values
- Make Bold Bets: We choose courageous action over comfortable progress.
- Innovate with Insight: We tackle decisions with rigor and judgment - combining data, experience and collective wisdom to drive powerful outcomes.
- One Team: We collaborate across boundaries to achieve far greater impact than any of us could accomplish alone.
- Candor with Connection: We build meaningful relationships that enable honest feedback and direct conversations.
- Champion the Customer: We seek to deeply understand our customers' needs, ensuring their success is our north star.
- Powerful Simplicity: We find elegant solutions to complex problems, making sophisticated things accessible.
Why choose Mixpanel?
We're a leader in analytics with over 9,000 customers and 277M raised from prominent investors: like Andreessen-Horowitz, Sequoia, YC, and, most recently, Bain Capital. Mixpanel's pioneering event-based data analytics platform offers a powerful yet simple solution for companies to understand user behaviors and easily track overarching company success metrics. Our accomplished teams continuously facilitate our expansion by tackling the ever-evolving challenges tied to scaling, reliability, design, and service. Choosing to work at Mixpanel means you'll be helping the world's most innovative companies learn from their data so they can make better decisions.
Mixpanel is an equal opportunity employer supporting workforce diversity. At Mixpanel, we are focused on things that really matter-our people, our customers, our partners-out of a recognition that those relationships are the most valuable assets we have. We actively encourage women, people with disabilities, veterans, underrepresented minorities, and LGBTQ+ people to apply. We do not discriminate on the basis of race, religion, color, national origin, gender, gender identity or expression, sexual orientation, age, marital status, veteran status, or disability status. Pursuant to the San Francisco Fair Chance Ordinance or other similar laws that may be applicable, we will consider for employment qualified applicants with arrest and conviction records. We've immersed ourselves in our Culture and Values as our guiding principles for the impact we want to have and the future we are building.
Software Engineer (Distributed Systems)
Posted 4 days ago
Job Viewed
Job Description
Inngest is solving long standing developer problems in a novel way which means we're creating first-of-its-kind solutions. We're building tools that developers use every day in their own products.
The role
The Execution layer is the core of Inngest and the primary way in which users interact with the platform. As a software engineer on the Execution team, you'll curate the developer experience for every person using Inngest, and so must have a strong intuition for clean, idiomatic API design. DX is critical to Inngest, and achieving the ideal abstraction is key.
You'll work on the underlying execution engine and APIs that make orchestration, step functions, and events work, providing the building blocks for every developer to access durable, reliable code from anywhere in their stack.
You'll work with the systems team who build the underlying infrastructure that the executor sits on top of, and the console team, who build the core local and cloud product that gives visibility into how our functions execute.
Your work will directly impact millions of developers, and you'll collaborate with our designers, engineers, and founders to build the best experience possible.
This role is US based, the SF bay area is preferred, but not required.
What you'll do
- Architect and implement solutions in our execution layer and our core systems (eg. step APIs, orchestration, etc.).
- Plan and implement improvements on throughput and latency at hundreds of thousands to millions of requests per second.
- Contribute to systems architecture and infrastructure changes as we grow.
- Work in Golang, Typescript, Python, and/or other languages to help build and shape our SDKs.
- Collaborate with team members to track metrics and data across function runs, events, traces, and telemetry.
- Work with backend engineers to design APIs that can be used across the Inngest cloud dashboard, dev server and CLIs.
- Dogfood the Inngest product and develop ideas for improvements, features, or integrations.
- Communicate with our users through Github, email and Discord.
- Write technical specs for features and documentation for our users.
- 3+ years working on distributed systems.
- Experience with Go (Golang) in production.
- You've architected or been involved in designing systems that can handle massive-scale.
- Deep knowledge of Typescript, Python or other typed languages.
- You've used Redis and ClickHouse.
- Good understanding of gRPC and Protocol Buffers (protobuf).
- Experience contributing and managing open source, user-facing code.
- Backend: Go, Postgres, FoundationDB, Redis, ClickHouse, PubSub/Kafka, k8s
- APIs: gRPC internally, GraphQL and REST APIs for UI
- SDKs: TypeScript, Go, Python, Kotlin, more to come
- Hosted on AWS, GCP and Bare Metal
- Github, Linear, Discord (Community), Slack, Notion
Interview process
Here's what our hiring process for this role is like:
- Application . Please note: While we have several engineering roles open at times, we recommend applying to only one role. If during our review or interviews we think you'd be great for a different position, we'll re-route your application internally.
- Screen interview . An introductory call to share what it's like to work at Inngest and make sure our expectations are aligned.
- Technical positioning interview . Chat to one of our engineers to understand how your technical skills could fit into our team.
- Technical interview . A deeper interview with a couple of our engineers, focused on your past experience and problem-solving approach.
- Product/ collaboration interview . A chance to meet more of the team (including a founder) to talk about product mindset, and how we'd collaborate day-to-day.
Software Engineer (Distributed Systems)
Posted 4 days ago
Job Viewed
Job Description
Core (aka Browserbase Core Infrastructure) is the backbone of everything we do. This team keeps our browsers running at scale, solving massive distributed systems challenges and making sure our platform is fast, reliable, and scalable.
What you'll do
- Build, operate, and grow the Browserbase Core platform - designing and developing robust, scalable distributed backends with developer-friendly APIs.
- Work closely with the rest of Engineering, gathering input and providing great support so every team can build on Core with confidence.
- Help define, scope, and review key projects; set priorities on the roadmap; and sequence deliverables that push the platform forward.
- Establish and reinforce best practices around development, operations, and reliability.
- Continuously enhance the platform to meet rapidly expanding customer adoption and demand.
- Investigate, troubleshoot, and resolve operational issues that arise in production.
- Document as you go and share your knowledge with the team.
- Deep experience building and scaling distributed systems, with scale on the order of hundreds or thousands of instances.
- Strong expertise coding in Go or Typescript; bonus if you've touched Firecracker or similar VMMs.
- Familiarity with CI/CD pipelines, Kubernetes and Docker, message queues, relational databases, automated testing, performance optimization, and zero-downtime multi-region deployments.
- A systems-thinking mindset you understand how infrastructure choices ripple all the way up to customer experience.
- Have high agency: you can set direction, make judgment calls, and drive projects forward without waiting for perfect clarity.
- Have a strong sense of ownership and bias toward action.
- Can drive projects independently, with high accountability without much outside input.
- Communicate clearly in writing and in person, and choose the right medium for the message.
- Are adaptable and able to dive into unfamiliar systems, learn quickly, and make sound technical decisions.
- Enjoy collaborating with a small, ambitious team in a fast-paced environment.
- Are excited to work 5 days a week in our San Francisco HQ (or open to relocating here).
- Work on some of the hardest problems in modern infrastructure.
- Collaborate with a small, high-bar team of engineers who care deeply about quality.
- Build core technology that every Browserbase product depends on.
- Join an ambitious startup where your impact will be felt immediately.
Software Engineer, Distributed Systems
Posted 4 days ago
Job Viewed
Job Description
Replit is the fastest way to turn ideas into software. With our powerful AI-powered Agent and Assistant, anyone can create and launch apps from natural language in just one click. Build and deploy full-stack applications directly from your browser-no setup required. Never written a line of code in your life? No problem. Replit makes software creation accessible, collaborative, and lightning-fast. Join us in our mission to empower the next generation of builders.
About the role:
We are seeking talented distributed systems engineers who are passionate about building innovative solutions for application deployment. Your mission will be to enhance the capabilities of Replit Infrastructure, optimize performance across global regions, and drive efficiency while delivering an exceptional user experience. If you have a strong foundation in software development, a deep understanding of cloud technologies, and a track record of delivering high-quality code, we want to hear from you.
In this role you will:
- Expand Replit's cloud infrastructure offerings: Launch new cloud products to be used by Replit Agent to build complex apps. Collaborate with cross-functional teams to design and implement these features, empowering developers with a comprehensive suite of tools to build and deploy their applications efficiently.
- Enhance reliability and scalability: Identify bottlenecks, optimize critical paths, and implement robust monitoring and alerting systems. Work closely with the SRE team to ensure high availability and minimal downtime. Enable our customers to seamlessly scale their applications to meet the demands of their growing user base.
- Improve utilization of cloud infrastructure: Analyze our infrastructure costs and identify opportunities for optimization. Implement strategies to reduce cloud expenses without compromising performance or reliability. This could involve techniques such as resource provisioning, auto-scaling, cost-aware scheduling, and data lifecycle management. Your efforts will directly contribute to the financial efficiency of our cloud services.
- Distributed systems: Track record of working with platform-as-a-service, distributed storage, or information retrieval systems. Experience in designing scalable architectures and optimizing systems for latency or cost.
- Problem-solving mindset: Ability to approach complex challenges pragmatically and devise effective solutions. You think radically but ship incrementally.
- Self-directed and autonomous: Able to work independently, set priorities, and drive projects forward. You take ownership and initiative.
- Versatility and flexibility: Able to wear multiple hats and tackle a wide range of challenges. You are comfortable working across different layers of the stack and adapting to the needs of the project.
- Continuous learning and adaptability: Passionate about staying up-to-date with industry trends and expanding your skill set. You embrace change and adapt quickly.
- Experience working on cloud infrastructure or platform products, particularly in the areas of application deployment, serverless computing, or container orchestration.
- Familiarity with Google Cloud Platform (GCP) services and tools, such as GCE, GKE,, Cloud Run, or Cloud Storage.
- Contributions to open-source projects related to cloud technologies, deployment frameworks, or developer tools. We love OSS!
- Golang, Rust
- You are a generalist backend engineer who hasn't built scalable distributed systems.
- You cannot take part in the oncall rotation of min 6 people.
- You do not enjoy diving into Linux internals.
Competitive Salary & Equity
401(k) Program
Health, Dental, Vision and Life Insurance
Short Term and Long Term Disability
Paid Parental, Medical, Caregiver Leave
Commuter Benefits
Monthly Wellness Stipend
Autonoumous Work Environement
In Office Set-Up Reimbursement
Flexible Time Off (FTO) + Holidays
Quarterly Team Gatherings
In Office Amenities
Want to learn more about what we are up to?
- Meet the Replit Agent
- Replit: Make an app for that
- Replit Blog
- Amjad TED Talk
- Operating Principles
- Reasons not to work at Replit
To achieve our mission of making programming more accessible around the world, we need our team to be representative of the world. We welcome your unique perspective and experiences in shaping this product. We encourage people from all kinds of backgrounds to apply, including and especially candidates from underrepresented and non-traditional backgrounds.
This is a full-time role that can be held from our Foster City, CA office. The hybrid role has an in-office requirement of Monday, Wednesday, and Friday.
Software Engineer - Distributed Systems
Posted 4 days ago
Job Viewed
Job Description
Mux is video for developers. Our mission is to democratize video by solving the hard problems developers face when building video: video encoding and streaming (Mux Video), video monitoring (Mux Data), and more. Video is a huge part of people's lives, and we want to help make it better.
We're committed to building a healthy team that welcomes diverse backgrounds and experiences. We want people who care about our mission, are ready to grow, believe in our values (from Be Human to Turn Customers Into Fans), and want to improve the people around them.
You'll join a tight-knit team with experience at places like Google, YouTube, Twitch, Reddit, Zencoder, Fastly, and more. Our founders previously started (and sold) Zencoder, an early leader in cloud video technology, and authored Video.js, the biggest HTML5 video player on the web. We organize Demuxed, the premier conference for video engineers in the world.
We're backed by top investors like Coatue, Accel, Andreessen Horowitz, and Y Combinator. You'll get to work with amazing companies: hundreds of startups, plus Strava, Patreon, Vimeo, Robinhood, PBS, and Equinox. Customers, large and small, love working with us and love our team.
We are building something big together. We'd love to hear from you!
About the Role
As a Software Engineer at Mux, you will play a key role in building Mux's next-generation Video products that power delightful user experiences for millions worldwide.
You will lead and execute complex projects across our Video stack and infrastructure, handling hundreds of thousands of videos ingested and more than a billion encodes per month using our proprietary just-in-time transcoding architecture. You will also help chart the technical direction of our platform and product offerings and work closely with the rest of the engineering team to advance how we build software collaboratively.
What You'll Do
- Work cross-functionally with product, customer success, and other engineering teams to execute on product and business strategy and build cutting-edge Video products that our customers will love.
- Contribute to the full development cycle: technical design, development, test, experimentation, analysis, launch & on-call. You'll review code and design docs, give feedback on product specs, and run your code.
- Take accountability for the planning and delivery of projects, both as a hands-on contributor and architect and as a facilitator.
- Bring ideas and directly influence your team's roadmap, collaborating closely with cross-functional stakeholders, especially regarding Video features and functionality.
- Build & promote best practices in your team for availability, reliability, and production readiness.
- 3-6 years of experience in production Backend & Video Engineering using Golang, C, C++, or other similar languages, with a successful track record of contributing to sizable projects from start to finish with end-user impact.
- Expertise in building and operating distributed video systems in a service-oriented architecture, including best practices for fault tolerance, latency, and observability.
- A track record of writing high-quality, maintainable code across multiple services & team boundaries.
- Solid operational experience with Kubernetes, monitoring tools (we use Grafana & Prometheus), databases (we use CockroachDB, Clickhouse, & Redis) and data streaming technologies (we use Kafka).
- Excellent communication, collaboration, and problem-solving skills.
U.S. Benefits
You'd join an amazing team from places like Google/YouTube, Amazon/Twitch, Facebook/Oculus, Reddit, Brightcove, Bain, and the BBC. We have a supportive culture that cares about both excellent work and work-life balance. We are remote-equal, with office spaces in Downtown San Francisco, New York City, and London.
- Flexible PTO + 11 company holidays
- Weekly no-meeting days + quarterly focus weeks
- Healthy work-life balance encouraged
- Competitive health, dental, and vision insurance (100% employee and 65% dependent premium coverage)
- Fully funded fertility benefits
- HSA available, compatible with high deductible plan only ($100 per single employee/month & $200 per family/month employer contribution)
- FSA available
- Short-term and long-term disability insurance
- Group life insurance
- Travel accident insurance
- Employee Assistance Program (EAP)
- Medical support concierge service
- 401(k)
- Paid parental leave
- Investment in career growth through professional development stipend
- Reimbursements for headphones, cell phones, device upgrades, and SVoD services of Mux customers
- Lunch reimbursement program
Mux is an Equal Opportunity employer committed to building a diverse company. We believe diversity makes us better, and we strive to be inclusive and equitable. That's why we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status or disability status.
Software Engineer, Distributed Systems
Posted 4 days ago
Job Viewed
Job Description
What is Verse?
Organizations today are under growing pressure to navigate the transition to clean energy - not just to meet sustainability goals, but to manage risk, control costs, and build long-term resilience. Yet the complexity of energy markets and a lack of accessible tools have made it difficult for most companies to take meaningful action. Verse was created to change that.
Our mission is to make the case for clean energy irrefutable. Through our AI-powered platform, Aria, we help organizations plan, procure, and operate clean energy to achieve their financial and sustainability goals. Verse transforms clean energy ambition into action - giving businesses the clarity and confidence to lead in a rapidly evolving energy landscape.
The Role
As a Software Engineer focusing on Distributed Systems at Verse, you will work in collaboration with some of the brightest industry experts in the field building cloud-native applications that scale to trillions of data points collected from electricity markets globally. You will be a part of a dynamic, robust team primarily supporting the backend needs of our Aria software product spanning hundreds of data sources, sinks, services, and jobs.
Your expertise will not only have a direct impact on product decisions, but you also be well-positioned to drive the development and trajectory of our entire platform and infrastructure and influence important architectural decisions that affect the whole organization.
Key Responsibilities
- Foster a culture and mindset of well-designed systems, test-driven software, and transparent communication with a high caliber of mutual respect and consideration for stakeholders
- Read and write a lot of Go, Python, and Protobuf
- Build, test, debug, maintain, and scale our ecosystem of backend services and APIs with hyper-precise data structure alignment
- Design, implement, and coordinate critical application interface and database model changes, migrate and evolve schemas, and scale queries for multiple product work streams
- Interoperate with various storage and retrieval systems for both structured and unstructured data sets across the entire stack
- Troubleshoot and enhance system performance, reliability, and security to meet the evolving needs of the business
- Collaborate closely with product managers, UX designers, and frontend engineers to vet user requirements and data provenance with an end-to-end mindset
- Work on occasion in the platform and infrastructure layers to develop deployment automation, observability, monitoring, and alerting tools
- Actively participate in code reviews, maintain technical documentation, and adhere to best software development practices.
- Fluency with Python and Go language runtimes, build tools, and library ecosystems
- First principles knowledge of modern distributed systems and how they apply to the needs of a small startup and throughout its growth phases
- Proven expertise in managing and orchestrating container-based deployments and microservices
- Command of complex application build configurations and workflows in a mono-repository setting
- Strong understanding of service oriented architectures, networking, and security fundamentals
- Technical leadership and commitment to delivering high quality software on time or ahead of schedule and adhering to best software development practices
- A bachelor's degree or higher, ideally in Computer Science or some STEM related field
- Demonstrated track record for Senior or Staff level software development talent and up
- Hands-on experience with Kubernetes, Google Cloud Platform and its various services.
- Advanced programming skills in systems languages such as Rust, Scala, Java, or C/C++, development shell runtimes, and configuration languages
- Familiarity with the gRPC + Protocol Buffer ecosystem
- Experience deploying internet security solutions across engineering and non-engineering teams to protect employee resources
- General knowledge of neural network architectures and AI/ML landscape is a big plus
- Lead with Empathy: We lift each other up with humility and kindness, always putting colleagues and customers first
- Be Honest & Transparent: We prioritize effective communication to build trust with our team, customers, and stakeholders
- Move with Balance & Precision: We believe speed and perseverance must be accompanied by thoughtfulness and reflection
- Leave the World a Better Place: We are passionate about our mission, and we strive to create a sustainable world for future generations
$146,000 - $173,000
This is the estimated base salary range for this position, which does not include the value of benefits or a potential equity grant. A wide range of factors are considered in making compensation decisions, including but not limited to level, skill sets, market conditions, experience and training, licensure and certifications, and business and organizational needs.
Benefits and Employee Perks
- Competitive compensation and equity grant at a high-growth start up
- Comprehensive benefits package including medical, dental and vision insurance, and 401k
- Flexible hours and unlimited PTO
- Diverse and inclusive working environment
Verse is an equal opportunity employer. All applicants and employees are considered for hire, promotion, and compensation without regard to race, color, religion, sex, national origin, age, disability, sexual orientation, marital or familial status.
Be The First To Know
About the latest Distributed systems Jobs in United States !
Distributed Systems Software Engineer

Posted 16 days ago
Job Viewed
Job Description
Insight Global's top Telecom client is seeking a Sr. Software engineer to join their Labs organization. This group is focused on developing leading edge systems to solve complex problems related to their radio access and 5g network. The Sr. Engineer will be joining a specialized team developing a new platform that will solve for next generation use cases such as the viability of autonomous vehicles and drones. The Software engineer should have diverse skills surrounding backend, frontend, and systems development. C++, Python, JavaScript CUDA, SQL DBs, Linux,
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: and Requirements
5+ years' in JavaScript, C++, and/or Python C++ and distributed systems experience
Relational Databases, Web Server, Unix/Linux Exp
CUDA and GPU experience
Python/Machine Learning experience Experience with 3D Rendering and three.js frameworks
Principal Distributed Systems Engineer
Posted 4 days ago
Job Viewed
Job Description
Arcesium is a global financial technology firm that solves complex data-driven challenges faced by some of the world's most sophisticated financial institutions. We constantly innovate our platform and capabilities to meet tomorrow's challenges, anticipate the risks our clients encounter, and design advanced solutions to help our clients achieve transformational business outcomes.
Financial technology is a high-growth industry as change and innovation continue to disrupt the status-quo and prompt major transformation. Arcesium is at a particularly interesting time in our own growth as we look to leverage our successfully established market position and expand operations in pursuit of strategic new business opportunities. We value intellectual curiosity, proactive ownership, and collaboration with colleagues, and we empower you to meaningfully contribute from day one and accelerate your professional development.
Responsibilities
- Building next generation technology used by some of the most sophisticated hedge funds in the world
- Designing exciting new products for our offering with best of breed distributed systems technologies
- Leading high-visibility engineering efforts on some of our data intensive core components
The ideal candidate will have a strong academic background in computer science and at least 3-5 years of relevant experience as a software engineer at a top startup or technology firm.
They must possess expertise in at least one of the following:
- Kafka or equivalent streaming technology
- Distributed cache/in memory data grids like Redis, Hazelcast, Ignite, or Memcached
- Columnar databases like Cassandra or HBase
- Hadoop MapReduce, Spark or Flink
- Java (or any other JVM language), Go, Haskell, or C#
- Relational Databases (SQL)
The expected annual base salary for this position is $200,000-$250,000. Our compensation package includes variable compensation in the form of a year-end bonus, guaranteed in the first year of hire, benefits including medical and prescription drug coverage, and 401k contribution matching.
Remote eligible states include: NY, NJ, MA, GA, MN, IL, FL, TX, OH, PA, CT, NC
Arcesium and its affiliates do not discriminate in employment matters on the basis of race, color, religion, gender, gender identity, pregnancy, national origin, age, military service eligibility, veteran status, sexual orientation, marital status, disability, or any other category protected by law. Note that for us, this is more than just a legal boilerplate. We are genuinely committed to these principles, which form an important part of our corporate culture, and are eager to hear from extraordinarily well qualified individuals having a wide range of backgrounds and personal characteristics.
Cloud - Distributed Systems Engineer
Posted 9 days ago
Job Viewed
Job Description
Bedrock Robotics is transforming the physical world with autonomy. While others debate the future of AI, we're deploying it. With a team that helped put self-driving cars on public roads at Waymo, scaled systems for Segment's $3.2B acquisition, and grew Uber Freight to $B in revenue, and with the backing of world-class advisors and investors from 8VC, Eclipse, Valor, Two Sigma, and many others, we are uniquely positioned to succeed in our optimistic vision of autonomy and industrial transformation.
We need you - exceptional autonomy is built on the highest scale and most complex infrastructure engineering. We are solving the challenges of scaled model training, data systems, observability, and real-time robotics.
As a passionate distributed systems engineer you will have immediate impact on the core mission, and will also work on novel problems at the intersection of infrastructure and robotics. We are looking for experienced software engineers / TLs at all stages of their career experienced in large scale distributed systems or big data.
Some of the challenges our team takes on include:
- Build the foundation for large scale end to end model training.
- Running inference of petabytes of video and lidar data.
- Scaling out Simulation-based evaluation of our stack.
- Building the Data platform that writes, indexes, queries, billions of metrics and labels.
- Power semantic search and data analysis using state of the art VLMs.
- Stream large amounts of data from robots operating across the US.
- Enable hardware-in-the-loop testing and validation.
Also, you get to drive 100,000 lb excavators.
Our roles are often flexible. If you don't fit all the criteria, or are in another location (especially one where we have an office like SF of NY) please apply anyway! We'd love to consider you.
Join the team bringing advanced autonomy to the built world
At Bedrock, we've assembled one of the most experienced autonomous technology teams in the industry, with deep expertise scaling breakthroughs across transportation, infrastructure, and enterprise software. Our leaders helped put the first self-driving cars on public roads at Waymo, scaled systems for Segment's 3.2B acquisition, and grew Uber Freight to 5B in revenue.
While others debate the future of AI, we're deploying it in the real world. Our systems are already installed on heavy machines across the country, learning on real construction sites and working to reshape the earth with survey-grade precision and exceptional safety. This isn't a simulation-it's autonomous intelligence working on billion-dollar infrastructure projects.
In just over a year, we've raised 80M, put our equipment into the field, and established partnerships with forward-thinking contractors who are integrating our technology into their operations. We're working quickly to close the gap between America's surging demand for housing, data centers, manufacturing hubs, and the construction industry's growing labor shortage.
Here, algorithms meet steel-toed boots. You'll collaborate with both construction veterans and experienced engineers, tackling problems where your work directly impacts how the physical world get built. If you're interested in applying cutting-edge technology to solve meaningful problems alongside a talented team-we'd love to have you join us.