Together AI's Posts (85)

Startup Account Executive

As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational discipline, and mature automation to our operating environments and codebase. You specialize in systems (operating systems, storage subsystems, networking), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems. Requirements Responsibilities About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy athttps://www.together.ai/privacy - 5+ years of professional SRE or related experience - Bachelor's degree in Computer Science or a related field or equivalent work experience - Expert knowledge of Ansible (roles, playbooks), Terraform, and Kubernetes - Proficiency in programming/scripting languages - Direct experience in monitoring and observability practices - Advanced knowledge of cloud services - Ability to thrive in a collaborative environment involving different stakeholders and subject matter experts - Be on an on-call (PagerDuty) rotation to respond to incidents that impact availability - Build and run our infrastructure with Ansible, Terraform, and Kubernetes to enable scaling to a massive number of concurrent users - Build monitoring systems to ensure the highest quality service for our customers - Design and implement operational processes (such as deployments and upgrades) - Debug production issues across all services and levels of the stack - Identify improvements for the product architecture from the reliability, performance and availability perspectives - Plan the growth of Together AI’s infrastructure

Location: San Francisco

Salary range: None - None

Startup Account Executive

As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational discipline, and mature automation to our operating environments and codebase. You specialize in systems (operating systems, storage subsystems, networking), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems. Requirements Responsibilities About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy athttps://www.together.ai/privacy - 7+ years of professional SRE or related experience - Bachelor's degree in Computer Science or a related field or equivalent work experience - Expert knowledge of Ansible (roles, playbooks), Terraform, and Kubernetes - Proficiency in programming/scripting languages - Direct experience in monitoring and observability practices - Advanced knowledge of cloud services - Ability to thrive in a collaborative environment involving different stakeholders and subject matter experts - Be on an on-call (PagerDuty) rotation to respond to incidents that impact availability - Build and run our infrastructure with Ansible, Terraform, and Kubernetes to enable scaling to a massive number of concurrent users - Build monitoring systems to ensure the highest quality service for our customers - Design and implement operational processes (such as deployments and upgrades) - Debug production issues across all services and levels of the stack - Identify improvements for the product architecture from the reliability, performance and availability perspectives - Plan the growth of Together AI’s infrastructure

Location: San Francisco

Salary range: None - None

Startup Account Executive

As a Senior Infrastructure Software Engineer, you will focus on automating infrastructure installations and decommissions at scale. You will build tools to constantly improve our scale and speed of deployment. You will nurture a passion for an “automate everything” approach that makes systems failure-resistant and ready-to-scale. Your work will enable our partners to bring up new data centers for AI and replace servers and networking in existing data centers as quickly and efficiently as possible without impacting running services. You will also review hardware changes, plan deployments, and aggressively execute to expand our network. The ideal candidate has a passionate curiosity about how the Internet, GPUs, and computers fundamentally work and has a strong knowledge of Linux and AI or GPU hardware. We require strong coding ability in Python, Go, or similar languages. This is a highly visible position that requires deep technical understanding of datacenter infrastructure, physical and logical networking, Linux, and basic experience with project management. Requirements Responsibilities About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy athttps://www.together.ai/privacy - 5 years of relevant Development experience - Intermediate level software development skills in Python, Go, or similar - Linux systems administration experience - Strong skills in network services, including REST APIs and HTTP - Strong tooling and automations development experience - Network fundamentals DHCP, ARP, subnetting, routing, firewalls, IPv6 - Experience with configuration management and infrastructure-as-code systems such as Saltstack, Chef, Puppet, Ansible, or Terraform - Experience with continuous / rapid release engineering - Experience working in a 24/7/365 service environment - Familiarity with day-to-day tasks and projects common in Data Center Operations - Excellent understanding of low level operating systems concepts including multi-threading, memory management, networking and storage, performance, and scale - Experience with Kubernetes and containerization, VPNs, AI workloads, and blockchain based protocols a plus - Deep knowledge of network engineering and protocols used in data center switching and routing, Internet routing, and optical line systems a plus - GPU programming, NCCL, CUDA knowledge a plus - Experience with PyTorch or Tensorflow a plus - Aggressively seek opportunities to introduce cutting-edge technology and automation solutions that are effective, efficient, and scalable in order to improve our ability to deploy and maintain our global infrastructure - Provisioning, monitoring, and maintaining hardware, software, and network in new data centers - Perform architecture and research work for decentralized AI workloads - Work with vendors to obtain, debug, and maintain the most efficient and effective next-generation hardware and software for Together AI’s workloads - Collaborate with Together AI’s partners to make informed decisions about hardware strategy - Plan and implement network and server installations, including in the areas of facility power (AC/DC), cooling, security/access, rack layout, and cable management - Provide technical leadership and guidance during deployment activities - Create and maintain documentation, plans, SOP’s, MOP’s, etc. - Communicate your results and updates through blog posts, internal talks, and tickets

Location: San Francisco

Salary range: None - None

Startup Account Executive

As the first SDET at Together AI, you will be a key player in setting a high quality bar for our users and customers. Your primary focus will be on designing and implementing automated testing processes using Python, Golang, or TypeScript. We’re looking for a leader with experience working closely with a group of stakeholders and engineers, defining test strategies and executing test plans, and ensuring the overall quality of the products we have at Together AI. Requirements Responsibilities About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy athttps://www.together.ai/privacy - Bachelor's degree in Computer Science, Software Engineering, or a related field or 5+ years of industry experience - Proven experience as a QA Automation Engineer, an SDET, or a similar role in a software development environment - Strong knowledge of automation testing methodologies, tools, and best practices - Proficiency in Python, Golang, or TypeScript for test automation - Familiarity with AI and machine learning concepts is a plus - Excellent problem-solving skills and attention to detail - Strong communication and collaboration skills - Self-motivated and adaptable in a fast-paced startup environment - Passionately committed to ensuring the highest standards of software quality and dedicated to delivering top-notch products to our users - Develop a sustainable test automation strategy and drive accountability and ownership across relevant teams to maintain these practices - Identify project needs and establish QA best practices and processes that take into account the team's resources, roadmap, and quality standards - Hold teams accountable for upholding quality and user impact as a factor in decisions - Create and maintain robust test automation frameworks using Cypress to increase test efficiency and coverage - Write, maintain, and execute automated test scripts for functionality, performance, and reliability testing - Identify and report software defects, track issues, and collaborate with developers to ensure timely resolution - Conduct automated regression testing to validate software changes and updates - Work closely with engineering and product teams to understand project requirements and align on testing goals - Document test automation processes, findings, and results for reference and reporting purposes - Stay current on emerging testing tools, best practices, and quality assurance trends - Implement process improvements and promote a culture of automation and quality

Location: San Francisco

Salary range: None - None

Startup Account Executive

As a senior front-end engineer, you will be responsible for developing user experiences that our customers will love. Your day to day activities will be hands-on developing software and contributing to design—working closely with leads, designers, and product managers to experiment and deliver experiences to our customers all over the world. You will develop and iterate on written technical proposals—outlining how solutions will be structured and developed. You will proactively identify opportunities and lead the development of solutions you’ve designed from the ground up through deployment into production and evaluate whether they were successful after release. You will identify and address performance bottlenecks within the application and the broader infrastructure. Requirements Responsibilities About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy athttps://www.together.ai/privacy - 5+ years of professional software development experience - Expert at building web applications using React, Next.js, and TypeScript - Experience with developer tool web applications - Experience with designing data intensive and highly responsive web applications - Design-oriented approach and a strong sense of aesthetics - Penchant for achieving the right balance between craft, speed, and business priorities - Deep understanding of usability issues in web applications - Excellent oral and written communication - Experience building web APIs with NodeJS, especially with document databases such as MongoDB - Ability to thrive in a collaborative environment involving different stakeholders and subject matter experts - Ability to write high-performance, reusable code for UI components, including appropriate testing - Create beautiful web interfaces and interactive experiences in Together AI’s playground using technologies such as React, TypeScript, and CSS - Build developer tools that allow novice to expert level users of AI systems to get the most of out of our platform and APIs - Scope, design, and lead technical projects, laying the groundwork for early-stage products to iteratively evolve and scale - Work closely with leads, designers, product managers, and engineers to design, develop, and launch features and product experiments - Build tools and frameworks that help us rapidly and reliably conduct experiments across different parts of the Together app - Debug production issues across services and multiple levels of the stack with an eye towards improving maintainability over the long term - Improve engineering standards, tooling, and processes - Participate in design meetings, hiring interviews, and code reviews - Ensure our UI components and libraries are reliable, secure, extensible and accessible - Translate high level designs into production-ready UI - Communicate effectively with stakeholders across Together when developing a solution; seek and incorporate diverse perspectives to address complex issues - As part of the Together.ai team, you will be a core member of the group that designs and delivers this platform and in the process creates equitable access to AI and computing

Location: San Francisco

Salary range: None - None

1 ... 15 16 17