Together AI's Posts (85)

2025-07-19

Lead Cloud Infrastructure Engineer

Together AI is hiring a Lead Cloud Infrastructure Engineer to own and operate the cloud foundation that powers our rapidly scaling data platforms. In this role, you will be the primary engineer responsible for defining, building, and maintaining the AWS infrastructure that underpins data engineering systems across the company — from internal analytics pipelines to customer-facing metering and billing systems. You will work closely with multiple data engineering teams, enabling them to move faster by building reliable, secure-by-default infrastructure they can depend on. You’ll partner with our dedicated security engineering team to ensure best practices around IAM, network design, and data lake access — while focusing on platform reliability, scalability, and developer experience. This is a high-ownership, hands-on engineering role. You’ll manage everything from Terraform modules and CI/CD pipelines to Lake Formation permissions and observability tools — with a mandate to build infrastructure that just works, and keeps working. Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $240,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy athttps://www.together.ai/privacy - Own the design, implementation, and operation of all AWS-based infrastructure supporting data systems - Build and maintain reproducible, well-documented Terraform infrastructure modules - Collaborate with multiple data engineering teams to understand and support their infrastructure needs - Implement and maintain IAM policies and Lake Formation permissions in partnership with the security engineering team - Build internal tooling and automation to support self-service infra provisioning - Monitor infrastructure health and cost, and drive continuous improvements in reliability and efficiency - Serve as the point person for infrastructure questions, issues, and requests across the company’s data platforms - 5+ years of experience in infrastructure, SRE, or DevOps roles - Deep experience with Terraform and AWS infrastructure design, including VPCs, IAM, S3, EC2, etc. - Experience with access control and permissions in AWS, including Lake Formation or similar - Experience supporting data infrastructure or working alongside data engineering teams (Kafka/Kinesis/ClickHouse a plus) - Familiarity with CI/CD for infrastructure (GitHub Actions, TeamCity, ArgoCD, etc.) - Proficiency in scripting or programming (Python, Go, or Bash preferred) - Comfortable working autonomously and supporting multiple stakeholders - Bonus: experience with Kubernetes, hybrid cloud/on-prem infrastructure, or cost optimization at scale

Location: San Francisco

Salary range: None - None

2025-07-19

Machine Learning Operations (MLOps) Engineer

Together AI is looking for an MLOps engineer who will develop systems and APIs that enable our customers to perform inference and fine tune LLMs. Relevant experience includes implementing runtime systems that perform inference at scale using AI/ML models from simple models up to the largest LLMs. Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $240,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy athttps://www.together.ai/privacy - 5+ years experience working on a production level ML training or inference system. - Bachelor’s degree in computer science or equivalent industry experience. - Strong understanding of the state of the art in machine learning especially LLMs. - Experience with DevOps practices like CI/CD, automation, containerization (Docker), and orchestration (Kubernetes). - Proficiency in cloud platforms like AWS, Google Cloud, or Azure. - Expertise in programming (Python, go, etc.) and frameworks for ML (TensorFlow, PyTorch, Scikit-learn). - Work closely with engineering, research, and sales on deploying, evaluating, and operating inference systems for both customers and internal use. - Build and maintain tools, services, and documentation for automation and testing. - Analyze and improve efficiency, scalability, and stability of various system resources. - Conduct design and code reviews. - Participate in an on-call rotation to respond to critical incidents as needed.

Location: San Francisco

Salary range: None - None

2025-07-19

Platform Engineer, Model ShapingNew

The Model Shaping team at Together AI works on products and research for tailoring open foundation models to downstream applications. We build services that allow machine learning developers to choose the best models for their tasks and further improve these models using domain-specific data. In addition to that, we develop new methods for more efficient model training and evaluation, drawing inspiration from a broad spectrum of ideas across machine learning, natural language processing, and ML systems. As a Platform Engineer at Model Shaping, you will work on the foundational layers of Together’s platform for model customization and evaluation. You will design the infrastructure and backend services that will allow us to sustainably and reliably scale the systems powering production workflows launched by our users, as well as internal research experiments. You will operate in a cross-functional environment, collaborating with other engineers and researchers in the team to improve the infrastructure based on the needs of projects they work on. You will also interact with other engineering teams at Together (such as Commerce, Data Engineering, and Cloud Infrastructure) to integrate the services developed by Model Shaping with systems developed by those teams. Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, RedPajama, SWARM Parallelism, and SpecExec. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure. We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is $200,000 - $290,000. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy athttps://www.together.ai/privacy - Design and build Together’s systems and infrastructure for model customization, including user-facing features and internal improvements - Contribute to reliability improvements for the platform, participating in an on-call rotation and improving processes for incident response - Create and improve internal tooling for deployment, continuous integration, and observability - Build a job orchestration platform spanning multiple data centers, supporting a highly heterogeneous hardware landscape - Partner with teams developing internal services, co-designing these services and incorporating them in systems built by Model Shaping - 3+ years of experience in building infrastructure or backend components of production services - Comfortable with the fundamentals of Linux environments and modern container/orchestration stacks (e.g., Docker and Kubernetes) - Strong software engineering background in Python or Go - Experienced with infrastructure automation tools (Terraform, Ansible), monitoring/observability stacks (Prometheus, Grafana), and CI/CD pipelines (GitHub Actions, ArgoCD) - Skilled with analyzing non-trivial issues of complex software systems and documenting your findings - Have cloud environment (e.g., AWS/GCP/Azure) administration experience, preferably with a hybrid bare-metal/cloud environment - Strong communication skills, willing to document systems and processes and collaborate with peers of varying technical expertise - Developing large-scale production systems with high reliability requirements - Pipeline orchestration frameworks (e.g., Kubeflow, Argo Workflows, Flyte) - Managing GPU workloads on HPC clusters, ideally with hands-on experience in operating NVIDIA’s networking stack (e.g., NCCL, Mellanox firmware, GPUDirect RDMA) - Deployment of services for AI training or inference - Maintaining or contributing to open-source projects

Location: San Francisco

Salary range: None - None

2025-07-19

Senior Backend Engineer - Commerce

Together AI is seeking a Senior Backend Engineer to shape, build, and scale the commerce platform that drives our Together’s Cloud products. As a member of the Commerce Engineering team, you will develop and work on mission-critical commerce capabilities including usage-based billing, payment processing, customer-facing analytics, and product entitlements. This role is for someone who thrives on solving complex challenges in distributed systems, and has expertise in backend API services, relational databases, and event-driven architectures for a rapidly scaling and commerce-intensive company. You will work across cloud-native services and globally distributed data centers to deliver high-performance, reliable solutions. Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy athttps://www.together.ai/privacy - 5+ years of demonstrated experience in building large scale, fault tolerant, distributed systems and API microservices - Experience designing, analyzing and improving efficiency, scalability, and stability of various system resources - Excellent understanding of low level operating systems concepts including multi-threading, memory management, networking and storage, performance and scale - Expert-level programmer in one or more of Golang, Rust, Python, Java, or TypeScript - Proficiency in writing and maintaining Infrastructure as Code (IaC) using tools like Terraform, AWS CDK, or Pulumi - Proficiency in version control practices and integrating IaC with CI/CD pipelines. - Experience with payment processors (e.g. Stripe) and billing systems a plus - Experience with Kubernetes, or containers a plus - Experience building and operating data infrastructure (Kinesis, Airflow, Kafka, etc) a plus - Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience - Identify, design, and develop foundational backend services that power Together’s commerce platform - Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure - Partner with product teams to understand functional requirements and deliver solutions that meet business needs - Write clear, well-tested, and maintainable software and IaC for both new and existing systems - Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance - Participate in an on-call rotation to address critical incidents when necessary

Location: San Francisco

Salary range: None - None

2025-07-19

Marketing Analytics and Ops Manager

We are seeking a highly analytical and process-oriented Marketing Analytics and Operations professional to join our dynamic marketing team. This role will report into the Revenue Strategy and Operations team. The ideal candidate will bridge the gap between marketing strategy and execution, leveraging data to optimize campaigns, streamline operations, and enhance overall marketing performance. Marketing Analytics: Marketing Operations: Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure. We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $170-210K OTE + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. This is a hybrid role based in the Bay Area. Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy athttps://www.together.ai/privacy - Develop, maintain, and automate marketing dashboards and reports that provide actionable insights into campaign performance, pipeline contribution, and ROI. - Analyze marketing data from various sources (CRM, marketing automation, web analytics, advertising platforms) to identify trends, opportunities, and areas for optimization. - Conduct deep-dive analyses on specific campaigns, channels, and segments to understand their impact and inform future strategies. - Define and track key marketing metrics (e.g., MQLs, SQLs, CAC, LTV, conversion rates, funnel velocity) and communicate performance to stakeholders. - Collaborate with sales operations and finance teams to ensure alignment on reporting, data definitions, and attribution models. - Forecast marketing performance and identify potential risks and opportunities based on data. - Manage and optimize our marketing technology stack (e.g., Salesforce, Pardot, Outreach, Google Analytics, BI tools). - Develop, implement, and optimize marketing processes, workflows, and best practices to improve efficiency and scalability. - Ensure data integrity and cleanliness within marketing systems, including lead routing, lead scoring, and data enrichment. - Oversee the implementation and maintenance of lead scoring models, nurturing programs, and segmentation strategies. - Manage audience segmentation and targeting efforts to ensure effective delivery of marketing messages. - Support campaign execution by providing technical assistance, list management, and performance tracking setup. - Document marketing operations processes and provide training to marketing team members on system usage and best practices. - Stay up-to-date with industry trends, marketing automation best practices, and new technologies. - Bachelor's degree in Marketing, Business, Statistics, Economics, Computer Science, or a related quantitative field. - 5-7 years of experience in Marketing Analytics, Marketing Operations, Business Intelligence, or a similar role, preferably in a B2B SaaS environment. - Strong proficiency with Marketing Automation Platforms (e.g., HubSpot, Marketo, Pardot). - Proven experience with data visualization tools (e.g., Tableau, Power BI, Google Data Studio, Looker) and creating impactful dashboards. - Excellent analytical skills with the ability to collect, organize, analyze, and disseminate significant amounts of information with attention to detail and accuracy. - Advanced Excel skills (e.g., pivot tables, VLOOKUPs, complex formulas). - Solid understanding of marketing funnel mechanics, lead management processes, and sales & marketing alignment. - Experience with SQL, R, or Python for data extraction and analysis is a strong plus. - Familiarity with web analytics platforms (e.g., Google Analytics, Adobe Analytics). - Strong project management skills and the ability to manage multiple priorities in a fast-paced environment. - Exceptional communication and interpersonal skills, with the ability to translate complex data into clear, actionable insights for non-technical stakeholders.

Location: San Francisco

Salary range: None - None

1 ... 3 4 5 6 ... 17