Together AI's Posts (85)

Startup Account Executive

Role As a Systems Research Engineer specialized in GPU Programming, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications. Working closely with the modeling and algorithm team, you will co-design GPU kernels and model architecture to enhance the performance and efficiency of our AI systems. Collaborating with the hardware and software teams, you will contribute to the co-design of efficient GPU architectures and programming models, leveraging your expertise in GPU programming and parallel computing. Your research skills will be vital in staying up-to-date with the latest advancements in GPU programming techniques, ensuring that our AI infrastructure remains at the forefront of innovation. Requirements Responsibilities About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy athttps://www.together.ai/privacy - Strong background in GPU programming and parallel computing, such as CUDA and/or Triton. - Knowledge of ML/AI applications and models - Knowledge of performance profiling and optimization tools for GPU programming - Excellent problem-solving and analytical skills - Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or equivalent practical experiences - Optimize and fine-tune GPU code to achieve better performance and scalability - Collaborate with cross-functional teams to integrate GPU-accelerated solutions into existing software systems - Stay up-to-date with the latest advancements in GPU programming techniques and technologies

Location: San Francisco

Salary range: None - None

Startup Account Executive

About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy athttps://www.together.ai/privacy

Location: San Francisco

Salary range: None - None

Startup Account Executive

Role Together AI is seeking a Machine Learning Engineer to join our Inference Engine team, focusing on optimizing and enhancing the performance of our AI inference systems. This role involves working with state-of-the-art large language models models and ensuring they run efficiently and effectively at scale. If you are passionate about AI inference, PyTorch, and developing high-performance systems, we want to hear from you. This position offers the chance to collaborate closely with AI researchers and engineers to create cutting-edge AI solutions. Join us in shaping the future at Together AI! Responsibilities Requirements About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society. Together, we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI. Our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey to build the next-generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance, and other competitive benefits. The US base salary range for this full-time position is $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level, and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunities to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. - Design and build the production systems that power the Together AI inference engine, enabling reliability and performance at scale. - Develop and optimize runtime inference services for large-scale AI applications. - Collaborate with researchers, engineers, product managers, and designers to bring new features and research capabilities to the world. - Conduct design and code reviews to ensure high standards of quality. - Create services, tools, and developer documentation to support the inference engine. - Implement robust and fault-tolerant systems for data ingestion and processing. - 3+ years of experience writing high-performance, well-tested, production-quality code. - Proficiency with Python and PyTorch. - Demonstrated experience in building high performance libraries and tooling. - Excellent understanding of low-level operating systems concepts including multi-threading, memory management, networking, storage, performance, and scale. - Preferred: Knowledge of existing AI inference systems such as TGI, vLLM, TensorRT-LLM, Optimum - Preferred: Knowledge of AI inference techniques such as speculative decoding. - Preferred: Knowledge of CUDA/Triton programming. - Nice to have: Knowledge of Rust, Cython and compilers.

Location: San Francisco

Salary range: None - None

Startup Account Executive

As an ML Engineer within the Fine-Tuning API team at Together AI, you will develop a platform that allows our users to customize open-source models with their data. You will collaborate with our product and research teams, developing features that will enable new use cases for the Fine-Tuning API. You will also work with the engineering team to ensure that our API is reliable and well-integrated into the company’s technical infrastructure. You will have an opportunity to build the foundational layer of the open-source AI ecosystem, letting developers all over the world efficiently create high-quality models tailored to their application scenarios. Key responsibilities You may be a good fit if you: Experience in any of the following will make you stand out: About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy athttps://www.together.ai/privacy - Build and improve Together’s systems and infrastructure for customizing open-source models - Create tooling and documentation to set our users up for success with the Fine-Tuning API - Make sure the service is stable and robust, participating in an on-call rotation and ensuring 24/7 availability of our platform - Analyze and increase the efficiency of fine-tuning methods, both from research and engineering perspectives - Develop new features and capabilities for the API based on latest research in AI - Have 2+ years of experience building and deploying machine learning-based services in a production environment - Have working knowledge of latest methods for fine-tuning LLMs or other AI models - Have a strong software engineering background in Python or Go - Are passionate about making reliable and convenient tools for software developers - Follow latest advances and trends in the Machine Learning community - Developing large-scale and high-load production systems - Implementation and design of advanced methods for efficient fine-tuning - Maintaining or contributing to open-source ML projects - Managing machine learning workloads on Kubernetes clusters

Location: San Francisco

Salary range: None - None

Startup Account Executive

Role Together AI is seeking a Distributed ML Systems Engineer to design and build scalable machine learning systems that power our accelerated AI initiatives. This role involves developing large-scale, fault-tolerant distributed systems that handle high-load and high-performance requirements. If you are passionate about designing ML systems that operate at scale and eager to create impactful solutions, we want to hear from you. This position offers the chance to work closely with our AI researchers and infrastructure teams to ensure our systems are robust and efficient. Join us in shaping the future at Together AI! Responsibilities Requirements About Together AI Together AI is a research-drven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society. Together, we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI. Our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey to build the next-generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance, and other competitive benefits. The US base salary range for this full-time position is $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level, and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunities to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. - Design and build large-scale, distributed machine learning systems that are fault-tolerant and high-performance. - Develop and optimize distributed processing frameworks and storage systems. - Collaborate with researchers, engineers, and product managers to integrate ML systems into our infrastructure. - Conduct architecture and design reviews to ensure best practices in system design. - Implement robust monitoring and logging systems to ensure the health and performance of our ML systems. - 3+ years of experience in building large-scale, fault-tolerant, high-performance distributed systems. - Strong programming skills in one or more of Python, Go, Rust, or C/C++. - Excellent understanding of low-level operating systems concepts including multi-threading, memory management, networking, and storage, performance, and scale. - Experience with cloud computing platforms (AWS, GCP, Azure etc.) and large-scale infrastructure. - Strong problem-solving skills and ability to work in a fast-paced environment. - Preferred: Experience with Kubernetes - Preferred: Experience with Pytorch

Location: San Francisco

Salary range: None - None

1 ... 13 14 15 16 17