AI Model Optimization Engineer

ByteDanceSan Jose, CA
144d

About The Position

The Intelligent Creation - AI Platform team is a team focusing on building advanced end-to-end AI production pipelines, including deep learning model training, optimization, deployment and applications. We provide AI capabilities to empower content creation and consumption on TikTok and serve billions of users. We are seeking an experienced AI model optimization engineer with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal candidate will work at the cutting edge of AI efficiency, enhancing the performance, scalability, and deployment of large-scale generative AI models.

Responsibilities

  • Optimize AI model training and inference workflows to improve efficiency, speed, and scalability.
  • Develop and implement distributed training strategies to accelerate model convergence and reduce computational overhead.
  • Design and optimize inference pipelines for low-latency, high-throughput deployments across diverse hardware architectures.
  • Benchmark and profile deep learning models to identify performance bottlenecks and optimize computational resources.
  • Improve model parallelism and memory efficiency for large-scale AI models.
  • Research and implement state-of-the-art techniques in model compression, quantization, and pruning.
  • Collaborate with data scientists, production engineers, and infrastructure teams to ensure seamless integration of optimized models into production environments.
  • Stay up to date with the latest advancements in AI model efficiency, distributed computing, and hardware acceleration.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service