Software Engineer II (Backend + Data pipelines)

Scribd
161d$126,000 - $196,000

About The Position

We’re seeking a Software Engineer II with strong backend development experience and a passion for solving complex data challenges at scale. In this role, you’ll design, build, and optimize distributed systems that extract, enrich, and process metadata for a wide range of content. You’ll work closely with ML engineers, product managers, and cross-functional partners to integrate machine learning models and LLM-based services into production pipelines and deliver impactful, high-performance solutions. This role offers the opportunity to work on cutting-edge generative AI and metadata enrichment problems at a truly global scale.

Requirements

  • 4+ years of professional software engineering experience.
  • Proficiency in Python, Scala, Ruby, or similar languages.
  • Experience designing and building distributed systems at scale.
  • Hands-on experience building, deploying, and optimizing solutions using ECS, EKS, or AWS Lambda.
  • Experience with infrastructure-as-code tools like Terraform (or similar).
  • Experience working with a public cloud provider (AWS, Azure, or Google Cloud).
  • Familiarity with data processing frameworks like Spark or Databricks for large-scale workloads.
  • Proven ability to test, profile, and optimize systems for performance, scalability, and reliability.
  • Bachelor’s degree in Computer Science or equivalent professional experience.

Nice To Haves

  • Experience working with LLMs or integrating ML models into production systems.

Responsibilities

  • Design and build scalable systems to extract, enrich, and process metadata from millions of documents, images, and audio content.
  • Leverage LLMs to integrate capabilities like summarization, classification, extraction, and enrichment into metadata pipelines.
  • Collaborate with cross-functional teams, including ML engineers and product managers, to deliver scalable, efficient, and reliable metadata solutions.
  • Optimize and refactor existing systems for performance, scalability, and reliability.
  • Ensure data accuracy, integrity, and quality through automated validation and monitoring.
  • Participate in code reviews, ensuring best practices are followed and maintaining high-quality standards in the codebase.
  • Manage and maintain data pipelines, security and infrastructure.

Benefits

  • Healthcare Insurance Coverage (Medical/Dental/Vision): 100% paid for employees.
  • 12 weeks paid parental leave.
  • Short-term/long-term disability plans.
  • 401k/RSP matching.
  • Onboarding stipend for home office peripherals + accessories.
  • Learning & Development allowance.
  • Learning & Development programs.
  • Quarterly stipend for Wellness, WiFi, etc.
  • Mental Health support & resources.
  • Free subscription to the Scribd Inc. suite of products.
  • Referral Bonuses.
  • Book Benefit.
  • Sabbaticals.
  • Company-wide events.
  • Team engagement budgets.
  • Vacation & Personal Days.
  • Paid Holidays (+ winter break).
  • Flexible Sick Time.
  • Volunteer Day.
  • Company-wide Employee Resource Groups and programs that foster an inclusive and diverse workplace.
  • Access to AI Tools: We provide free access to best-in-class AI tools, empowering you to boost productivity, streamline workflows, and accelerate bold innovation.

Stand Out From the Crowd

Upload your resume and get instant feedback on how well it matches this job.

Upload and Match Resume

What This Job Offers

Job Type

Full-time

Career Level

Mid Level

Education Level

Bachelor's degree

Number of Employees

251-500 employees

© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service