Apply now »

Senior Software Engineer (Big Data Engineer in Machine Learning)

Location: 

Pune, MH, IN

Department:  Machine Learning/Ad Serving

 

PubMatic (Nasdaq: PUBM) is an independent technology company maximizing customer value by delivering digital advertising’s supply chain of the future.

 

PubMatic’s sell-side platform empowers the world’s leading digital content creators across the open internet to control access to their inventory and increase monetization by enabling marketers to drive return on investment and reach addressable audiences across ad formats and devices.

 

Since 2006, our infrastructure-driven approach has allowed for the efficient processing and utilization of data in real time. By delivering scalable and flexible programmatic innovation, we improve outcomes for our customers while championing a vibrant and transparent digital advertising supply chain.

 

Job Description

PubMatic is seeking a big data engineer who can work on building the next generation ML platform. The ideal candidate is a self-motivated problem solver with a strong background in big data tech stack, software design and development.

If you get excited about building a highly impactful machine learning platform which processes large datasets, in a creative and fast-paced open cultured environment, then you should consider applying for this position.

Responsibilities:  

  • Build, design and implement our highly scalable, fault-tolerant, highly available big data platform to process terabytes of data and provide customers with in-depth analytics. 
  • Developing Big Data pipelines using modern technology stack such as Spark, Hadoop, Kafka, HBase, Hive, Presto etc.
  • Develop ML based generic workflows, tools, which can be used by ML team for different projects
  • Doing POC to evaluate different tools which can be used to build a ML platform that enables ML engineers to write and manage their workflows with ease.
  • Develop robust & fault-tolerant systems and monitor implications of changes on data processing pipeline and performance. 
  • Leveraging a broad range of PubMatic’s data architecture strategies and proposing both data flows and storage solutions. 
  • Managing hadoop map reduce and spark jobs & solving any ongoing issues with operating the cluster. 
  • Expertise in developing Implementation of professional software engineering best practices for the full software development life cycle, including coding standards, performing code reviews, committing to Github, preparing documents in Confluence, continuous delivery using Jenkins, automated testing, and operations. 
  • Participate in Agile/Scrum processes such as sprint planning, sprint retrospective, backlog grooming, user story management, work item prioritization, etc

Qualifications

  • 3+ years of coding experience
  • Solid computer science fundamentals including data structure and algorithm design, and creation of architectural specifications. 
  • Expertise in developing Implementation of professional software engineering best practices for the full software development life cycle, including coding standards, code reviews, source control management, documentation, build processes, automated testing, and operations. 
  • A passion for developing and maintaining a high-quality code and test base and enabling contributions from engineers across the team. 
  • Expertise in big data technologies like Hadoop, Spark, Kafka, HBase etc. 
  • Experience in developing and delivering large-scale big-data pipelines, real-time systems & data warehouses would be preferred. 
  • Knowledge of basic Machine Learning modeling will be an added advantage. 
  • Experience with Airflow, mlflow will be preferred.
  • Demonstrated ability to achieve stretch goals in a very innovative and fast paced environment. 
  • Demonstrated ability to learn new technologies quickly and independently. 
  • Excellent verbal and written communication skills, especially in technical communications. 
  • Strong interpersonal skills and a desire to work collaboratively. 

#LI-DNI

 

Return to Office: PubMatic employees throughout the global have returned to our offices via a hybrid work schedule (3 days “in office” and 2 days “working remotely”) that is intended to maximize collaboration, innovation, and productivity among teams and across functions. 

 

Benefits: Our benefits package includes the best of what leading organizations provide, such as stock options, paternity/maternity leave, healthcare insurance, broadband reimbursement. As well, when we’re back in the office, we all benefit from a kitchen loaded with healthy snacks and drinks and catered lunches and much more!

 

Diversity and Inclusion: PubMatic is proud to be an equal opportunity employer; we don’t just value diversity, we promote and celebrate it. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Apply now »