Dolby

Sr Foundational AI Researcher (Audio)

Dolby, Atlanta, Georgia, United States, 30383

Join the leader in entertainment innovation and help us design the future. At Dolby, science meets art, and high tech means more than computer code. As a member of the Dolby team, you'll see and hear the results of your work everywhere, from movie theaters to smartphones. We continue to revolutionize how people create, deliver, and enjoy entertainment worldwide. To do that, we need the absolute best talent. We're big enough to give you all the resources you need, and small enough so you can make a real difference and earn recognition for your work. We offer a collegial culture, challenging projects, and excellent compensation and benefits, not to mention a

Flex Work

approach that is truly flexible to support where, when, and how you do your best work.

The

Advanced Technology Group

(ATG) is the research division of the company. ATG's mission is to look ahead, deliver insights, and innovate technological solutions that will fuel Dolby's continued growth. Our researchers have a broad range of expertise related to computer science and electrical engineering, such as AI/ML, algorithms, digital signal processing, audio engineering, image processing, computer vision, data science & analytics, distributed systems, cloud, edge & mobile computing, computer networking, and IoT.

Dolby is looking for a talented Senior AI Researcher to join Dolby's research efforts to develop the next generation of AI based audio and video technologies. You will work with Dolby's world-class audio and vision experts to invent new multimedia analysis, processing and rendering technologies. As a part of a global team, the Senior AI Research Engineer will work on ideas exploring new horizons in audio processing, analysis, replay and organization. You will be responsible for performing fundamental new research, transferring technology to product groups, and draft patent applications.

Summary

You will push the boundaries of the state-of-the-art in audio and media technologies. The ideal candidate would have a strong background in deep learning, both in terms of conceptual understanding, as well as practical experience. A core aspect of this role involves being able to keep up to date with the literature, implement, and innovate with the bleeding edge in generative models, self-supervised learning, and multi-modal learning.

With the explosion of large language models and natural language processing, you will partner closely with Dolby's Applied AI team, which actively pursues the integration of such models into audio and media experiences. You will be able to hit the ground running, innovate, and contribute to such projects. Consequently, experience with language models, question answering, vision-language models, captioning, etc. would be highly beneficial.

Main responsibilitiesPartner closely with other domain experts to refine and execute Dolby's technical strategy in artificial intelligence and machine learning.Use deep learning to create new solutions (including foundation models) and enhance existing applications.Push the state-of-the-art and develop intellectual property.Transfer technology to product groups and draft patent applications.Advise internal leaders on recent deep learning advancements in the industry and academia to further influence research direction and business decisions.

RequirementsPh.D. in Computer Science or similar field.A strong background in deep learning, both in terms of conceptual understanding, as well as practical experience.Knowledge in audio, video, or text processing is desirable.Strong publication record, with publications in major machine learning conferences (e.g. NeurIPS, ICLR, ICML). Publications in top domain-specific conferences is desirable (e.g., ACL, CVPR, ICASSP).Good knowledge about current machine learning literature.Highly skilled in Python and one or more popular deep learning frameworks (TensorFlow or PyTorch).Ability to envision new technologies and turn them into innovative products. Creativity.Good communication and collaboration skills.

Consequently, knowledge or experience in any/all of the following are helpful:Diffusion, autoregressive, or other generative models.Self-supervised, contrastive learning, auto-encoders.Audio, image, or text applications - Source separation, text-to-speech, music synthesis, image segmentation, image captioning, question answering, language models, etc.

The Atlanta Area base salary range for this full-time position is $129,700-$158,500, which can vary if outside this location,plus bonus, benefits, and some roles may also include equity. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, competencies, experience, market demands, internal parity, and relevant education or training. Your recruiter can share more about the specific salary range and perks and benefits for your location during the hiring process.

Dolby will consider qualified applicants with criminal histories in a manner consistent with the requirements of San Francisco Police Code, Article 49, and Administrative Code, Article 12

Equal Employment Opportunity:Dolby is proud to be an equal opportunity employer. Our success depends on the combined skills and talents of all our employees. We are committed to making employment decisions without regard to race, religious creed, color, age, sex, sexual orientation, gender identity, national origin, religion, marital status, family status, medical condition, disability, military service, pregnancy, childbirth and related medical conditions or any other classification protected by federal, state, and local laws and ordinances.