Research Scientist, GenAI - Multimodal Audio (Speech, Sound and M...
META - Bellevue, Washington, US, 98009 3 days ago
The GenAI org at Meta builds industry leading LLM and multimodal generative foundation models, which sets the industry benchmark of open source founda...
Director, Global Marketing Programs
Twist Bioscience - San Francisco, California, United States, 94199 10 days ago
The Director, Global Marketing Programs will manage the global marketing programs team to drive growth by executing our outbound marketing strategy in...
Communications Specialist
Intel - San Jose, California, United States, 95123 21 hours ago
Job Details: Job Description: In Q4 2023, Intel announced Altera will be reported as a separate business unit beginning on January 1, 2024, with ongoi...
Toy Designer - Sanrio Plastic Toys
Jazwares - Culver City, California, United States, 90232 15 days ago
As the Product Designer in our fast growing Girls Product Design Team, you will successfully design and develop product for one of the hottest brands ...
Senior Toy Designer - Squish-a-Longs
Jazwares - Culver City, California, United States, 90232 3 days ago
As the Senior Product Designer, you will successfully design and develop innovative products for one of the hottest brands in the market! You will hav...
Thinker & Content Intern
Gorilla 76 - St Louis, Missouri, United States, 3 days ago
Gorilla 76 is looking for a part-time Content Intern to write content that drives results for industrial marketing programs. Writers at Gorilla 76 are...
Senior Specialist, Audience Engagement (Content Marketing)
The WNET Group - New York 6 days ago
At The WNET Group, we are looking for a creative, strategic thinker who can use a range of content marketing and communications tactics to increase vi...
Senior Marketing Manager IOT / Industrial Segment
Western Digital Capital - Milpitas, California, United States, 95035 13 days ago
Full-time. Business Function: Technical Product Marketing. Company Description: At Western Digital, our vision is to power global innovation and push the b...
Fire Protection - Service Designer
EMCOR Group - Harrisburg, Pennsylvania, US, 17124 2 days ago
Description. About Us: S.A. Comunale has been a local industry leader for end-to-end mechanical, fire protection and HVAC services for nearly 100 years. W...

Research Scientist, GenAI - Multimodal Audio (Speech, Sound and Music)

META - Bellevue, Washington, US, 98009

Overview

The GenAI org at Meta builds industry-leading LLM and multimodal generative foundation models, which set the industry benchmark for open source foundation models and enable many Meta products. The team conducts industry-leading research on multimodal generative foundation models with a focus on the audio modality (including speech, sound and music). It works closely with the language and vision research teams and collaborates with product teams to bring the results to billions of Meta users around the world.

Research Scientist, GenAI - Multimodal Audio (Speech, Sound and Music) Responsibilities

Full life-cycle research on multimodal generative foundation models with a focus on the audio modality, including bringing up ideas
Designing and implementing models and algorithms
Collecting and selecting training data, training / tuning / scaling the models, evaluating the performance, open sourcing and publication
Work together with collaborating teams (e.g. language and vision) to leverage each other and deliver the high-level goals.

Minimum Qualifications

Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
Solid track record of research in the audio (speech, sound, or music) or vision (image or video) domains. Can be publication records or unpublished industrial experience.
PhD degree in the related field with 3+ years of experience, or BS degree with 5+ years of industrial research experience in the related field.
Related research fields: audio (speech, sound, or music) generation, text-to-speech (TTS) synthesis, text-to-music generation, text-to-sound generation, speech recognition, speech / audio representation learning, vision perception, image / video generation, video-to-audio generation, audio-visual learning, audio language models, lip sync, lip movement generation / correction, lip reading, etc.
Proven knowledge in neural networks.
Experienced in one of the following popular ML frameworks: PyTorch, TensorFlow, JAX.
Experienced in Python programming language.
Solid communication skills.

Preferred Qualifications

Solid publication track record in related fields.
Solid experience in any of the following: audio dataset curation, model scaling, audio generation model evaluation.
Experienced in large-scale data processing.
Experienced in solving complex problems involving trade-offs, alternative solutions, cross functional collaboration, taking into account diverse points of view.
