Logo
Meta

Research Scientist Intern, AI Research - Speech & Audio (PhD) Job at Meta in

Meta, Menlo Park, CA, United States


Research Scientist Intern, AI Research - Speech & Audio (PhD)

Apply to this job

Location pin icon

Menlo Park, CA •Seattle, WA •San Francisco, CA + 2 more
- Hide

Apply to this job

The GenAI Speech team at Meta is currently looking for Research Scientist interns. Our team creates spoken language technology to make it faster and easier for people to build community and connect with others around the world. We conduct product-motivated research in ML/AI and design, develop and deploy state-of-the-art algorithms to the rest of Meta. We work in all aspects of AI for speech and audio processing, including speech recognition, speech synthesis, speaker identification, keyword spotting, noise robustness, multi-lingual systems, and speech with large language models (LLM). Our work powers voice interactions on AR/VR devices, such as Ray-Ban | Meta smart glasses and Quest 3 mixed-reality headsets, and video content understanding, including captioning and understanding of videos on Facebook and Instagram. As a Research Scientist Intern, you will help us develop innovative models and algorithms and apply them to large-scale production speech tasks. Our teams at Meta AI offer twelve (12) to twenty-four (24) weeks long internships and we have various start dates throughout the year. Internships are available in the Bay Area, CA and Seattle, WA

Research Scientist Intern, AI Research - Speech & Audio (PhD) Responsibilities

  • Perform research to advance the science and technology of intelligent machines.
  • Develop novel and accurate speech algorithms and systems, leveraging deep learning and machine learning on big data resources.
  • Contribute research that can be applied to Meta product development.
  • Analyze and improve efficiency, scalability, and stability of various deployed systems.
  • Collaborate with team members from prototyping to production.


Minimum Qualifications

  • Currently has, or is in the process of obtaining a PhD degree in the field of Computer Science, Artificial Intelligence, Natural Language Processing, or related field..
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment.
  • Experience in C/C++ and Python.
  • Experience in deep learning frameworks (PyTorch, Tensorflow, etc).
  • Research and/or work experience in machine learning, deep learning, and/or speech technology.


Preferred Qualifications

  • Experience manipulating and analyzing complex, high-volume, high-dimensionality data from varying sources.
  • Proven track record of achieving results as demonstrated by grants, fellowships, patents, as well as first-authored publications at workshops or conferences such as Interspeech, ICASSP or similar.
  • A strong interest in theoretical and empirical research and for answering hard questions with research.
  • Interpersonal experience: cross-group and cross-culture collaboration.
  • Ability to stay in touch with the literature of a particular domain and has the ability to reproduce results if needed.
  • Experienced with training deep neural networks for key Speech tasks such as speech recognition, speech translation, speech synthesis, speaker diarization, sentiment analysis, acoustic event recognition, wake word, scene understanding, etc.
  • Intent to return to a degree-program after the completion of the internship/co-op.


For those who live in or expect to work from California if hired for this position, please click here for additional information.

Locations

2

Use Ctrl and scroll to zoom the map

Zoom in

Zoom out

Re-centre

Data Center

About Meta

Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today-beyond the constraints of screens, the limits of distance, and even the rules of physics.

$7,800/month to $11,293/month + benefits

Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.

Equal Employment Opportunity and Affirmative Action

Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here .

Meta is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please reach out to accommodations-ext@fb.com .