Karkidi
Machine Learning Engineer Intern, Computer Vision Algorithm (Video Understanding
Karkidi, Cupertino, CA, United States
Content Summary: Machine Learning Engineer Intern, Computer Vision Algorithm (Video Understanding) at Cupertino, for Karkidi
The computer vision algorithm intern will work in a dynamic team as part of the Video Computer Vision org, which develops on-device computer vision and machine perception technologies across Apple’s products. We balance research and product to deliver the highest quality, state-of-the-art experiences, innovating through the full stack, and partnering with cross-functional teams to influence what brings our vision to life and into customers' hands.
Minimum Qualifications
During the internship, you must be enrolled in a M.S. or PhD program in Electrical Engineering/Computer Science or a related field (mathematics, physics, or computer engineering), with a focus on computer vision and/or machine learning.
Rich experiences in video machine learning covering one of the topics: Video Understanding / Video Foundation Model / Multi-modal LLM.
Proven prototyping skills and proficient in coding (C, C++, Python).
Excellent written and verbal communication skills, be comfortable presenting research to large audiences, and have the ability to work hands-on in multi-functional teams.
Preferred Qualifications
Publication record in relevant venues (e.g., NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, SIGGRAPH).
Industry experiences with multi-modal foundation models and frameworks.
Knowledge and understanding of generative AI, multi-modal large language models, and video captioning.
Solid understanding of the state-of-the-art in Video Understanding and familiarity with the challenges of developing algorithms that run efficiently on resource-constrained platforms.
Team-oriented, result-oriented, and self-motivated.
#J-18808-Ljbffr