Monday, March 4, 2013

The world's coolest machine learning internships - part 2

Continuing the great success of my last year blog post regarding machine learning internships last summer (more than 15,000 page views, posting in hacker news and reddit.com) I will start again collecting some of the interesting opportunities I hear about for this summer.

This is what I got from Mauricio Breternitz from AMD: AMD Research has exciting opportunities for interns who will be conducting research in one of key areas of Systems Research (Processor, Software stack, architecture) supporting the development of new systems and architectures.
Required skills and interests: - Programming in Java or C++, a scripting language such as Perl or other, Linux, Cloud computing - Hadoop, system-level performance analysis Microarchitecture and System-Level performance analysis: CPU utilization, disk utilization, IPC, cache behavior
Desired skills: Data mining and machine learning (ML) concepts; Text analytics (unstructured data), parsing XML or HTML and extracting Big data - Experience with Hadoop - RESTful APIs and related coding experience. Client device protocols and virtualization experience would be highly desirable. Multiple internship projects are available. Interested candidates please contact Mauricio Breternitz
 ( Mauricio.breternitz@amd.com )

Would you like to use data to solve one of the world's most important problems? Udemy is on a mission to democratize education. For us, that means two things: 1) Enabling the top experts in the world to teach any student, anywhere, and 2) Radically lowering the price point on a top quality education. With over 6,000 courses published and 600,000 students taking courses on Udemy, we're on our way, but we need your help! We are a technology company in our core, so we track every single bit of learning data. We are looking for amachine learning contractor / intern to use this data to work on problems like: Which courses are the best fit for a particular student? What make a student complete a course vs drop out? Who is most likely to enroll in a 2nd course? If you're interested, please shoot us an email at jobs@udemy.com with subject "Machine Learning Contractor". We'd love to hear from you.

I got the following open positions from Srinivassan Soundar: Health Informatics

They also have a few fulltime positions too.

Want to work on the largest scale music recommender and playlisting engine in the US? Pandora's playlist team is building the next innovations in recommendation algorithms that help hundreds of millions of listeners discover music they love. We're looking for 1-2 interns (preferably 2nd or 3rd year Ph.D. Students) who has a research interest in ML and are passionate about music. We are also open to a longer term research collaboration with universities. Potential topics include personalization, scalable algorithms, real time and effective recommendation measurement, etc. This position is already filled.


My collaborator Ted Willke sent me the following: We’d like to perform a comparative study of various graph-based programming models for machine learning algorithms this summer, looking at GraphLab, D4M, Galois, and possibly others. We’d love to have a smart and passionate graduate student join us for 3 months (minimum). Contact Ted: theodore.l.willke@intel.com

My friend Udi Weinsberg from Technicolor raised my attention that Technicolor are also looking for interns. Technicolor Palo Alto research lab studies personalized computing, data privacy and recommendation systems. You can apply here.

I got the following from Jan Neumann from Comcast, who is looking for Industry-leading research in audio/video information retrieval and content discovery technologies to help millions of households discover video and music content on their TV, PC, Phone, and Mobile devices. Comcast's Washington DC research lab is looking to fill 2-3 graduate student intern positions for this summer (minimum of 12 weeks, May through September) . Projects can focus on using Social Networks for Recommendations and Click-through Prediction, NLP for Voice-based Interfaces, and/or Video Search/Segmentation of premium video. Read more.


This is what I got from Grant Ingersoll, a well known Mahout contributor: LucidWorks, the leading commercial company for Apache Lucene and Solr, is looking for interns to work on building next generation search, analytics and machine learning technologies based on Apache Solr, Mahout, Hadoop and other cutting edge capabilities. This internship will be practically focused on working on real problems in search and machine learning as they relate to Lucid products and technologies as well as open source. Interested students (see eligibility below) should send their resume/profile, course work and evidence of open source activity (github account, ASF patches or other, etc.) to careers@lucidworks.com.

Walt Disney Animation Studios have a summer internship for improving the quality of animation data. Apply here.

Please note this position was already filled. I will post more positions as they come through..
I'm planning on hosting an intern for Summer 2013. The project will be related to online learning and large data, motivated by click-through-rate prediction for ads targeting. I'm also interested in the interaction between machine learning and auction mechanisms. The plan is to do something publishable with the goal of getting a paper out. The details of the project are flexible based on the skills and interests of the intern. Ideally I'm looking for someone who already has a strong background in online learning, optimization, or auction theory. Contact Brandan MacMahan.


The Systems department at the IBM T.J. Watson Research Center (Yorktown Heights, NY) has an opening for a Summer Research Intern.  The candidate will be conducting research in one of key areas of Systems Research that will lead the development of new systems.  The candidate will have theopportunity to work on real systems while pursuing innovative research of
both industrial and academic interest.
Required skills: - Programming in Java or C++, a scripting language such as Perl or other,
SQL - Performance tuning under Linux and at least one other Unix environment
Desired skills: - DBMS query performance tuning, implementing UDFs (user defined
functions) - Data mining and machine learning (ML) concepts; hands on experience with
R, SPSS, SAS - Fraud detection (models, algorithms, architectures for real-time fraud
detection) - Text analytics (unstructured data), parsing XML or HTML and extracting
data - Experience with Hadoop - RESTful APIs and related coding experience. Contact: yefim@us.ibm.com

No comments:

Post a Comment