At the request of many prospective participants, here is an update on our DSA (Data Science Apprenticeship). I have also added a few large data sets, new projects, and more material. Click here for details.

If you have already earned a data science certificate or diploma, but were never required to develop and use your own API in batch mode, and to harvest and work on a data set with at least 50 million observations in a distributed environment, then it's time to learn the real stuff that will land you a real job! Click here for a general overview of our apprenticeship.

We have published the data and source code for our big data keyword correlation API. Read the material and download the three files (and post a comment if you have questions; I'll reply ASAP): it will teach you how APIs work and how to write your first API from scratch! Our next API example will come with the source code of a web crawler, and will illustrate how to detect copyright infringement, or how to identify the original, first version of an article published in multiple news outlets (doing a better job than Google).

All the training material will be offered free to everyone. We have not yet put everything into a nice booklet, but some of the content is already available:

A few data sets available for download, from the following articles:

The following articles will be included in our curriculum, so you can start reading them now:

List of potential projects for students:

Starred items (*) are recent additions.