Authors: Tony Ojeda, Sean Patrick Murphy, Benjamin Bengfort, Abhijit Dasgupta
Publisher: Packt Publishing
Pages: 448
Publication date: 2014
Language: English
Format: pdf
Description:
89 hands-on recipes to help you complete real-world data science projects in R and Python
About This Book
- Learn about the data science pipeline and use it to acquire, clean, analyze, and visualize data
- Understand critical concepts in data science in the context of multiple projects
- Expand your numerical programming skills through step-by-step code examples and learn more about the robust features of R and Python
In Detail
As an increasing amount of data is generated each year, the need to analyze and operationalize it is more important than ever. Companies that know what to do with their data will have a competitive advantage over companies that don't, and this will drive a higher demand for knowledgeable and competent data professionals.
Starting with the basics, this book will cover how to set up your numerical programming environment, introduce you to the data science pipeline (an iterative process by which data science projects are completed), and guide you through several data projects in a step-by-step format. By sequentially working through the steps in each chapter, you will quickly familiarize yourself with the process and learn how to apply it to a variety of situations with examples in the two most popular programming languages for data analysis—R and Python.