Statistics for Data Science and Analytics

Statistics for Data Science and Analytics

by: Peter C. Bruce (Author),Peter Gedeck(Author),Janet Dobbins(Author)&0more

Publisher: Wiley

Edition: 1st

Publication Date: 2024/9/4

Language: English

Print Length: 384 pages

ISBN-10: 139425380X

ISBN-13: 9781394253807

Book Description

Introductory statistics textbook with a focus on data science topics such as prediction, correlation, and data explorationStatistics for Data Science and Analytics is a comprehensive guide to statistical analysis using Python, presenting important topics useful for data science such as prediction, correlation, and data exploration. The authors provide an introduction to statistical science and big data, as well as an overview of Python data structures and operations. A range of statistical techniques are presented with their implementation in Python, including hypothesis testing, probability, exploratory data analysis, categorical variables, surveys and sampling, A/B testing, and correlation. The text introduces binary classification, a foundational element of machine leaing, validation of statistical models by applying them to holdout data, and probability and inference via the easy-to-understand method of resampling and the bootstrap instead of using a myriad of “kitchen sink” formulas. Regression is taught both as a tool for explanation and for prediction. This book is informed by the authors’ experience designing and teaching both introductory statistics and machine leaing at Statistics.com. Each chapter includes practical examples, explanations of the underlying concepts, and Python code snippets to help readers apply the techniques themselves. Statistics for Data Science and Analytics includes information on sample topics such as:Int, float, and string data types, numerical operations, manipulating strings, converting data types, and advanced data structures like lists, dictionaries, and setsExperiment design via randomizing, blinding, and before-after pairing, as well as proportions and percents when handling binary dataSpecialized Python packages like numpy, scipy, pandas, scikit-lea and statsmodels―the workhorses of data science―and how to get the most value from themStatistical versus practical significance, random number generators, functions for code reuse, and binomial and normal probability distributionsWritten by and for data science instructors, Statistics for Data Science and Analytics is an excellent leaing resource for data science instructors prescribing a required intro stats course for their programs, as well as other students and professionals seeking to transition to the data science field.

About the Author

Introductory statistics textbook with a focus on data science topics such as prediction, correlation, and data explorationStatistics for Data Science and Analytics is a comprehensive guide to statistical analysis using Python, presenting important topics useful for data science such as prediction, correlation, and data exploration. The authors provide an introduction to statistical science and big data, as well as an overview of Python data structures and operations. A range of statistical techniques are presented with their implementation in Python, including hypothesis testing, probability, exploratory data analysis, categorical variables, surveys and sampling, A/B testing, and correlation. The text introduces binary classification, a foundational element of machine leaing, validation of statistical models by applying them to holdout data, and probability and inference via the easy-to-understand method of resampling and the bootstrap instead of using a myriad of “kitchen sink” formulas. Regression is taught both as a tool for explanation and for prediction. This book is informed by the authors’ experience designing and teaching both introductory statistics and machine leaing at Statistics.com. Each chapter includes practical examples, explanations of the underlying concepts, and Python code snippets to help readers apply the techniques themselves. Statistics for Data Science and Analytics includes information on sample topics such as:Int, float, and string data types, numerical operations, manipulating strings, converting data types, and advanced data structures like lists, dictionaries, and setsExperiment design via randomizing, blinding, and before-after pairing, as well as proportions and percents when handling binary dataSpecialized Python packages like numpy, scipy, pandas, scikit-lea and statsmodels―the workhorses of data science―and how to get the most value from themStatistical versus practical significance, random number generators, functions for code reuse, and binomial and normal probability distributionsWritten by and for data science instructors, Statistics for Data Science and Analytics is an excellent leaing resource for data science instructors prescribing a required intro stats course for their programs, as well as other students and professionals seeking to transition to the data science field.

代发服务PDF电子书10立即求助
1111
打赏
未经允许不得转载:Wow! eBook » Statistics for Data Science and Analytics

觉得文章有用就打赏一下文章作者

支付宝扫一扫

微信扫一扫