Foundations of data science with Python /
Shea, John M. (Professor of electrical engineering),
Foundations of data science with Python / John M. Shea. - First edition. - Boca raton: CRC Press, 2024. - 488 pages : illustrations (some color) ; 27 cm. - Chapman & Hall/CRC data science series .
Includes bibliographical references and index.
First simulations, visualizations, and statistical tests -- First visualizations and statistical tests with real data -- Introduction to probability -- Null hypothesis tests -- Conditional probability, dependence, and independence -- Introduction to Bayesian methods -- Random variables -- Expected value, parameter estimation, and hypothesis tests on sample means -- Decision making with observations from continuous distributions -- Categorical data, tests for dependence, and goodness of fit for discrete distributions -- Multidimensional data : vector moments and linear regression -- Working with dependent data in multiple dimensions.
"Foundations of Data Science with Python introduces readers to the fundamentals of data science, including data manipulation and visualization, probability, statistics, and dimensionality reduction. This book is targeted toward engineers and scientists, but it should be readily understandable to anyone who knows basic calculus and the essentials of computer programming. It uses a computational-first approach to data science: the reader will learn how to use Python and the associated data-science libraries to visualize, transform, and model data, as well as how to conduct statistical tests using real data sets. Rather than relying on obscure formulas that only apply to very specific statistical tests, this book teaches readers how to perform statistical tests via resampling; this is a simple and general approach to conducting statistical tests using simulations that draw samples from the data being analyzed. The statistical techniques and tools are explained and demonstrated using a diverse collection of data sets to conduct statistical tests related to contemporary topics, from the effects of socioeconomic factors on the spread of the COVID-19 virus to the impact of state laws on firearms mortality. This book can be used as an undergraduate textbook for an Introduction to Data Science course or to provide a more contemporary approach in courses like Engineering Statistics. However, it is also intended to be accessible to practicing engineers and scientists who need to gain foundational knowledge of data science"--
9781032346748 9781032350424
2023037553
Statistics--Data processing,
Probabilities--Data processing.
Information visualization.
Python (Computer program language)
QA276.4 / .S454 2024
519.502855133 SHE/Fou
Foundations of data science with Python / John M. Shea. - First edition. - Boca raton: CRC Press, 2024. - 488 pages : illustrations (some color) ; 27 cm. - Chapman & Hall/CRC data science series .
Includes bibliographical references and index.
First simulations, visualizations, and statistical tests -- First visualizations and statistical tests with real data -- Introduction to probability -- Null hypothesis tests -- Conditional probability, dependence, and independence -- Introduction to Bayesian methods -- Random variables -- Expected value, parameter estimation, and hypothesis tests on sample means -- Decision making with observations from continuous distributions -- Categorical data, tests for dependence, and goodness of fit for discrete distributions -- Multidimensional data : vector moments and linear regression -- Working with dependent data in multiple dimensions.
"Foundations of Data Science with Python introduces readers to the fundamentals of data science, including data manipulation and visualization, probability, statistics, and dimensionality reduction. This book is targeted toward engineers and scientists, but it should be readily understandable to anyone who knows basic calculus and the essentials of computer programming. It uses a computational-first approach to data science: the reader will learn how to use Python and the associated data-science libraries to visualize, transform, and model data, as well as how to conduct statistical tests using real data sets. Rather than relying on obscure formulas that only apply to very specific statistical tests, this book teaches readers how to perform statistical tests via resampling; this is a simple and general approach to conducting statistical tests using simulations that draw samples from the data being analyzed. The statistical techniques and tools are explained and demonstrated using a diverse collection of data sets to conduct statistical tests related to contemporary topics, from the effects of socioeconomic factors on the spread of the COVID-19 virus to the impact of state laws on firearms mortality. This book can be used as an undergraduate textbook for an Introduction to Data Science course or to provide a more contemporary approach in courses like Engineering Statistics. However, it is also intended to be accessible to practicing engineers and scientists who need to gain foundational knowledge of data science"--
9781032346748 9781032350424
2023037553
Statistics--Data processing,
Probabilities--Data processing.
Information visualization.
Python (Computer program language)
QA276.4 / .S454 2024
519.502855133 SHE/Fou