This textbook considers statistical learning applications when interest centers on the conditional distribution of the response variable, given a set of predictors, and when it is important to characterize how the predictors are related to the response. This fully revised new edition includes important developments over the past 8 years. Consistent with modern data analytics, it emphasizes that a proper statistical learning data analysis derives from sound data collection, intelligent data management, appropriate statistical procedures, and an accessible interpretation of results. As in the first edition, a unifying theme is supervised learning that can be treated as a form of regression analysis. Key concepts and procedures are illustrated with real applications, especially those with practical implications. The material is written for upper undergraduate level and graduate students in the social and life sciences and for researchers who want to apply statistical learning procedures to scientific and policy problems. The author uses this book in a course on modern regression for the social, behavioral, and biological sciences. All of the analyses included are done in R with code routinely provided.
Les mer
Consistent with modern data analytics, it emphasizes that a proper statistical learning data analysis derives from sound data collection, intelligent data management, appropriate statistical procedures, and an accessible interpretation of results.
Les mer
Statistical Learning as a Regression Problem.- Splines, Smoothers, and Kernels.- Classification and Regression Trees (CART).- Bagging.- Random Forests.- Boosting.- Support Vector Machines.- Some Other Procedures Briefly.- Broader Implications and a Bit of Craft Lore.
Les mer
This textbook considers statistical learning applications when interest centers on the conditional distribution of the response variable, given a set of predictors, and when it is important to characterize how the predictors are related to the response. As a first approximation, this can be seen as an extension of nonparametric regression.This fully revised new edition includes important developments over the past 8 years. Consistent with modern data analytics, it emphasizes that a proper statistical learning data analysis derives from sound data collection, intelligent data management, appropriate statistical procedures, and an accessible interpretation of results. A continued emphasis on the implications for practice runs through the text. Among the statistical learning procedures examined are bagging, random forests, boosting, support vector machines and neural networks. Response variables may be quantitative or categorical. As in the first edition,a unifying theme is supervised learning that can be treated as a form of regression analysis.Key concepts and procedures are illustrated with real applications, especially those with practical implications. A principal instance is the need to explicitly take into account asymmetric costs in the fitting process. For example, in some situations false positives may be far less costly than false negatives.  Also provided is helpful craft lore such as not automatically ceding data analysis decisions to a fitting algorithm. In many settings, subject-matter knowledge should trump formal fitting criteria. Yet another important message is to appreciate the limitation of one’s data and not apply statistical learning procedures that require more than the data can provide.The material is written for upper undergraduate level and graduate students in the social and life sciences and for researchers who want to apply statistical learning procedures to scientific and policy problems. The author uses this book in a course on modern regression for the social, behavioral, and biological sciences. Intuitive explanations and visual representations are prominent. All of the analyses included are done in R with code routinely provided.
Les mer
“This book is an outstanding example of synthesizing theoretical knowledge with applications, mathematical notations with R code, and statistics with machine learning. It has relevant exercise sets and will be an excellent textbook for a broad range of quantitatively oriented students, specifically, for those specializing in data science or taking a course on statistical learning.” (Vyacheslav Lyubchich, Technometrics, Vol. 59 (4), November, 2017)“The book focuses on supervised learning techniques that can be viewed as a form of regression … . There are instructive problems at the end ... and examples with code in R to illustrate throughout. … This is a thought provoking book worthy of serious attention by machine learning practitioners.” (Peter Rabinovitch, MAA Reviews, July, 2017)“The intended audience includes advanced undergraduate and graduate students biostatistics in the fields of social science and life science, as well as researchers who want to apply statistical learning procedures to scientific and policy problems. … This is an excellent overview of statistical learning applications. It is strongly recommended to advanced researchers and statisticians particularly interested in the social and behavioral aspects of data analysis.” (Puja Sitwala, Doody's Book Reviews, January, 2017)
Les mer
Accessible discussion of statistical learning procedures for practitioners with real-world applications in the social and policy sciences Methods also of interest in the natural sciences and engineering Fully revised new edition with intuitive explanations and visual representation of underlying statistical concepts Includes supplementary material: sn.pub/extras
Les mer

Produktdetaljer

ISBN
9783319829692
Publisert
2018-06-16
Utgave
2. utgave
Utgiver
Vendor
Springer International Publishing AG
Høyde
235 mm
Bredde
155 mm
Aldersnivå
Upper undergraduate, P, 06
Språk
Product language
Engelsk
Format
Product format
Heftet

Forfatter

Biographical note

Richard Berk is Distinguished Professor of Statistics Emeritus from the Department of Statistics at UCLA and currently a Professor at the University of Pennsylvania in the Department of Statistics and in the Department of Criminology. He is an elected fellow of the American Statistical Association and the American Association for the Advancement of Science and has served in a professional capacity with a number of organizations such as the Committee on Applied and Theoretical Statistics for the National Research Council and the Board of Directors of the Social Science Research Council. His research has ranged across a variety of applications in the social and natural sciences.