Kapat
0 Ürün
Alışveriş sepetinizde boş.
Kategoriler
    Filtreler
    Preferences
    Ara

    R for Data Science: Import, Tidy, Transform, Visualize, and Model Data 2e

    Yayınevi : O'Reilly Media
    ISBN :9781492097402
    Sayfa Sayısı :576
    Baskı Sayısı :2
    Ebatlar :17.78 x 3.18 x 23.5 cm
    Basım Yılı :2023
    3200,00 ₺

    Bu ürün için iade seçeneği bulunmamaktadır.

    Tahmini Kargoya Veriliş Zamanı: Stoktan Teslim

    Use R to turn data into insight, knowledge, and understanding. With this practical book, aspiring data scientists will learn how to do data science with R and RStudio, along with the tidyverseâ??a collection of R packages designed to work together to make data science fast, fluent, and fun. Even if you have no programming experience, this updated edition will have you doing data science quickly.

    You'll learn how to import, transform, and visualize your data and communicate the results. And you'll get a complete, big-picture understanding of the data science cycle and the basic tools you need to manage the details. Updated for the latest tidyverse features and best practices, new chapters show you how to get data from spreadsheets, databases, and websites. Exercises help you practice what you've learned along the way.

    You'll understand how to:

    • Visualize: Create plots for data exploration and communication of results
    • Transform: Discover variable types and the tools to work with them
    • Import: Get data into R and in a form convenient for analysis
    • Program: Learn R tools for solving data problems with greater clarity and ease
    • Communicate: Integrate prose, code, and results with Quarto

     

    From the Publisher

    R for Data Science: Import, Tidy, Transform, Visualize, and Model Data

    From the Preface

    Introduction

    Data science is an exciting discipline that allows you to transform raw data into understanding, insight, and knowledge. The goals of R for Data Science are to help you learn the most important tools in R that will allow you to do data science efficiently and reproducibly and to have some fun along the way! After reading this book, you’ll have the tools to tackle a wide variety of data science challenges using the best parts of R.

    From the Preface

    Welcome to the second edition of R for Data Science (R4DS)! This is a major reworking of the first edition, removing material we no longer think is useful, adding material we wish we included in the first edition, and generally updating the text and code to reflect changes in best practices. We’re also very excited to welcome a new co-author: Mine Çetinkaya-Rundel, a noted data science educator and one of our colleagues at Posit (the company formerly known as RStudio).

     
     

    A brief summary of the biggest changes follows:

     

    • The first part of the book has been renamed to “Whole Game.” The goal of this section is to give you the rough details of the “whole game” of data science before we dive into the details.
    • The second part of the book is “Visualize.” This part gives data visualization tools and best practices a more thorough coverage compared to the first edition. The best place to get all the details is still the ggplot2 book, but now R4DS covers more of the most important techniques.
    • The third part of the book is now called “Transform” and gains new chapters on numbers, logical vectors, and missing values. These were previously parts of the data transformation chapter but needed much more room to cover all the details.
    • The fourth part of the book is called “Import.” It’s a new set of chapters that goes beyond reading flat text files to working with spreadsheets, getting data out of databases, working with big data, rectangling hierarchical data, and scraping data from websites.
    • The “Program” part remains but has been rewritten from top to bottom to focus on the most important parts of function writing and iteration. Function writing now includes details on how to wrap tidyverse functions (dealing with the challenges of tidy evaluation), since this has become much easier and more important over the last few years. We’ve added a new chapter on important base R functions that you’re likely to see in wild-caught R code.
    • The “Modeling” part has been removed. We never had enough room to fully do modeling justice, and there are now much better resources available. We generally recommend using the tidymodels packages and reading Tidy Modeling with R by Max Kuhn and Julia Silge (O’Reilly).
    • The “Communicate” part remains but has been thoroughly updated to feature Quarto instead of R Markdown. This edition of the book has been written in Quarto, and it’s clearly the tool of the future.

     

    What You Will Learn

    Data science is a vast field, and there’s no way you can master it all by reading a single book. This book aims to give you a solid foundation in the most important tools and enough knowledge to find the resources to learn more when necessary. Our model of the steps of a typical data science project looks something like the following:

     

    Editorial Reviews

    About the Author

    Hadley Wickham is Chief Scientist at RStudio and a member of the R Foundation. He builds tools (both computational and cognitive) that make data science easier, faster, and more fun. His work includes packages for data science (ggplot2, dplyr, tidyr), data ingest (readr, readxl, haven), and principled software development (roxygen2, testthat, devtools). He is also a writer, educator, and frequent speaker promoting the use of R for data science. Learn more on his homepage, http://hadley.nz.

    Mine Çetinkaya-Rundel is Professor of the Practice and the Director of Undergraduate Studies at the Department of Statistical Science and an affiliated faculty in the Computational Media, Arts, and Cultures program at Duke University as well as Educator at RStudio. Mine works on innovation in statistics and data science pedagogy, with an emphasis on computing, reproducible research, student-centered learning, and open-source education. At RStudio, Mine's work focuses primarily on education for open-source R packages as well as building resources and tools for educators teaching statistics and data science with R and RStudio. Mine has authored four undergraduate statistics textbooks as part of the OpenIntro projects, teaches the popular MOOC Statistics with R on Coursera and is the developer and maintainer of Data Science in a Box. Mine is a Fellow of the ASA and an Elected Member of the ISI as well as the recipient of the 2021 Robert V. Hogg Award For Excellence in Teaching Introductory Statistics, the 2018 Harvard Pickard Award, and the 2016 ASA Waller Education Award.

    Garrett Grolemund is the author of Hands-On Programming with R and co-author of R for Data Science and R Markdown: The Definitive Guide. He is Director of Learning at RStudio and holds a Ph.D. in Statistics, but specializes in teaching. He’s taught people how to use R at over 50 government agencies, small businesses, and multi-billion dollar global companies; and he’s designed RStudio’s training materials for R, Shiny, R Markdown and more. Garrett wrote the popular lubridate package for dates and times in R and creates the RStudio cheat sheets.
    Ürünler özellikleri
    Cilt Tipi Ciltsiz
    Kendi yorumunuzu yazın
    • Sadece kayıtlı kullanıcılar yorum yazabilir.
    • Kötü
    • Mükemmel

    Use R to turn data into insight, knowledge, and understanding. With this practical book, aspiring data scientists will learn how to do data science with R and RStudio, along with the tidyverseâ??a collection of R packages designed to work together to make data science fast, fluent, and fun. Even if you have no programming experience, this updated edition will have you doing data science quickly.

    You'll learn how to import, transform, and visualize your data and communicate the results. And you'll get a complete, big-picture understanding of the data science cycle and the basic tools you need to manage the details. Updated for the latest tidyverse features and best practices, new chapters show you how to get data from spreadsheets, databases, and websites. Exercises help you practice what you've learned along the way.

    You'll understand how to:

    • Visualize: Create plots for data exploration and communication of results
    • Transform: Discover variable types and the tools to work with them
    • Import: Get data into R and in a form convenient for analysis
    • Program: Learn R tools for solving data problems with greater clarity and ease
    • Communicate: Integrate prose, code, and results with Quarto

     

    From the Publisher

    R for Data Science: Import, Tidy, Transform, Visualize, and Model Data

    From the Preface

    Introduction

    Data science is an exciting discipline that allows you to transform raw data into understanding, insight, and knowledge. The goals of R for Data Science are to help you learn the most important tools in R that will allow you to do data science efficiently and reproducibly and to have some fun along the way! After reading this book, you’ll have the tools to tackle a wide variety of data science challenges using the best parts of R.

    From the Preface

    Welcome to the second edition of R for Data Science (R4DS)! This is a major reworking of the first edition, removing material we no longer think is useful, adding material we wish we included in the first edition, and generally updating the text and code to reflect changes in best practices. We’re also very excited to welcome a new co-author: Mine Çetinkaya-Rundel, a noted data science educator and one of our colleagues at Posit (the company formerly known as RStudio).

     
     

    A brief summary of the biggest changes follows:

     

    • The first part of the book has been renamed to “Whole Game.” The goal of this section is to give you the rough details of the “whole game” of data science before we dive into the details.
    • The second part of the book is “Visualize.” This part gives data visualization tools and best practices a more thorough coverage compared to the first edition. The best place to get all the details is still the ggplot2 book, but now R4DS covers more of the most important techniques.
    • The third part of the book is now called “Transform” and gains new chapters on numbers, logical vectors, and missing values. These were previously parts of the data transformation chapter but needed much more room to cover all the details.
    • The fourth part of the book is called “Import.” It’s a new set of chapters that goes beyond reading flat text files to working with spreadsheets, getting data out of databases, working with big data, rectangling hierarchical data, and scraping data from websites.
    • The “Program” part remains but has been rewritten from top to bottom to focus on the most important parts of function writing and iteration. Function writing now includes details on how to wrap tidyverse functions (dealing with the challenges of tidy evaluation), since this has become much easier and more important over the last few years. We’ve added a new chapter on important base R functions that you’re likely to see in wild-caught R code.
    • The “Modeling” part has been removed. We never had enough room to fully do modeling justice, and there are now much better resources available. We generally recommend using the tidymodels packages and reading Tidy Modeling with R by Max Kuhn and Julia Silge (O’Reilly).
    • The “Communicate” part remains but has been thoroughly updated to feature Quarto instead of R Markdown. This edition of the book has been written in Quarto, and it’s clearly the tool of the future.

     

    What You Will Learn

    Data science is a vast field, and there’s no way you can master it all by reading a single book. This book aims to give you a solid foundation in the most important tools and enough knowledge to find the resources to learn more when necessary. Our model of the steps of a typical data science project looks something like the following:

     

    Editorial Reviews

    About the Author

    Hadley Wickham is Chief Scientist at RStudio and a member of the R Foundation. He builds tools (both computational and cognitive) that make data science easier, faster, and more fun. His work includes packages for data science (ggplot2, dplyr, tidyr), data ingest (readr, readxl, haven), and principled software development (roxygen2, testthat, devtools). He is also a writer, educator, and frequent speaker promoting the use of R for data science. Learn more on his homepage, http://hadley.nz.

    Mine Çetinkaya-Rundel is Professor of the Practice and the Director of Undergraduate Studies at the Department of Statistical Science and an affiliated faculty in the Computational Media, Arts, and Cultures program at Duke University as well as Educator at RStudio. Mine works on innovation in statistics and data science pedagogy, with an emphasis on computing, reproducible research, student-centered learning, and open-source education. At RStudio, Mine's work focuses primarily on education for open-source R packages as well as building resources and tools for educators teaching statistics and data science with R and RStudio. Mine has authored four undergraduate statistics textbooks as part of the OpenIntro projects, teaches the popular MOOC Statistics with R on Coursera and is the developer and maintainer of Data Science in a Box. Mine is a Fellow of the ASA and an Elected Member of the ISI as well as the recipient of the 2021 Robert V. Hogg Award For Excellence in Teaching Introductory Statistics, the 2018 Harvard Pickard Award, and the 2016 ASA Waller Education Award.

    Garrett Grolemund is the author of Hands-On Programming with R and co-author of R for Data Science and R Markdown: The Definitive Guide. He is Director of Learning at RStudio and holds a Ph.D. in Statistics, but specializes in teaching. He’s taught people how to use R at over 50 government agencies, small businesses, and multi-billion dollar global companies; and he’s designed RStudio’s training materials for R, Shiny, R Markdown and more. Garrett wrote the popular lubridate package for dates and times in R and creates the RStudio cheat sheets.
    Ürünler özellikleri
    Cilt Tipi Ciltsiz
    WhatsApp