Big Data and Social Science (2nd Ed.)
Data Science Methods and Tools for Research and Practice

Chapman & Hall/CRC Statistics in the Social and Behavioral Sciences Series

Coordinators: Foster Ian, Ghani Rayid, Jarmin Ron S., Kreuter Frauke, Lane Julia

Language: English

68.67 €

In Print (Delivery period: 14 days).

Add to cartAdd to cart
Big Data and Social Science
Publication date:
· 19.1x23.5 cm · Paperback

166.30 €

In Print (Delivery period: 14 days).

Add to cartAdd to cart
Big Data and Social Science
Publication date:
· 19.1x23.5 cm · Hardback

Big Data and Social Science: Data Science Methods and Tools for Research and Practice, Second Editionshows how to apply data science to real-world problems, covering all stages of a data-intensive social science or policy project. Prominent leaders in the social sciences, statistics, and computer science as well as the field of data science provide a unique perspective on how to apply modern social science research principles and current analytical and computational tools. The text teaches you how to identify and collect appropriate data, apply data science methods and tools to the data, and recognize and respond to data errors, biases, and limitations.

Features:

  • Takes an accessible, hands-on approach to handling new types of data in the social sciences
  • Presents the key data science tools in a non-intimidating way to both social and data scientists while keeping the focus on research questions and purposes
  • Illustrates social science and data science principles through real-world problems
  • Links computer science concepts to practical social science research
  • Promotes good scientific practice
  • Provides freely available workbooks with data, code, and practical programming exercises, through Binder and GitHub

New to the Second Edition:

  • Increased use of examples from different areas of social sciences
  • New chapter on dealing with Bias and Fairness in Machine Learning models
  • Expanded chapters focusing on Machine Learning and Text Analysis
  • Revamped hands-on Jupyter notebooks to reinforce concepts covered in each chapter

This classroom-tested book fills a major gap in graduate- and professional-level data science and social science education. It can be used to train a new generation of social data scientists to tackle real-world problems and improve the skills and competencies of applied social scientists and public policy practitioners. It empowers you to use the massive and rapidly growing amounts of available data to interpret economic and social activities in a scientific and rigorous manner.

Introduction. Working with Web Data and APIs. Record Linkage. Databases. Scaling up through Parallel and Distributed Computing. Information Visualization. Machine Learning. Text Analysis. Networks: The Basics. Data Quality and Inference Errors. Bias and Fairness. Privacy and Confidentiality. Workbooks

Ian Foster, PhD, is a professor of computer science at the University of Chicago as well as a senior scientist and distinguished fellow at Argonne National Laboratory. His research addresses innovative applications of distributed, parallel, and data-intensive computing technologies to scientific problems in such domains as climate change and biomedicine. Methods and software developed under his leadership underpin many large national and international cyberinfrastructures. He is a fellow of the American Association for the Advancement of Science, the Association for Computing Machinery, and the British Computer Society. He earned a PhD in computer science from Imperial College London.

Rayid Ghani is a professor in the Machine Learning Department (in the School of Computer Science) and the Heinz College of Information Systems and Public Policy at Carnegie Mellon University. His research focuses on developing and using Machine Learning, AI, and Data Science methods for solving high impact social good and public policy problems in a fair and equitable way across criminal justice, education, healthcare, energy, transportation, economic development, workforce development and public safety. He is also the founder and director of the “Data Science for Social Good” summer program for aspiring data scientists to work on data mining, machine learning, big data, and data science projects with social impact. Previously Rayid Ghani was a faculty member at University of Chicago, and prior to that, served as the Chief Scientist for Obama for America (Obama 2012 Campaign).

Ron Jarmin, PhD, is the Deputy Director at the U.S. Census Bureau. He earned a PhD in economics from the University of Oregon and has published in the areas of industrial organization, business dynamics, entrepreneurship, technology and firm performance, urban economics, Big Data, data access and statistical disclosure avoidance. He oversees the