AUTOMATING DATA QUALITY MONITORING : going deeper than data observability /

The world's businesses ingest a combined 2.5 quintillion bytes of data every day. But how much of this vast amount of data--used to build products, power AI systems, and drive business decisions--is poor quality or just plain bad? This practical book shows you how to ensure that the data your o...

Full description

Saved in:
Bibliographic Details
Main Authors: Stanley, Jeremy (Author), Schwartz, Paige (Author)
Format: Electronic eBook
Language:English
Published: Sebastopol, CA : O'Reilly Media, Inc., [2024]
Subjects:
Online Access:CONNECT

MARC

LEADER 00000cam a22000007a 4500
001 in00006442125
006 m o d
007 cr |n|||||||||
008 240114s2024 cau o 000 0 eng d
005 20240130192525.7
035 |a 1WRLDSHRon1417195764 
040 |a YDX  |b eng  |c YDX  |d OCLCO  |d ORMDA 
020 |a 9781098145903  |q (electronic bk.) 
020 |a 1098145909  |q (electronic bk.) 
020 |z 1098145933 
020 |z 9781098145934 
035 |a (OCoLC)1417195764 
037 |a 9781098145927  |b O'Reilly Media 
050 4 |a HF5548.2 
082 0 4 |a 658.05  |2 23/eng/20240116 
049 |a TXMM 
100 1 |a Stanley, Jeremy,  |e author. 
245 1 0 |a AUTOMATING DATA QUALITY MONITORING :  |b going deeper than data observability /  |c by Jeremy Stanley and Paige Schwartz. 
260 |a Sebastopol, CA :  |b O'Reilly Media, Inc.,  |c [2024] 
300 |a 1 online resource 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
520 |a The world's businesses ingest a combined 2.5 quintillion bytes of data every day. But how much of this vast amount of data--used to build products, power AI systems, and drive business decisions--is poor quality or just plain bad? This practical book shows you how to ensure that the data your organization relies on contains only high-quality records. Most data engineers, data analysts, and data scientists genuinely care about data quality, but they often don't have the time, resources, or understanding to create a data quality monitoring solution that succeeds at scale. In this book, Jeremy Stanley and Paige Schwartz from Anomalo explain how you can use automated data quality monitoring to cover all your tables efficiently, proactively alert on every category of issue, and resolve problems immediately. This book will help you: Learn why data quality is a business imperative Understand and assess unsupervised learning models for detecting data issues Implement notifications that reduce alert fatigue and let you triage and resolve issues quickly Integrate automated data quality monitoring with data catalogs, orchestration layers, and BI and ML systems Understand the limits of automated data quality monitoring and how to overcome them Learn how to deploy and manage your monitoring solution at scale Maintain automated data quality monitoring for the long term. 
500 |a O'Reilly Online Learning Platform: Academic Edition (SAML SSO Access)  |5 TMurS 
650 0 |a Business  |x Data processing. 
700 1 |a Schwartz, Paige,  |e author. 
730 0 |a WORLDSHARE SUB RECORDS 
776 0 8 |i Print version:  |z 1098145933  |z 9781098145934  |w (OCoLC)1382625824 
856 4 0 |u https://go.oreilly.com/middle-tennessee-state-university/library/view/-/9781098145927/?ar  |z CONNECT  |3 O'Reilly  |t 0 
949 |a ho0 
994 |a 92  |b TXM 
998 |a wi  |d z 
999 f f |s 0daa8eb6-b098-4d7d-aac3-4cec00a4fd44  |i 87a9f121-bb3f-40c4-8e80-a15873596bce  |t 0 
952 f f |a Middle Tennessee State University  |b Main  |c James E. Walker Library  |d Electronic Resources  |t 0  |e HF5548.2   |h Library of Congress classification