Intelligent data analysis : from data gathering to data comprehension /

"The new tool for analyses is?Intelligent Data Analysis (IDA)?. IDA can be defined as the use of specialized statistical, pattern recognition, machine learning, data abstraction, and visualization tools for analysis of data and discovery of mechanisms that created the data. Such data are typica...

Bibliographic Details
Other Authors: Gupta, Deepak, active 2015-2016 (Editor)
Format: Electronic eBook
Language: English
Published: Hoboken, NJ : John Wiley & Sons, Inc., 2020.
Series: The Wiley series in intelligent signal and data processing
Subjects: Data mining; Computational intelligence
Online Access: CONNECT

MARC

LEADER 00000cam a2200000 i 4500
001 in00006082435
006 m o d
007 cr |||||||||||
008 200210t20202020nju ob 001 0 eng
005 20220712182603.1
010 |a  2019056736 
035 |a 1WRLDSHRon1149370344 
040 |a DLC  |b eng  |e rda  |e pn  |c DLC  |d OCLCO  |d YDX  |d EBLCP  |d OCLCQ  |d UKAHL  |d OCLCF  |d DG1  |d YDX  |d STF  |d N$T  |d S2H  |d OCLCQ  |d OCLCA  |d DLC  |d OCLCO 
020 |a 9781119544487  |q (electronic book) 
020 |a 1119544483  |q (electronic book) 
020 |a 9781119544463  |q (electronic book) 
020 |a 1119544467  |q (electronic book) 
020 |a 1119544440  |q (electronic book) 
020 |a 9781119544449  |q (electronic bk.) 
020 |z 9781119544456  |q (hardcover) 
035 |a (OCoLC)1149370344 
042 |a pcc 
050 0 4 |a QA76.9.D343  |b I57435 2020 
082 0 0 |a 006.3/12  |2 23 
049 |a TXMM 
245 0 0 |a Intelligent data analysis :  |b from data gathering to data comprehension /  |c edited by Deepak Gupta, Siddhartha Bhattacharyya, Ashish Khanna, Kalpna Sagar. 
264 1 |a Hoboken, NJ :  |b John Wiley & Sons, Inc.,  |c 2020. 
264 4 |c ©2020 
300 |a 1 online resource 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
490 0 |a The Wiley series in intelligent signal and data processing 
504 |a Includes bibliographical references and index. 
520 |a "The new tool for analyses is 'Intelligent Data Analysis (IDA)'. IDA can be defined as the use of specialized statistical, pattern recognition, machine learning, data abstraction, and visualization tools for analysis of data and discovery of mechanisms that created the data. Such data are typically complex, meaning that they are characterized by many records, many variables, subtle interactions between variables, or a combination of all three. Engineering, computing sciences, database science, machine learning, and even artificial intelligence are bringing their powers to this newly born data analysis discipline. The main idea underlying the concept of Intelligent Data Analysis is extracting knowledge from a very large amount of data, with a very large amount of variables; data that represents very complex, non-linear, real-life problems. Moreover, IDA can help when starting from the raw data, coping with prediction tasks without knowing the theoretical description of the underlying process, classification tasks of new events based on past ones, or modeling the aforementioned unknown process. Classification, prediction, and modeling are the cornerstones that Intelligent Data Analysis can bring to us"--  |c Provided by publisher 
505 0 |a Cover -- Title Page -- Copyright -- Contents -- List of Contributors -- Series Preface -- Preface -- Chapter 1 Intelligent Data Analysis: Black Box Versus White Box Modeling -- 1.1 Introduction -- 1.1.1 Intelligent Data Analysis -- 1.1.2 Applications of IDA and Machine Learning -- 1.1.3 White Box Models Versus Black Box Models -- 1.1.4 Model Interpretability -- 1.2 Interpretation of White Box Models -- 1.2.1 Linear Regression -- 1.2.2 Decision Tree -- 1.3 Interpretation of Black Box Models -- 1.3.1 Partial Dependence Plot -- 1.3.2 Individual Conditional Expectation 
505 8 |a 1.3.3 Accumulated Local Effects -- 1.3.4 Global Surrogate Models -- 1.3.5 Local Interpretable Model-Agnostic Explanations -- 1.3.6 Feature Importance -- 1.4 Issues and Further Challenges -- 1.5 Summary -- References -- Chapter 2 Data: Its Nature and Modern Data Analytical Tools -- 2.1 Introduction -- 2.2 Data Types and Various File Formats -- 2.2.1 Structured Data -- 2.2.2 Semi-Structured Data -- 2.2.3 Unstructured Data -- 2.2.4 Need for File Formats -- 2.2.5 Various Types of File Formats -- 2.2.5.1 Comma Separated Values (CSV) -- 2.2.5.2 ZIP -- 2.2.5.3 Plain Text (txt) -- 2.2.5.4 JSON 
505 8 |a 2.2.5.5 XML -- 2.2.5.6 Image Files -- 2.2.5.7 HTML -- 2.3 Overview of Big Data -- 2.3.1 Sources of Big Data -- 2.3.1.1 Media -- 2.3.1.2 The Web -- 2.3.1.3 Cloud -- 2.3.1.4 Internet of Things -- 2.3.1.5 Databases -- 2.3.1.6 Archives -- 2.3.2 Big Data Analytics -- 2.3.2.1 Descriptive Analytics -- 2.3.2.2 Predictive Analytics -- 2.3.2.3 Prescriptive Analytics -- 2.4 Data Analytics Phases -- 2.5 Data Analytical Tools -- 2.5.1 Microsoft Excel -- 2.5.2 Apache Spark -- 2.5.3 Open Refine -- 2.5.4 R Programming -- 2.5.4.1 Advantages of R -- 2.5.4.2 Disadvantages of R -- 2.5.5 Tableau 
505 8 |a 2.5.5.1 How Tableau Works -- 2.5.5.2 Tableau Feature -- 2.5.5.3 Advantages -- 2.5.5.4 Disadvantages -- 2.5.6 Hadoop -- 2.5.6.1 Basic Components of Hadoop -- 2.5.6.2 Benefits -- 2.6 Database Management System for Big Data Analytics -- 2.6.1 Hadoop Distributed File System -- 2.6.2 NoSql -- 2.6.2.1 Categories of NoSql -- 2.7 Challenges in Big Data Analytics -- 2.7.1 Storage of Data -- 2.7.2 Synchronization of Data -- 2.7.3 Security of Data -- 2.7.4 Fewer Professionals -- 2.8 Conclusion -- References -- Chapter 3 Statistical Methods for Intelligent Data Analysis: Introduction and Various Concepts 
505 8 |a 3.1 Introduction -- 3.2 Probability -- 3.2.1 Definitions -- 3.2.1.1 Random Experiments -- 3.2.1.2 Probability -- 3.2.1.3 Probability Axioms -- 3.2.1.4 Conditional Probability -- 3.2.1.5 Independence -- 3.2.1.6 Random Variable -- 3.2.1.7 Probability Distribution -- 3.2.1.8 Expectation -- 3.2.1.9 Variance and Standard Deviation -- 3.2.2 Bayes' Rule -- 3.3 Descriptive Statistics -- 3.3.1 Picture Representation -- 3.3.1.1 Frequency Distribution -- 3.3.1.2 Simple Frequency Distribution -- 3.3.1.3 Grouped Frequency Distribution -- 3.3.1.4 Stem and Leaf Display -- 3.3.1.5 Histogram and Bar Chart 
590 |a O'Reilly Online Learning Platform: Academic Edition (SAML SSO Access) 
650 0 |a Data mining. 
650 0 |a Computational intelligence. 
700 1 |a Gupta, Deepak,  |d active 2015-2016,  |e editor. 
730 0 |a WORLDSHARE SUB RECORDS 
776 0 8 |i Print version:  |t Intelligent data analysis.  |d Hoboken, NJ, USA : Wiley, 2020  |z 9781119544456  |w (DLC) 2019056735 
856 4 0 |u https://go.oreilly.com/middle-tennessee-state-university/library/view/-/9781119544456/?ar  |z CONNECT  |3 O'Reilly  |t 0 
949 |a ho0 
994 |a 92  |b TXM 
998 |a wi  |d z 
999 f f |s 9f883daa-960f-4389-af4d-b96c345e09a2  |i 8164d7f3-0fc7-4e21-892a-0b405b2814bf  |t 0 
952 f f |a Middle Tennessee State University  |b Main  |c James E. Walker Library  |d Electronic Resources  |t 0  |e QA76.9.D343 I57435 2020  |h Library of Congress classification 
856 4 0 |3 O'Reilly  |t 0  |u https://go.oreilly.com/middle-tennessee-state-university/library/view/-/9781119544456/?ar  |z CONNECT