Scikit-learn Cookbook : over 50 recipes to incorporate scikit-learn into every step of the data science pipeline, from feature extraction to model building and model evaluation /

If you're a data scientist already familiar with Python but not Scikit-Learn, or are familiar with other programming languages like R and want to take the plunge with the gold standard of Python machine learning libraries, then this is the book for you.

Saved in:
Bibliographic Details
Main Author: Hauck, Trent (Author)
Format: eBook
Language:English
Published: Birmingham, U.K. : Packt Publishing, 2014.
Subjects:
Online Access:CONNECT
CONNECT
LEADER 05530cam a2200589Ii 4500
001 mig00005349889
003 OCoLC
005 20210517062620.6
006 m o d
007 cr unu||||||||
008 141120s2014 enka o 001 0 eng d
019 |a 907297325 
020 |a 9781783989492  |q (electronic bk.) 
020 |a 1783989491  |q (electronic bk.) 
020 |z 1783989491 
020 |z 1783989483 
020 |z 9781783989485 
035 |a (OCoLC)896329131  |z (OCoLC)907297325 
035 0 0 |a ocm00000001wrldshrocn896329131 
037 |a CL0500000505  |b Safari Books Online 
040 |a UMI  |b eng  |e rda  |e pn  |c UMI  |d E7B  |d OCLCF  |d COO  |d DEBBG  |d OCLCQ  |d YDXCP  |d N$T  |d AGLDB  |d ICA  |d NOC  |d D6H  |d OCLCQ  |d VTS  |d CEF  |d STF  |d AU@  |d VT2  |d RDF 
049 |a TXMM 
050 4 |a Q325.5 
082 0 4 |a 641.5  |2 23 
100 1 |a Hauck, Trent,  |e author. 
245 1 0 |a Scikit-learn Cookbook :  |b over 50 recipes to incorporate scikit-learn into every step of the data science pipeline, from feature extraction to model building and model evaluation /  |c Trent Hauck. 
264 1 |a Birmingham, U.K. :  |b Packt Publishing,  |c 2014. 
300 |a 1 online resource (1 volume) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
500 |a "Quick answers to common problems." 
588 0 |a Online resource; title from cover (Safari, viewed November 17, 2014). 
500 |a Includes index. 
505 0 |a Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Premodel Workflow; Introduction; Getting sample data from external sources; Creating sample data for toy analysis; Scaling data to the standard normal; Creating binary features through thresholding; Working with categorical variables; Binarizing label features; Imputing missing values through various strategies; Using Pipelines for multiple preprocessing steps; Reducing dimensionality with PCA; Using factor analysis for decomposition 
505 8 |a Kernel PCA for nonlinear dimensionality reductionUsing truncated SVD to reduce dimensionality; Decomposition to classify with DictionaryLearning; Putting it all together with Pipelines; Using Gaussian processes for regression; Defining the Gaussian process object directly; Using stochastic gradient descent for regression; Chapter 2: Working with Linear Models; Introduction; Fitting a line through data; Evaluating the linear regression model; Using ridge regression to overcome linear regression's shortfalls; Optimizing the ridge regression parameter; Using sparsity to regularize models 
505 8 |a Taking a more fundamental approach to regularization with LARSUsing linear methods for classification -- logistic regression; Directly applying Bayesian ridge regression; Using boosting to learn from errors; Chapter 3: Building Models with Distance Metrics; Introduction; Using KMeans to cluster data; Optimizing the number of centroids; Assessing cluster correctness; Using MiniBatch KMeans to handle more data; Quantizing an image with KMeans clustering; Finding the closest objects in the feature space; Probabilistic clustering with Gaussian Mixture Models; Using KMeans for outlier detection 
505 8 |a Using k-NN for regressionChapter 4: Classifying Data with scikit-learn; Introduction; Doing basic classifications with Decision Trees; Tuning a Decision Tree model; Using many Decision Trees -- random forests; Tuning a random forest model; Classifying data with Support Vector Machines; Generalizing with multiclass classification; Using LDA for classification; Working with QDA -- a nonlinear LDA; Using Stochastic Gradient Descent for classification; Classifying documents with Naïve Bayes; Label propagation with semi-supervised learning; Chapter 5: Post-model Workflow; Introduction 
505 8 |a K-fold cross validationAutomatic cross validation; Cross validation with ShuffleSplit; Stratified k-fold; Poor man's grid search; Brute force grid search; Using dummy estimators to compare results; Regression model evaluation; Feature selection; Feature selection on L1 norms; Persisting models with joblib; Index 
520 |a If you're a data scientist already familiar with Python but not Scikit-Learn, or are familiar with other programming languages like R and want to take the plunge with the gold standard of Python machine learning libraries, then this is the book for you. 
590 |a EBSCO eBook Academic Comprehensive Collection North America 
650 0 |a Machine learning. 
650 0 |a Python (Computer program language) 
730 0 |a WORLDSHARE SUB RECORDS 
776 0 8 |i Print version:  |a Hauck, Trent.  |t Scikit-learn cookbook : over 50 recipes to incorporate scikit-learn into every step of the data science pipeline, from feature extraction to model builing and model evaluation.  |d Birmingham, [England] : Packt Publishing, ©2014  |h iii, 199 pages  |z 9781783989485 
856 4 0 |u https://ezproxy.mtsu.edu/login?url=https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=886453  |z CONNECT  |3 EBSCO  |t 0 
907 |a 4621545  |b 05-26-21  |c 06-30-20 
998 |a wi  |b 05-26-21  |c m  |d z   |e -  |f eng  |g enk  |h 0  |i 2 
994 |a 92  |b TXM 
999 f f |i f21b1c75-29b7-47e7-96cd-76d36c68760e  |s 5759721f-fe2d-45df-bdac-ef73db933fb2  |t 0 
952 f f |t 1  |e Q325.5  |h Library of Congress classification 
856 4 0 |3 EBSCO  |t 0  |u https://ezproxy.mtsu.edu/login?url=https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=886453  |z CONNECT