Portada

LEARN PYSPARK IBD

APRESS
09 / 2019
9781484249604
Inglés

Sinopsis

Leverage machine and deep learning models to build applications on real-time dataáusing PySpark. This book is perfect for those who want to learn to use this language to perform exploratory data analysis and solve an array of business challenges.YouâÇÖll start by reviewing PySpark fundamentals, such as SparkâÇÖs core architecture, and see how to use PySpark for big data processing like data ingestion, cleaning, and transformations techniques. This is followed by building workflows for analyzing streaming data using PySpark and a comparison of various streaming platforms.áYouâÇÖll then see how to schedule different spark jobs using Airflow with PySpark and book examine tuning machine and deep learning models for real-time predictions. This book concludes with a discussion on graph frames and performing network analysis using graph algorithms in PySpark. All the code presented in the book will be available in Python scripts on Github.What YouâÇÖll LearnDevelop pipelines for streaming data processing using PySparkáBuild Machine Learning & Deep Learning models using PySpark latest offeringsUse graph analytics using PySparkáCreate Sequence Embeddings from Text dataáWho This Book is ForáData Scientists, machine learning and deep learning engineers who want to learn and use PySpark for real time analysis on streaming data.

PVP
56,14