Premium Features

Previous Buy now to get complete version Next
  • Home
uCertify Logo
  • login
  • Signup
    • Help & Support
    • Accessibility
    • Testimonials
  • Hello GuestLogin or Signup
  • Feedback & Support
    • Support
    • Keyboard Shortcuts
    • Send Feedback
Scroll to top button

Big Data Analysis with Python

(BIG-DATA-PYTHON.AJ1) / ISBN: 978-1-64459-315-8
This course includes
Lessons
TestPrep
LiveLab
Mentoring (Add-on)
BIG-DATA-PYTHON.AJ1 : Big Data Analysis with Python
Try this course Pre-Assessment and first two Lessons free No credit card required
Are you an instructor? Teach using uCertify products
Request a free evaluation copy

Big Data Analysis with Python

Get hands-on experience of big data analysis with Python with the comprehensive course and lab. The lab provides hands-on learning in analyzing data with the use of python, beginning up with the basics to mastering different types of data. The course and lab deal with python data science stack, statistical visualizations, working with big data frameworks, handling missing values and correlation analysis, exploratory data analysis, reproducibility in big data analysis, and many more. 
Here's what you will get

Lessons
  • 9+ Lessons
  • 20+ Exercises
  • 50+ Quizzes
  • 65+ Flashcards
  • 65+ Glossary of terms
TestPrep
  • 30+ Pre Assessment Questions
  • 30+ Post Assessment Questions
LiveLab
  • 48+ LiveLab
  • 12+ Video tutorials
  • 20+ Minutes
Here's what you will learn
Download Course Outline
Lesson 1: Preface
  • About
Lesson 2: The Python Data Science Stack
  • Introduction
  • Python Libraries and Packages
  • Using Pandas
  • Data Type Conversion
  • Aggregation and Grouping
  • Exporting Data from Pandas
  • Visualization with Pandas
  • Summary
Lesson 3: Statistical Visualizations
  • Introduction
  • Types of Graphs and When to Use Them
  • Components of a Graph
  • Seaborn
  • Which Tool Should Be Used?
  • Types of Graphs
  • Pandas DataFrames and Grouped Data
  • Changing Plot Design: Modifying Graph Components
  • Exporting Graphs
  • Summary
Lesson 4: Working with Big Data Frameworks
  • Introduction
  • Hadoop
  • Spark
  • Writing Parquet Files
  • Handling Unstructured Data
  • Summary
Lesson 5: Diving Deeper with Spark
  • Introduction
  • Getting Started with Spark DataFrames
  • Writing Output from Spark DataFrames
  • Exploring Spark DataFrames
  • Data Manipulation with Spark DataFrames
  • Graphs in Spark
  • Summary
Lesson 6: Handling Missing Values and Correlation Analysis
  • Introduction
  • Setting up the Jupyter Notebook
  • Missing Values
  • Handling Missing Values in Spark DataFrames
  • Correlation
  • Summary
Lesson 7: Exploratory Data Analysis
  • Introduction
  • Defining a Business Problem
  • Translating a Business Problem into Measurable Metrics and Exploratory Data Analysis (EDA)
  • Structured Approach to the Data Science Project Life Cycle
  • Summary
Lesson 8: Reproducibility in Big Data Analysis
  • Introduction
  • Reproducibility with Jupyter Notebooks
  • Gathering Data in a Reproducible Way
  • Code Practices and Standards
  • Avoiding Repetition
  • Summary
Lesson 9: Creating a Full Analysis Report
  • Introduction
  • Reading Data in Spark from Different Data Sources
  • SQL Operations on a Spark DataFrame
  • Generating Statistical Measurements
  • Summary

Hands on Activities (Live Labs)

The Python Data Science Stack

  • Interacting with the Python Shell
  • Calculating the Square
  • Grouping a DataFrame
  • Applying a Function to a Column
  • Subsetting a DataFrame
  • Slicing and Subsetting
  • Reading Data from a CSV File
  • Viewing the Standard Deviation
  • Calculating the Median Value
  • Calculating the Mean Value

Statistical Visualizations

  • Plotting an Analytical Graph
  • Creating a Graph
  • Creating a Graph for a Mathematical Function
  • Creating a Line Graph Using Seaborn
  • Creating a Line Graph Using pandas
  • Creating a Line Graph Using matplotlib
  • Detecting Outliers
  • Displaying Histograms
  • Using a Box Plot
  • Constructing a Scatterplot
  • Plotting a Line Graph with Styles and Color
  • Configuring a Title and Labels for Axis Objects
  • Designing a Complete Plot
  • Exporting a Graph to a File on a Disk

Working with Big Data Frameworks

  • Performing DataFrame Operations in Spark
  • Accessing Data with Spark
  • Parsing Text in Spark

Diving Deeper with Spark

  • Creating a DataFrame Using a CSV File
  • Creating a DataFrame from an Existing RDD
  • Specifying the Schema of a DataFrame
  • Removing a Column from a DataFrame
  • Renaming a Column in a DataFrame
  • Adding a Column to a DataFrame
  • Creating a KDE Plot
  • Creating a Linear Model Plot
  • Creating a Bar Chart

Handling Missing Values and Correlation Analysis

  • Filtering Data
  • Counting Missing Values
  • Handling NaN Values
  • Using the Backward and Forward Filling Methods
  • Calculating Correlation Coefficient

Exploratory Data Analysis

  • Generating the Feature Importance of the Target Variable
  • Identifying the Target Variable
  • Plotting a Heatmap
  • Generating a Normal Distribution Plot

Reproducibility in Big Data Analysis

  • Performing Data Reproducibility
  • Preprocessing Missing Values with High Reproducibility
  • Normalizating the Data
×
uc logo for app downloadDownload our uCertify App [lms_setting_placeholder: This filed is used to set the LMS settings.

Share with your friends and colleagues

We use cookies to enhance your experience. By continuing to visit this site you agree to our use of cookies. More information
Accept