Learn Python

In the Python training path, you learn to program in Python and then use it as a language to analyze small and big data. The entire path has beed designed to help students with no prior computer science or programming experience to think computationally and write programs to tackle useful problems in data science domain. This learning path contains four courses:

The first course, Introduction to Python, is aimed at students new to the language who may or may not have experience with other programming languages. In this course you will learn the main concepts of Python programming. In Python for Data Analysis course, we assume you are already familiar with Python programming, so you will learn SciPy, Numpy, Pandas and matplotlib package in this course. In Machine learning with Python you learn how to use the Python language to implement and apply machine learning and statistics algorithms. Finally, in Spark programming with Python, you will learn to work with Big Data using Spark and Python API.
All of these courses are taught using Data Scientist Workbench, which is an open source environment that you can easily write your python programs to analyze your small datasets or big data with Spark.

Featured Courses

Introduction to Python

This course is under development

Introduction to Python, is aimed at students new to the language who may or may not have experience with other programming languages. In this course you will learn the main concepts of Python programming like variables, data types, loops and conditional statements. This course also gives you a rapid introduction to some Python packages which are useful for mathematic computations and data wrangling, and then, these libraries will be covered in the next courses in the path more deeply.

Python for Data Analysis

This course is under development

In Python for Data Analysis course, we assume students are already familiar with Python programming and they will learn advanced Python techniques useful for load, wrangling, cleaning, transformation and visualization of data. You will learn about SciPy, Numpy, Pandas and matplotlib package in this course.

Introduction to Machine Learning with Python

This course is under development

In Machine learning with Python you learn how to use the Python language to implement and apply machine learning and statistics algorithms. It is a very important skill for all data analysts and data scientists, who want make predictive models. You will be rapidly work with scikit-learn package to extract and identify useful features that best represent your data, to make classification and clustering models, and to evaluate the performance of your machine learning algorithms.

Spark programming with Python

This course is under development

Although, Python is a programming language with excellent libraries for data analysis and machine learning, it is still limited to working with small size datasets that can fit on one machine memory. In Spark programming with Python, you will learn to work with Big Data using Spark and Python API. Apache Spark is an open source cluster computing framework that able you to develop distributed machine learning algorithms on the classic MapReduce computation frameworks.

Introduction to Python

This course is under development

Introduction to Python, is aimed at students new to the language who may or may not have experience with other programming languages. In this course you will learn the main concepts of Python programming like variables, data types, loops and conditional statements. This course also gives you a rapid introduction to some Python packages which are useful for mathematic computations and data wrangling, and then, these libraries will be covered in the next courses in the path more deeply.

Spark programming with Python

This course is under development

Although, Python is a programming language with excellent libraries for data analysis and machine learning, it is still limited to working with small size datasets that can fit on one machine memory. In Spark programming with Python, you will learn to work with Big Data using Spark and Python API. Apache Spark is an open source cluster computing framework that able you to develop distributed machine learning algorithms on the classic MapReduce computation frameworks.

Python for Data Analysis

This course is under development

In Python for Data Analysis course, we assume students are already familiar with Python programming and they will learn advanced Python techniques useful for load, wrangling, cleaning, transformation and visualization of data. You will learn about SciPy, Numpy, Pandas and matplotlib package in this course.

Introduction to Machine Learning with Python

This course is under development

In Machine learning with Python you learn how to use the Python language to implement and apply machine learning and statistics algorithms. It is a very important skill for all data analysts and data scientists, who want make predictive models. You will be rapidly work with scikit-learn package to extract and identify useful features that best represent your data, to make classification and clustering models, and to evaluate the performance of your machine learning algorithms.

What is Big Data University?

An IBM community initiative, Big Data University is the world’s best education on big data. Learn about big data, data science and analytic technologies from experts using hands-on exercises and interactive videos. Best of all, it’s completely free.