Learn Analytics

By following the Big Data Analytics path, you will acquire a basic understanding of text analytics and the Annotator Query Language (AQL), of the Jaql programming language and how it is used with text analytics, and of how to use BigSheets to analyze and visualize big data without having to code.

Featured Courses

Text Analytics Essentials

The analysis of emails, blogs, tweets, forums and other forms of unstructured text data constitutes what we call text analytics.  Text analytics is applicable to most industries; for example, if your company is suspicious about company secrets being leaked to competitors by employees, text analytics can help analyze millions of employees’ emails.  If you would like to find common pain points your customers face when using your products, you can analyze their comments and questions in forums. If you would like to measure positive or negative perceptions of a company, brand, or product, you can perform sentiment analysis using text analytics. This course teaches you the basics of text analytics.
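To make the sentiment analysis idea concrete, here is a minimal sketch in Python. It is not the AQL-based approach taught in the course; it simply counts words from two small, hypothetical word lists (POSITIVE and NEGATIVE are assumptions made up for illustration) and labels each comment accordingly.

```python
import re

# Hypothetical, hand-picked word lists (an assumption for illustration only;
# real text analytics tools use far richer rules and dictionaries).
POSITIVE = {"great", "love", "helpful", "easy", "excellent"}
NEGATIVE = {"slow", "broken", "confusing", "hate", "crash"}


def sentiment_score(text):
    """Return (number of positive words) minus (number of negative words)."""
    words = re.findall(r"[a-z']+", text.lower())
    positives = sum(word in POSITIVE for word in words)
    negatives = sum(word in NEGATIVE for word in words)
    return positives - negatives


def sentiment_label(text):
    """Map the score to a coarse label: positive, negative, or neutral."""
    score = sentiment_score(text)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


# Made-up forum comments, used only to exercise the functions above.
comments = [
    "I love this product, setup was easy",
    "The app is slow and the export is broken",
    "It arrived on Tuesday",
]
labels = [sentiment_label(comment) for comment in comments]
for comment, label in zip(comments, labels):
    print(f"{label}: {comment}")
```

A word-counting approach like this misses negation ("not easy") and sarcasm, which is one reason dedicated text analytics tooling exists.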

Big Data University has been chosen by IBM as one of the issuers of badges as part of the IBM Open Badge program. Share your achievements through LinkedIn, Facebook, Twitter, and more! See more at: http://bigdatauniversity.com/bdu-wp/bdu-course/text-analytics-essentials/

Big Data University leverages the services of Pearson VUE Acclaim to assist in the administration of the IBM Open Badge program.  By enrolling into this course, you agree to Big Data University sharing your details with Pearson VUE Acclaim for the strict use of issuing your badge upon completion of the badge criteria.

This course was tested and validated for v3 of the QSE VM. Please download the v3 VM to complete the lab exercises.

https://www14.software.ibm.com/webapp/iwm/web/preLogin.do?source=swg-beta-iibob

 

Introduction to R

With over 2 million users worldwide, R is rapidly becoming the leading programming language in statistics and data science. Every year, the number of R users grows by 40%, and an increasing number of organizations are using it in their day-to-day activities.

In this introduction to R, you will master the basics of this beautiful open source language, such as factors, lists and data frames. With the knowledge gained in this course, you will be ready to undertake your very own first data analysis.

Special offer by DataCamp:

Complete this course through Big Data University, and gain free access to the entire DataCamp catalog of courses for two weeks!

Introduction to Solr

Solr (pronounced "solar") is an open source enterprise search platform, written in Java, from the Apache Lucene project. Its major features include full-text search, hit highlighting, faceted search, real-time indexing, dynamic clustering, database integration, NoSQL features and rich document (e.g., Word, PDF) handling. Providing distributed search and index replication, Solr is highly scalable and fault tolerant. Solr is the most popular enterprise search engine.

Solr is written in Java and runs as a standalone full-text search server. Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it usable from most popular programming languages. Solr's powerful external configuration allows it to be tailored to many types of application without Java coding, and it has a plugin architecture to support more advanced customization.
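As a rough illustration of the HTTP API described above, the following Python sketch builds the URL for a query against Solr's select handler and asks for a JSON response. The host, port, core name ("courses"), and field names are assumptions chosen for the example; the helper build_solr_query_url is hypothetical and not part of Solr. The sketch only constructs the URL and does not contact a server.

```python
from urllib.parse import urlencode


def build_solr_query_url(base_url, core, query, rows=10,
                         highlight_field=None, facet_field=None):
    """Build a URL for the select handler of a Solr core (hypothetical helper).

    q, rows, wt, hl, hl.fl, facet and facet.field are standard Solr
    query parameters; everything else here is illustrative.
    """
    params = [("q", query), ("rows", rows), ("wt", "json")]
    if highlight_field:
        # Hit highlighting on one field
        params += [("hl", "true"), ("hl.fl", highlight_field)]
    if facet_field:
        # Faceted search: count matching documents per value of one field
        params += [("facet", "true"), ("facet.field", facet_field)]
    return f"{base_url.rstrip('/')}/{core}/select?{urlencode(params)}"


# Assumed local Solr instance and a hypothetical "courses" core.
url = build_solr_query_url(
    "http://localhost:8983/solr",
    "courses",
    "title:analytics",
    rows=5,
    facet_field="category",
)
print(url)
```

Because the API is plain HTTP, the resulting URL could be fetched with any HTTP client in any language, which is what makes Solr usable from most popular programming languages.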

Big Data University has been chosen by IBM as one of the issuers of badges as part of the IBM Open Badge program. Share your achievements through LinkedIn, Facebook, Twitter, and more!

Big Data University leverages the services of Pearson VUE Acclaim to assist in the administration of the IBM Open Badge program. If this course is part of the IBM Open Badges program, you will be awarded the badge upon the completion of the badge criteria. Please visit this page to find out more (http://bigdatauniversity.com/bdu-badge/). By enrolling into this course, you agree to Big Data University sharing your details with Pearson VUE Acclaim for the strict use of issuing your badge upon completion of the badge criteria.

Introduction to Data Analysis using R

Learn how to tackle data analysis problems using the powerful open source language R. The course will take you from learning the basics of R to using it to explore many different types of data. You will learn how to prepare data for analysis, compute various statistical measures, create meaningful data visualizations, create reusable R functions, create R models to predict expected future outcomes, and more!

Using BigSheets for Spreadsheet-like Analytics

This course introduces the student to the capabilities of BigSheets, a component of InfoSphere BigInsights. BigSheets lets the analyst visualize and analyze data stored in HDFS through a spreadsheet-type interface, without any programming.

Big Data University has been chosen by IBM as one of the issuers of badges as part of the IBM Open Badge program. Share your achievements through LinkedIn, Facebook, Twitter, and more! See more at: http://bigdatauniversity.com/bdu-wp/bdu-course/using-bigsheets-for-spreadsheet-like-analytics/

Big Data University leverages the services of Pearson VUE Acclaim to assist in the administration of the IBM Open Badge program.  By enrolling into this course, you agree to Big Data University sharing your details with Pearson VUE Acclaim for the strict use of issuing your badge upon completion of the badge criteria.

The labs for this course were recently updated and tested using BigInsights Quick Start 4.0.

 

Analyzing Big Data with a Concept IBM Watson and UIMA (beta)

This course provides some insight into Watson-like concepts (natural language processing and dealing with unstructured data). We will first look at how our Concept IBM Watson will understand a question. Next, we will look at how an application like OpenNLP can be used to understand a question that is entered into the system and how the answer can be derived. The second topic focuses on how to get information from unstructured data and convert it into structured data. We will use an architecture called UIMA to do this. Finally, we will look at how to build the necessary servers to make all these things work together.


There will be a lot of heavy Java programming involved. We will use Eclipse as the development platform to create the Java programs that will complete this solution.

 

Big Data University also offers a vast number of courses on various other analytics, big data, and data science topics. View our complete course catalog.

What is Big Data University?

An IBM community initiative, Big Data University aims to offer the world’s best education on big data. Learn about big data, data science and analytic technologies from experts using hands-on exercises and interactive videos. Best of all, it’s completely free.