Differences between revisions 1 and 2
Revision 1 as of 2015-02-05 20:26:00
Size: 36
Editor: IreneWinkler
Comment:
Revision 2 as of 2015-03-23 10:45:42
Size: 1476
Editor: MikioBraun
Comment:
= Lecture Big Data: Scalable Machine Learning =

== General Information ==

|| Lecture || Thursdays 12-14 ||
|| Room || MAR 4.065 ||
|| Teachers || Mikio L. Braun ||
|| Contact || mikio.braun@tu-berlin.de ||

== Introduction ==

In this lecture series, we will discuss how large-scale learning is performed. We will study individual algorithms and show how learning algorithms are modified so that they can deal with large data sets. These approaches do not necessarily lead to scalable computations in the sense of distributed systems; more often they rely on skillful approximations and simplifications which nevertheless ensure that the resulting algorithm leads to good predictions.

Topics include:

 * Fast approximation algorithms for classification and regression including stochastic gradient descent and bundle methods.
 * Optimization theory.
 * Sampling and approximations.
 * Graphical models and Variational Bayes.
 * Markov Chain Monte Carlo for learning.
 * Hashing and sketches.
 * Sequential Analysis for Testing and Cross Validation.
 * Distributed infrastructures for learning (e.g. parameter servers).
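
To give a flavour of the kind of approximation discussed in the introduction, here is a minimal, illustrative sketch of stochastic gradient descent for linear least-squares regression. It is not course material: the function name and the synthetic data are made up for illustration, and the sketch assumes plain Python with NumPy.

{{{#!python
import numpy as np

def sgd_linear_regression(X, y, lr=0.01, epochs=10, seed=0):
    """Fit w so that X @ w approximates y, updating on one example at a time."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(epochs):
        for i in rng.permutation(n):           # visit examples in random order
            grad = (X[i] @ w - y[i]) * X[i]    # gradient of 0.5 * (x_i^T w - y_i)^2
            w -= lr * grad                     # noisy but very cheap update
    return w

# Toy usage: recover a known weight vector from synthetic, slightly noisy data.
rng = np.random.default_rng(1)
X = rng.normal(size=(1000, 3))
w_true = np.array([1.0, -2.0, 0.5])
y = X @ w_true + 0.01 * rng.normal(size=1000)
print(sgd_linear_regression(X, y, lr=0.05, epochs=20))
}}}

Each update uses only a single example instead of the full data set, which is exactly the trade-off mentioned above: the individual steps are noisy approximations of the true gradient, but the overall procedure still converges to a good predictor at a fraction of the cost.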

== Prerequisites ==

This is an advanced course which assumes working knowledge of machine learning algorithms as provided by the lectures Machine Learning 1 and/or Machine Learning 2. Apart from that, working knowledge of linear algebra, multivariate analysis, probability theory, and computing architectures is assumed.
