Training Day

NoSQL matters Training Day 25.04.2013

The training day will give you the opportunity to dive deeply into the secrets of the selected databases. Get the information from first hand and enjoy the familiar atmosphere in the small groups. Each training slot will last 4h and you can choose a training session in the morning and one in the afternoon. Please ensure to provide your own laptop.

We will offer soft drinks and coffee during the day and a warm lunch will be served.

Training Day Agenda


Time House 6
Room 1
House 6
Room 2
House 6
Room 3
House 6
Room 4
House 6
Room 5
08:30-08:55 Registration
09:00-13:00 Fully Booked
Reduce development time and cost with professional MongoDB training by 10gen
Marc Schwering
Fully Booked
Data Modelling in a NoSQL world
Jan Steemann
Getting started with InfiniteGraph
Dr. Nic Caine
Apache Hadoop Essentials
Dr. Stephan Kepser
Get up and running with Apache CouchDB
Jan Lehnardt
13:00-13:45 Lunch
13:45-17:45 Introduction to the graph database Neo4j
Michael Hunger
Fully Booked
Data Modelling in a NoSQL world
Jan Steemann
Apache Drill: interactive query and analysis on large-scale datasets
Michael Hausenblas
Fully Booked
Apache Hadoop Essentials
Dr. Stephan Kepser

Apache Drill: interactive query and analysis on large-scale datasets

Apache Drill is a distributed system for interactive query and analysis of large-scale datasets, inspired by Google’s Dremel technology. It is a design goal to scale to 10,000 servers or more and to be able to process Petabytes of data and trillions of records in seconds.

The training session introduces Apache Drill and its use cases. We will review the Apache Drill architecture and extensibility points and then we will look into the data sources and interfaces supported. In the hands-on part, participants will learn how to set up Drill, configure data sources and query locally and in a distributed fashion.

Trainer: Michael Hausenblas, Chief Data Engineer EMEA, MapR Technologies

Getting started with InfiniteGraph

InfiniteGraph specializes in Big Data applications that need to handle massive amounts of distributed, scalable and complex data. Features such as parallel ingest, high-speed graph traversals and scalability enable maximum performance within Big Data applications. New features in InfiniteGraph 3.0 include improved indexing and query, navigation queries, simplified graph creation and access, etc.
The half day tutorial will give an insight into system architecture of InfiniteGraph, discussing the files, servers, tools and infrastructure that support InfiniteGraph applications.
In the hands-on lesson we will go step by step from installation process to creating our own small graph.

What do you need?
Basically You!… and don´t forget your laptop!
System requirements: Windows (32 or 64bit), Linux (64bit) and MacOS (64bit)

Trainer: Dr. Nic Caine, Director Technical Development EMEA, Objectivity, Inc.

Introduction to the graph database Neo4j

This tutorial covers the core functionality of the Neo4j graph database. With a mixture of theory and hands-on practice sessions, attendees will quickly learn how easy it is to develop a Neo4j-backed application.
Topics covered include:
• Introduction to Neo4j
• Overview of Cypher query language
• Social use cases including recommendations
• Permissions modeling
• Other case studies in real world domains
• Opportunity to consult on current projects, possible implementations and proof of concepts

Attendees won’t need any previous experience with Neo4j, NOSQL databases or specific development languages, but will need their own laptop with Neo4j installed.

Trainer: Michael Hunger, Lead Developer, Neo Technology

Data Modelling in a NoSQL world

Learn about data modelling in a NoSQL environment in this half-day class.
Even though most NoSQL databases follow the “schema-free” data paradigma, what a database is really good at is determined by its underlying architecture and storage model.
It is therefore important to choose a matching data model to get the best out of the underlying database technology. Application requirements such as consistency demands also need to be considered.

During the half-day, attendees will get an overview of different data storage models available in NoSQL databases. There will also be hands-on examples and experiments using key/value, document, and graph data structures.

No prior knowledge of NoSQL databases is required. Some basic experience with relational databases (like MySQL) or data modelling will be helpful but is not essential. Participants will need to bring their own laptop (preferably Linux or MacOS). Installation instructions for the required software will be sent out prior to the class.

Trainer: Jan Steemann, Senior Developer, triAGENS

Reduce development time and cost with professional MongoDB training by 10gen

MongoDB is the open-source, document-oriented technology that is reshaping the market due to the popularity of its agile and scalable approach among developers and IT professionals.
This half-day MongoDB workshop will walk through core concepts, including schema design, querying, replication, sharding and basic administration, using demos and exercises. It is intended to create a foundation that developers can build on in their applications.

Trainer: Marc Schwering, Solutions Architect for 10gen, the MongoDB company

Get up and running with Apache CouchDB

CouchDB was one of the first more popular NoSQL databases and it shines with a unique combination of strong core features. CouchDB’s main contribution to the database world is seamless peer-to-peer, master-master (or master-less) data replication with conflict detection. Think of CouchDB as git, but for you application data. Combined with an easy to use REST API, approachable and flexible MapReduce queries in JavaScript and a plethora of projects supporting the larger CouchDB ecosystem, as well as implementation features like scalability, fault tolerance and support for high concurrency, CouchDB is a must-know in your developer and database admin tool belt.

The workshop will cover:

– API Overview.
– Replication deep dive.
– Querying with MapReduce views.
– CouchDB operation & security.
– CouchDB performance tuning.
– CouchDB vs. Couchbase.

Bring a laptop and a browser, Unices are preferred, but not mandatory. Bonus points if your command line knows `curl`.

Required skills:
– Database basics
– Beginner-level programming language knowledge, which one doesn’t matter.

Trainer: Jan Lehnardt, core contributor to Apache CouchDB

Apache Hadoop Essentials

This half-day course provides a technical overview of Apache Hadoop for those with no prior programming skills. Hands-on labs will give users a deeper understanding of Apache Hadoop using real world use cases to help provide the understanding of the power of Hadoop.
During this half-day class will have 50/50 mix of hands on real world use case labs and lecture/discussion which provides a hands-on, high-level overview of Hadoop, walk through the Hadoop ecosystem.

You do not need any technical background – the hands on labs will provide step-by-step instructions.
Participants must provide their own laptop (with browser loaded) with a minimum of 8Gb of RAM, minimum storage availability of 50Gb, and USB port with VirtualBox (4.2.4) loaded. Upon registration users will be sent configuration information for VirtualBox. Please note there is no dedicated broadband for this class. All labs and materials will be delivered to students via USB flash drive provided by codecentric.

Trainer: Dr. Stephan Kepser is a Senior IT Consultant with codecentric AG, the leading company in Germany for agile software development using innovative technologies. He is a certified Hadoop Development trainer. He’s been discussing questions on Cloud Computing and Big Data for quite some time now co-authoring journal articles and publishing some blog entries on these topics. His interests reach from legal questions to architecture and design issues to technical details of NoSQL databases.