Training Day
NoSQL matters Training Day 25.04.2013
The training day will give you the opportunity to dive deeply into the secrets of the selected databases. Get the information from first hand and enjoy the familiar atmosphere in the small groups. Each training slot will last 4h and you can choose a training session in the morning and one in the afternoon. Please ensure to provide your own laptop.
We will offer soft drinks and coffee during the day and a warm lunch will be served.
| Time | House 6 Room 1 |
House 6 Room 2 |
House 6 Room 3 |
House 6 Room 4 |
House 6 Room 5 |
| 08:30-08:55 | Registration | ||||
| 09:00-13:00 |
Fully Booked Reduce development time and cost with professional MongoDB training by 10gen Marc Schwering |
Fully Booked Data Modelling in a NoSQL world Jan Steemann |
Getting started with InfiniteGraph Dr. Nic Caine |
Apache Hadoop Essentials Dr. Stephan Kepser |
Get up and running with Apache CouchDB Jan Lehnardt |
| 13:00-13:45 | Lunch | ||||
| 13:45-17:45 | Introduction to the graph database Neo4j Michael Hunger |
Fully Booked Data Modelling in a NoSQL world Jan Steemann |
Apache Drill: interactive query and analysis on large-scale datasets Michael Hausenblas |
Fully Booked Apache Hadoop Essentials Dr. Stephan Kepser |
|
Apache Drill: interactive query and analysis on large-scale datasets
Apache Drill is a distributed system for interactive query and analysis of large-scale datasets, inspired by Google’s Dremel technology. It is a design goal to scale to 10,000 servers or more and to be able to process Petabytes of data and trillions of records in seconds.
The training session introduces Apache Drill and its use cases. We will review the Apache Drill architecture and extensibility points and then we will look into the data sources and interfaces supported. In the hands-on part, participants will learn how to set up Drill, configure data sources and query locally and in a distributed fashion.
Trainer: Michael Hausenblas, Chief Data Engineer EMEA, MapR Technologies
Getting started with InfiniteGraph
InfiniteGraph specializes in Big Data applications that need to handle massive amounts of distributed, scalable and complex data. Features such as parallel ingest, high-speed graph traversals and scalability enable maximum performance within Big Data applications. New features in InfiniteGraph 3.0 include improved indexing and query, navigation queries, simplified graph creation and access, etc.
The half day tutorial will give an insight into system architecture of InfiniteGraph, discussing the files, servers, tools and infrastructure that support InfiniteGraph applications.
In the hands-on lesson we will go step by step from installation process to creating our own small graph.
What do you need?
Basically You!… and don´t forget your laptop!
System requirements: Windows (32 or 64bit), Linux (64bit) and MacOS (64bit)
Trainer: Dr. Nic Caine, Director Technical Development EMEA, Objectivity, Inc.
Introduction to the graph database Neo4j
This tutorial covers the core functionality of the Neo4j graph database. With a mixture of theory and hands-on practice sessions, attendees will quickly learn how easy it is to develop a Neo4j-backed application.
Topics covered include:
• Introduction to Neo4j
• Overview of Cypher query language
• Social use cases including recommendations
• Permissions modeling
• Other case studies in real world domains
• Opportunity to consult on current projects, possible implementations and proof of concepts
Attendees won’t need any previous experience with Neo4j, NOSQL databases or specific development languages, but will need their own laptop with Neo4j installed.
Trainer: Michael Hunger, Lead Developer, Neo Technology
Data Modelling in a NoSQL world
Learn about data modelling in a NoSQL environment in this half-day class.
Even though most NoSQL databases follow the “schema-free” data paradigma, what a database is really good at is determined by its underlying architecture and storage model.
It is therefore important to choose a matching data model to get the best out of the underlying database technology. Application requirements such as consistency demands also need to be considered.
During the half-day, attendees will get an overview of different data storage models available in NoSQL databases. There will also be hands-on examples and experiments using key/value, document, and graph data structures.
No prior knowledge of NoSQL databases is required. Some basic experience with relational databases (like MySQL) or data modelling will be helpful but is not essential. Participants will need to bring their own laptop (preferably Linux or MacOS). Installation instructions for the required software will be sent out prior to the class.
Trainer: Jan Steemann, Senior Developer, triAGENS
Reduce development time and cost with professional MongoDB training by 10gen
MongoDB is the open-source, document-oriented technology that is reshaping the market due to the popularity of its agile and scalable approach among developers and IT professionals.
This half-day MongoDB workshop will walk through core concepts, including schema design, querying, replication, sharding and basic administration, using demos and exercises. It is intended to create a foundation that developers can build on in their applications.
Trainer: Marc Schwering, Solutions Architect for 10gen, the MongoDB company
Get up and running with Apache CouchDB
CouchDB was one of the first more popular NoSQL databases and it shines with a unique combination of strong core features. CouchDB’s main contribution to the database world is seamless peer-to-peer, master-master (or master-less) data replication with conflict detection. Think of CouchDB as git, but for you application data. Combined with an easy to use REST API, approachable and flexible MapReduce queries in JavaScript and a plethora of projects supporting the larger CouchDB ecosystem, as well as implementation features like scalability, fault tolerance and support for high concurrency, CouchDB is a must-know in your developer and database admin tool belt.
The workshop will cover:
– API Overview.
– Replication deep dive.
– Querying with MapReduce views.
– CouchDB operation & security.
– CouchDB performance tuning.
– CouchDB vs. Couchbase.
Bring a laptop and a browser, Unices are preferred, but not mandatory. Bonus points if your command line knows `curl`.
Required skills:
– HTTP
– Database basics
– Beginner-level programming language knowledge, which one doesn’t matter.
Trainer: Jan Lehnardt, core contributor to Apache CouchDB
Apache Hadoop Essentials
This half-day course provides a technical overview of Apache Hadoop for those with no prior programming skills. Hands-on labs will give users a deeper understanding of Apache Hadoop using real world use cases to help provide the understanding of the power of Hadoop.
During this half-day class will have 50/50 mix of hands on real world use case labs and lecture/discussion which provides a hands-on, high-level overview of Hadoop, walk through the Hadoop ecosystem.
You do not need any technical background – the hands on labs will provide step-by-step instructions.
Participants must provide their own laptop (with browser loaded) with a minimum of 8Gb of RAM, minimum storage availability of 50Gb, and USB port with VirtualBox (4.2.4) loaded. Upon registration users will be sent configuration information for VirtualBox. Please note there is no dedicated broadband for this class. All labs and materials will be delivered to students via USB flash drive provided by codecentric.
Trainer: Dr. Stephan Kepser is a Senior IT Consultant with codecentric AG, the leading company in Germany for agile software development using innovative technologies. He is a certified Hadoop Development trainer. He’s been discussing questions on Cloud Computing and Big Data for quite some time now co-authoring journal articles and publishing some blog entries on these topics. His interests reach from legal questions to architecture and design issues to technical details of NoSQL databases.

