Pentaho Data Integration quickly with Juju

by Guest on 3 March 2016

Tom Barber at Meteorite explains how beneficial Juju Charms has been when migrating customer services from old, tricky servers.

I promised a demo of Juju GUI in my earlier blog and we’ve worked hard on a Pentaho Data Integration charm, so I figured we’d combine the two.

For those of you who don’t know, Juju is an Application Modelling platform developed by Canonical. Pentaho Data Integration is the best Open Source ETL toolkit on the planet. So why not combine the best of both worlds?

At Meteorite we have been migrating customer services from old, tricky to maintain servers to Juju managed clusters and its been an eye opening experience, in a good way! The latest product in our armoury to get the treatment is PDI. Our brief for this is threefold:

Firstly we need an easy way to deploy PDI on a bunch of different servers, in different clouds and maintain it, along with that it needs to provide on demand and scheduled ETL execution.

We also need a way to spin up and configure remote Carte servers for remote execution of ETL jobs and transformations, we would also like self configuring clusters as an added bonus.

Big Data deployment to work with the Big Data charms Juju already provides.

The charm is still under development, don’t expect everything to work yet, some of the functionality hasn’t even got any code written yet, but the charm is available here

Of our brief 1 and 2 are 90% complete, 3 will be done in the not too distant future.

But of course it wouldn’t be cricket if we didn’t give you a quick video demo of how easy it is so spin up a PDI cluster so here we are…

Apologies for the lack of contrast on the terminal, I need to remember to resolve that before the next video, but luckily we’re mostly interested in the GUI and webpages so hopefully it doesn’t impact things too much!

Original post

About the author

Tom Barber is the founder and technical director of Meteorite.bi, a consulting company specialising in the Saiku Analytics platform. His weekly duties include BI consulting, Scala & Java programming and tinkering with System Administration frameworks. In his spare time Tom is a regular blogger and open source committer. You can read more about Tom on the Meteorite.bi blog.

Related posts

Kubernetes backups just got easier with the CloudCasa charm from Catalogic

For a native integration for Canonical’s Kubernetes platform, Juju was the perfect fit, and the charm makes consuming CloudCasa seamless for users. […]

Canonical and OpenAirInterface to collaborate on open source telecom network infrastructure

Canonical is excited to announce that we are collaborating with OpenAirInterface (OAI) to drive the development and promotion of open source software for open radio access networks (Open RAN). Canonical will bring automation in software lifecycle management to OAI’s RAN stack, alongside additional infrastructure capabilities. This will be […]