ETL Work - Talend

Extract Transform and Load (ETL) is a general concept of how to move data from one system to another.  There are a lot of tools, and a lot of different methodologies to successfully accomplish this.  But I want to touch upon a product and ECO system that I've never heard of before, until 4 months ago.

Talend is a open source eclipse project that allows you to design ETL work in a visual capacity.  It stores its' configuration within XML files, and when you are ready to execute, for debugging or production, it then generates java code and compiles it.

My group has bought into the entire Talend ECO system, which combines many different open source projects into an nicely integrated product.

But I'm a ColdFusion lover, and this blog is about ColdFusion, therefore, there is two integration points that Talend provides.

First, if you buy into the ECO System, then you'll get a Administration server that schedules you, and then you can use modules the consume web services and we all know how awesome ColdFusion is at building web services!

Second, Talend Studio - the eclipse development UI, is free.  Using this free product will provide you an ability to write ETL jobs in java, compile them and then execute them from ColdFusion.

Both scenarios allow you to be a novice JAVA developer and use the powerful more complicated parts of JAVA.  For example, it is a single click to on parallelism for the entire job, therefore, long running jobs would finish in a portion of the time.

Lastly, The use-case that we are using this tool is to transition from one system to another.  And because of the scope of the project it won't be one quick cut over.  Therefore, having the ability to run the ETL job over and over and over (you get it), while developing, testing, and then in production during the transition phase is very powerful.  Along with having the entire java core to be able to transform the data is a great resource to have under your belt.

Comments

Popular posts from this blog

ColdFusion in an Enterprise Environment - Part 1 - Understanding how to use SubVersion (SVN)

coldfusion builder 2 extension not displaying browse button on type=projectdir

Being Thread Safe in Coldbox/Coldfusion