
Exporting a Kettle Repository to Files

Posted 25th November 2009, 12:03 PM

Hi All!

Today I'd like to announce KREX, a small solution I put together to export a Kettle (a.k.a. Pentaho Data Integration) repository to individual transformation (.ktr) and job (.kjb) files.

The idea for this was inspired by this thread on the Pentaho forums, started by kandrews. He (she?) wrote:
"Has anyone ever been able to export a PDI repository and convert it somehow into regular non-repository .kjb & .ktr files? If you have done this already or this functionality already exists please let me know.

My initial thoughts are possibly an XLS translation against the XML from the repository export. Thoughts?"
Well, I hope this helps! Enjoy, and let me know if it's useful. Be advised that in the same thread, Matt Casters already revealed that the functionality to do this will soon be built into PDI, but until then this may be of use.

To start using KREX,
  • Check out the repository or download the job and transformation files to your file system.
  • Open the main job file export_repository_to_files.kjb using Spoon from Pentaho Data Integration 3.2 (currently a Milestone 1 release).
  • Configure the Set Source Repository step in the set_source_repo_and_target_directory transformation to match the repository you want to export.
  • Run the main job file (export_repository_to_files.kjb)
If all goes well, you should now have a directory called pdi_repo_export in your home directory. It contains a subdirectory named after your exported repository, which in turn holds the directory tree with the .ktr and .kjb files.
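To check the result of an export, a small script can count the files that ended up in the export directory. The following is only a sketch in Python and is not part of KREX: the function name summarize_export and the repository name my_repository are made up for illustration, and the only things taken from the description above are the pdi_repo_export directory in the home directory and the .ktr and .kjb extensions.

```python
from collections import Counter
from pathlib import Path


def summarize_export(export_root: Path) -> Counter:
    """Count .ktr and .kjb files anywhere below export_root."""
    counts = Counter()
    for path in export_root.rglob("*"):
        # Only regular files with one of the two Kettle extensions are counted.
        if path.is_file() and path.suffix in (".ktr", ".kjb"):
            counts[path.suffix] += 1
    return counts


if __name__ == "__main__":
    # "my_repository" is a placeholder; use the name of your own repository.
    root = Path.home() / "pdi_repo_export" / "my_repository"
    if root.is_dir():
        counts = summarize_export(root)
        print(f"transformations: {counts['.ktr']}, jobs: {counts['.kjb']}")
    else:
        print(f"No export found at {root}")
```

Comparing these counts with the number of transformations and jobs shown in the repository explorer gives a quick sanity check that nothing was skipped.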

Here's a quick screenshot of the main job, just to give you an idea:
The heart of the job is formed by the very last transformation, which does the actual legwork of extracting and saving the individual transformations:

The steps before that are mainly configuration and ensuring that the directory tree that is to contain the files is created before we attempt to write any files.
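The same ordering (first create the whole directory tree, then write the files) can be illustrated outside Kettle. The Python sketch below is not how KREX is implemented, since KREX consists of Kettle jobs and transformations; the function write_exports and the (directory, file name, XML text) tuples are hypothetical and only serve to show the two passes.

```python
from pathlib import Path


def write_exports(target_root: Path, items) -> list:
    """Write (repo_directory, file_name, xml_text) tuples below target_root.

    All directories are created in a first pass, so the second pass
    never tries to write into a directory that does not exist yet.
    """
    items = list(items)

    # Pass 1: create every directory that will receive a file.
    for repo_dir in sorted({repo_dir for repo_dir, _, _ in items}):
        (target_root / repo_dir.strip("/")).mkdir(parents=True, exist_ok=True)

    # Pass 2: write the files into the directories created above.
    written = []
    for repo_dir, file_name, xml_text in items:
        path = target_root / repo_dir.strip("/") / file_name
        path.write_text(xml_text, encoding="utf-8")
        written.append(path)
    return written
```

Creating the directories up front keeps the writing step simple: it does not need to know anything about the shape of the tree.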

If you have any suggestions or comments, I welcome you to post them here. If you are trying to use KREX but run into an issue, please use the KREX issue list.

If you are looking for more tips and tricks with Kettle and Pentaho in general, stay tuned. The "Building Pentaho Solutions" book I'm writing for Wiley together with Jos van Dongen will contain tons and tons of practical tips and solutions, and explain many of Pentaho's technologies and concepts in thorough detail.

Cheers and until next time,

Roland


More from Roland Bouman's Blog ...