Test Center review: Open source data aces
Jitterbit 2.0 impresses with easy GUI for mastering migration projects, while Talend Open Studio 3.0 scales gracefully to meet enterprise integration demands
There are other shortcomings. Although debugging is enhanced by live data views, actual interprocess debugging capabilities are absent. Projects involving multiple data sources with requirements such as data de-duplication and orphaned record management will still require preprocessing. Jitterbit’s forte is normalizing or denormalizing translations rather than actually scrubbing data.
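To make that preprocessing concrete, here is a minimal Python sketch of the two scrubbing steps named above: de-duplication and orphaned record handling. It is not Jitterbit functionality; the function names, field names, and sample rows are all hypothetical, chosen only for illustration.

```python
def deduplicate(rows, key):
    """Keep the first row seen for each value of `key` (hypothetical helper)."""
    seen = set()
    unique = []
    for row in rows:
        value = row[key]
        if value not in seen:
            seen.add(value)
            unique.append(row)
    return unique


def split_orphans(children, parents, fk, pk):
    """Separate child rows whose foreign key matches no parent row."""
    parent_keys = {parent[pk] for parent in parents}
    kept = [child for child in children if child[fk] in parent_keys]
    orphans = [child for child in children if child[fk] not in parent_keys]
    return kept, orphans


# Hypothetical sample data: customer 3 duplicates customer 1 by e-mail,
# and order 12 points at a customer that does not exist.
customers = [
    {"id": 1, "email": "ann@example.com"},
    {"id": 2, "email": "bob@example.com"},
    {"id": 3, "email": "ann@example.com"},
]
orders = [
    {"order_id": 10, "customer_id": 1},
    {"order_id": 11, "customer_id": 3},
    {"order_id": 12, "customer_id": 9},
]

clean_customers = deduplicate(customers, "email")
kept_orders, orphaned_orders = split_orphans(
    orders, clean_customers, fk="customer_id", pk="id"
)
```

In this sample, dropping the duplicate customer also orphans order 11, which is the sort of knock-on effect that makes scrubbing a separate job from straight translation.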
Also of note: I was disappointed to discover the Jitterbit Integration Server phoning home behind my back. While generating server usage reports via the admin shell, I witnessed the Jitterbit server send a blind copy of data to jitterbit.com -- despite my expressly opting out of the User Experience Program during the installation process. Jitterbit indicated that none of the data it siphons is personally identifiable. I’ve yet to evaluate the claim.
Jittering the bits
Jitterbit won InfoWorld's 2008 Bossie Award for Data Migration for good reason: It is perhaps the most uncomplicated tool available for getting your data from point A to point B. In my testing, Jitterbit made short work of configuring source and target specifications with its form-based wizards. Although database table relationships must be defined manually, the tool did a fine job of picking up Web service details via WSDL. In addition to databases and Web services, Jitterbit can also pull data from XML, FTP, HTTP, LDAP, and flat files.
Transformation mappings are configured via a drag-and-drop wiring process. A simple double-click on a node spawns a separate interface for building formulas to modify data en route. Here you can draw on decent string manipulation tools and regular expressions. Math and logic functions could use some filling out, but a variety of other functions -- for handling XML, date and time, and e-mail -- round out the options. You can even pull live data into the transform for on-the-fly validation.
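For readers who have not used a mapping tool, the kind of en route modification described above can be sketched in Python. This is a generic illustration under assumed field names (`first`, `last`, `phone`) and an assumed 10-digit phone format; it does not reproduce Jitterbit's own formula syntax or function library.

```python
import re


def normalize_phone(raw):
    """Reduce a phone string to NNN-NNN-NNNN, or None if it does not fit.

    The 10-digit format and optional leading 1 are assumptions
    made for this example.
    """
    digits = re.sub(r"\D", "", raw)  # strip everything that is not a digit
    if len(digits) == 11 and digits.startswith("1"):
        digits = digits[1:]  # drop a leading country code
    if len(digits) != 10:
        return None
    return f"{digits[:3]}-{digits[3:6]}-{digits[6:]}"


def transform(record):
    """Map one source record to the target layout (hypothetical fields)."""
    full_name = f"{record['first'].strip()} {record['last'].strip()}".title()
    return {
        "full_name": full_name,
        "phone": normalize_phone(record["phone"]),
    }


source = {"first": " ada ", "last": "LOVELACE", "phone": "(555) 010-2030"}
target = transform(source)
```

Returning `None` for a number that cannot be normalized, rather than passing it through unchanged, lets a later validation step decide what to do with the record.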
I found job scheduling to be very flexible, and the granular ability to set runtime priorities was a plus. The onboard dependency checker is also smart, helping to provide impact analysis for easier change management across operations, including WSDL file updates.
Additional features, including a quick test of active connections and ongoing project validation, helped polish the experience. Collapsible panes and auto-formatting in the process designer help keep designs orderly. However, a thumbnail overview would make it easier to navigate larger projects. A few other minor nits -- including slow sync during object renaming and the lack of an onboard SQL builder -- were similarly easy to live with.
Administration of the Jitterbit server is done using the same client interface. User and group access controls are good. Access to projects and sub-objects can be configured to ensure that only the right users have write permission -- a nice touch.
However, queue and server management is limited. Instead of granular administrative control over engines and queues, I found manually refreshed logs and little opportunity to intervene in stuck jobs. The ability to drill down into processes from the admin UI would be welcome, as would the ability to reschedule stuck jobs or view live data streamed from multiple servers simultaneously.