Open Source ETL tools vs Commercial ETL tool The ETL-tools are validated on the following categories √ Infrastructure √ Functionality √ Usability √ Platforms supported √ Debugging facilities √ Data Quality / profiling √ Performance √ Future prospects √ Reusability √ Scalability √ Batch vs Real-time √ Native connectivity Pentaho Kettle vs Talend Pentaho Pentaho is a commerical open-source BI suite that has a product called Kettle for data integration. It uses an innovative meta-driven approach and has a strong and very easy-to-use GUI. The company started around 2001 (2002 was when kettle was integrated into it). It has a strong community of 13,500 registered users. It has a stand-alone java engine that process the jobs and tasks for moving data between many different databases and files. It can schedule tasks (but you need a schedular for that - cron). It can run remote jobs on "slave servers" on other machines. It has data quality features: fro...
Error Handling Mechanism in Talend Open Studio Three Error Handling Strategies in Talend Open Studio You can recover from some errors. Others, like system or network failures are fatal. But even in the fatal case, your Talend Open Studio job should die gracefully, notifying the operations team and leaving the data in a good state. This post presents three error handling strategies for your Talend jobs. Some Talend Open Studio job errors are alternate paths that, though infrequent, occur often enough to justify special programming. This programming may come in the form of guard conditions, special logic applied to route the special case to another sub job. For an example of these type of errors, see this blog post on ETL Filter Patterns . Other errors are related to system and network activity or are bugs. There are a few ways to handle this class of error in Talend Open Studio. Do Nothing For simple jobs, say an automated administrative t...