About Talend

Talend Open Studio operates as a code generator: it generates data transformation scripts and their underlying programs in either Java or Perl. Its GUI is made of a metadata repository and a graphical designer. The metadata repository contains the definitions and configuration for each job. The information in the metadata repository is used by all of the components of Talend Open Studio.

Course Details : http://talend-training.blogspot.in/2013/04/talend-training-course-details.html

Install Wowza Media Server on CentOS Linux

Wowza Media Server is high-performance unified streaming media and video software for any screen.

Below are the instructions to install Wowza.

      Step 1:  Connect to the CentOS server as root or as a user with sudo privileges.

      Step 2:  Download the installer from the internet:

   wget http://www.wowza.com/downloads/WowzaMediaServer-3-5-2/WowzaMediaServer-3.5.2.rpm.bin

      Step 3:  Run the downloaded installer and accept the license agreement.

                   Wowza is now installed at the location below:

      Step 4:  Execute startup.sh:

              sh startup.sh

      Step 5:  Enter the license key.

That’s it, now you’ve got Wowza Media Server up and running on your CentOS server.

What is the difference between the having clause and the group by?

The difference is that WHERE operates on individual rows, while HAVING operates on groups.

You can have WHERE without HAVING, you can have HAVING without WHERE, you can have both WHERE and HAVING, and you can have neither WHERE nor HAVING. But you can't have HAVING without grouping, even if the group consists of the entire result set.
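
To make the row-versus-group distinction concrete, here is a minimal Python sketch using the standard-library sqlite3 module. The orders table, its columns, and its rows are invented for illustration only. It shows one query that uses both clauses.

```python
import sqlite3

# Hypothetical table and rows, invented for this illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (customer TEXT, amount INTEGER)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [("ann", 50), ("ann", 500), ("bob", 300), ("bob", 400), ("cat", 20)],
)

# WHERE drops individual rows (amount below 100) before grouping.
# HAVING then drops whole groups whose total is not above 520.
rows = conn.execute(
    """
    SELECT customer, SUM(amount) AS total
    FROM orders
    WHERE amount >= 100
    GROUP BY customer
    HAVING SUM(amount) > 520
    ORDER BY customer
    """
).fetchall()
print(rows)  # [('bob', 700)]
```

Without the WHERE clause, ann's total would be 550 and her group would also pass the HAVING filter. With it, her 50 row is discarded before grouping, so only bob remains.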

In SQL, the having clause and the group by clause work together when using aggregate functions like SUM, AVG, MAX, etc. This is best illustrated by an example. Suppose we have a table called emp_bonus as shown below. Note that the table has multiple entries for employees A and B.

If we want to calculate the total bonus that each employee received, then we would write a SQL statement like this:

select employee, sum(bonus) from emp_bonus group by employee;

The Group By Clause

In the SQL statement above, you can see that we use the "group by" clause with the employee column. The group by clause allows us to find the sum of the bonuses for each employee. Using ‘group by’ in combination with ‘sum(bonus)’ gives us the total bonus for each of employees A, B, and C.

Running the SQL above would return this:


Now, suppose we wanted to find the employees who received more than $1,000 in bonuses for the year of 2007. This is when we need to use the HAVING clause, and this is what the SQL looks like:

select employee, sum(bonus) from emp_bonus
group by employee having sum(bonus) > 1000;

And the result of running the SQL above would be this:


Difference between having clause and group by statement
So, from the example, we can see that the group by clause is used to group rows by one or more columns so that aggregates (like SUM, MAX, etc.) can be computed for each group. The having clause is used with the group by clause when a condition needs to be applied to those aggregate results (like "> 1000"). So, the having clause and the group by clause are not alternatives to each other; they are used alongside one another.
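
The two emp_bonus queries can be tried out with a short Python sketch using the standard-library sqlite3 module. The bonus amounts below are made up for this illustration (the original table's values are not reproduced here), and the ORDER BY clause is added only to make the output order predictable.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp_bonus (employee TEXT, bonus INTEGER)")
# Hypothetical rows: employees A and B each have multiple entries.
conn.executemany(
    "INSERT INTO emp_bonus VALUES (?, ?)",
    [("A", 1000), ("B", 600), ("A", 500), ("B", 600), ("C", 700)],
)

# Total bonus per employee.
totals = conn.execute(
    "select employee, sum(bonus) from emp_bonus "
    "group by employee order by employee"
).fetchall()
print(totals)  # [('A', 1500), ('B', 1200), ('C', 700)]

# Only the employees whose total bonus exceeds 1000.
over_1000 = conn.execute(
    "select employee, sum(bonus) from emp_bonus "
    "group by employee having sum(bonus) > 1000 order by employee"
).fetchall()
print(over_1000)  # [('A', 1500), ('B', 1200)]
```

With these sample amounts, employee C is grouped and summed like the others but is filtered out by the having clause because the total of 700 is not above 1000.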

Database Training: Microsoft SQL Server Online Training

1. SQL Server and Toolset · SQL Server Platform · ...

Iterate and load multiple files into a database using tFileList and tMysqlOutput

This tutorial explains the steps to load multiple files into a database.

Step 1: Open Talend

·         Create a new project or open an existing project.

Step 2: Create sample CSV files

·         File1, File2, File3, File4

·         All of the files share the same format (schema).

·         Create a new folder named ‘fileprocessfolder’ and put all of the files in it. The job reads the files from this folder.

Step 3: Create a new job in Talend

·         Right-click ‘Job Designs’ in the Repository window and select the ‘Create job’ option.

·         Name the job ‘filelist_fullload’.

·         Click the Finish button.

Step 4: Create metadata for one sample file

·         In the Repository window, click the arrow next to ‘Metadata’, right-click ‘File delimited’, and select ‘Create file delimited’.

·         Enter a name such as ‘file_process’ and click the Next button.

·         Add the metadata file to the repository.

·         Click the Browse button, select the sample file ‘file1’, and click the Next button.

·         On this screen, open the combo box for the field separator and select ‘Comma’ instead of ‘Semicolon’. Select the check box that sets the heading row as column names, click Refresh to update the preview, and click the Next button.

·         On this screen, click the Finish button. The window closes automatically.

Step 5: Create a database connection to a MySQL database

·         If you already have a database connection for loading data into the target table, there is no need to create another one. Reuse the existing connection with the given credentials.
·         If you don't have a database connection, follow the steps below to create one.
·         Step-1
·         Log in to MySQL with a valid password. At the mysql prompt, create the database with a command like this:
·          mysql> create database targetdb;    (here targetdb is the new database name)

·         After creating the new database, grant all permissions on it.
·         This command grants all permissions on the database:
·         mysql> grant all on database_name.* to 'username'@'%' identified by 'password';

·         Step-2
·          In the Talend Repository window, click the arrow next to ‘Metadata’, right-click ‘DB Connections’, and select the ‘Create connection’ option.

·         The new database connection window opens. In its first step, enter the name and click the Next button.

·         In the second step, select the desired database type and version, and fill in all of the fields with valid credentials. Click the Check button to test whether the connection succeeds. If it succeeds, click the Finish button.

·         Here this database is used only to load the result data, that is, as the target database.

Step 6: Design the sample job

·         The job ‘filelist_fullload’ was already created in Step 3.
·         In the Repository window, click the arrow next to ‘Job Designs’, right-click the ‘filelist_fullload’ job, and select the ‘Edit job’ option.
·         The metadata created in Step 4 for one sample file serves as the input file. With the tFileList component, the same input can process every file that shares that format (schema).
·         Under ‘Metadata’, expand ‘File delimited’, select the file to use as input, and drag and drop it onto the job design workspace.

·         Click the OK button.
·         In the Palette on the right side, type tMap in the search box and press Enter. Drag the tMap component and drop it onto the job design workspace.
·         Right-click the ‘file_process’ component, select Row > Main, and connect the row to the tMap component.

·         In the Palette search box, type tMysqlOutput and press Enter. Drag the component and drop it onto the job design workspace.

·         Arrange the components in order so that the connections are easier to draw and follow.
·         Right-click the tMap component, select Row > New Output, and connect it to the tMysqlOutput component. A window asks for the new output name. Enter a name that relates to the output and contains no spaces, then click the OK button.

Step 7: Component settings for the sample job
·         To make the difference easier to see, this sample job is run first as a normal job, before it is changed to iterate over multiple files in a single job.
·         Set the properties of the three components one by one.
·         Double-click the ‘file_process’ component to open its Basic settings at the bottom of the job design window.
·         No changes are needed here, because the settings were already provided when the metadata was created.

·         Next, double-click the tMap component to open its settings in a new window.

·         Select all of the columns from row1 (row1 holds the input, or source, file) and drag and drop them onto ‘loadallfiles’ (the output).
·         Selecting every column is optional. Select only the columns your requirement calls for.

·         There are more options here as well, such as changing the data types and lengths of columns, and adding or removing columns on either side (input or output).

·         Click the OK button to close the tMap settings window.
·         tMysqlOutput settings
·         Double-click the tMysqlOutput component to open its Basic settings, which start in Built-In mode.

·         Change the Property Type to Repository. All of the fields are filled in automatically with valid credentials except the table name, which is entered manually, for example "fileprocess".
·         Also set ‘Action on table’ to "Create table if not exists". This creates the table in the target database if it does not already exist.

Step 8: Run the sample job

·         Click the Run button or press F6. The job runs and displays its start time, end time, and status.

Step 9: Design the job using the tFileList component

·         Continue with the previous job, ‘filelist_fullload’.
·         In the Palette, type tFileList in the search box, then drag the component and drop it in the upper-left corner of the job design workspace.

·         Right-click tFileList, select Row > Iterate, and connect the row to the input metadata file component "file_process".

·         tFileList settings
·         Double-click the tFileList component to open its Basic settings below the job design workspace.

·         In Basic settings, set the directory, for example "D:/5.3output/fileprocessfolder.csv". The input files to process are located in this directory.

·         Input metadata file component settings (‘file_process’ in this example)
·         Double-click the component, change the Property Type from Repository to Built-In, and set File name/Stream to ((String)globalMap.get("tFileList_1_CURRENT_FILEPATH")).

·         tMap settings
·         No changes to the existing settings are needed.
·         tMysqlOutput component settings
·         Double-click the component and go to Basic settings. Change the table name, because the previous name was already used by the earlier job, then select an ‘Action on table’ that fits your requirement.

·         Run the job
·         Click the Run button or press F6.
·         The job executes successfully.
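
For readers who want to see the iterate-and-load pattern outside the designer, here is a rough Python sketch of the same idea. It is not the code Talend generates. It assumes every file has a header row and the same columns, it uses the standard-library sqlite3 module as a stand-in for MySQL, and the function name load_folder and the table name fileprocess_all are hypothetical.

```python
import csv
import sqlite3
from pathlib import Path


def load_folder(folder, conn, table="fileprocess_all"):
    """Load every CSV file in `folder` into one table; return rows loaded."""
    total = 0
    # Like tFileList: visit each matching file in the folder in turn.
    for path in sorted(Path(folder).glob("*.csv")):
        with open(path, newline="") as handle:
            reader = csv.reader(handle)
            header = next(reader)  # first row holds the column names
            columns = ", ".join(f'"{name}"' for name in header)
            marks = ", ".join("?" for _ in header)
            # Like the "create table if not exists" action on the output.
            conn.execute(f'CREATE TABLE IF NOT EXISTS "{table}" ({columns})')
            rows = [row for row in reader if row]  # skip blank lines
            conn.executemany(f'INSERT INTO "{table}" VALUES ({marks})', rows)
            total += len(rows)
    conn.commit()
    return total
```

Calling load_folder("fileprocessfolder", sqlite3.connect("target.db")) would load File1 through File4 into a single table, which mirrors what the iterate link does when it passes the current file path to the input component on each pass.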

Talend Training Course Details

             Talend Course Details

Data Warehousing Introduction
·         Overview of the concept of Data Warehouse.
·         Dimensions, Hierarchy, Facts
·         DW models:- Star and Snowflake schemas.
·         Introduction to Talend
·         Talend Data Integration Overview

Talend Introduction
·         Talend Environment – Overview
·         Talend installation
·         Starting Talend job design and development
·         Repository, Designer and Palette
·         Talend Design

Talend Jobs Designing
·         Types of Components
·         Basic Components - Overview
·         Component Properties
·         Database connectivity components
·         Triggers in Talend
·         Sample Job designing
·         Job Execution
·         Usage of tMap component and other important components from all sections like Processing, Orchestration, File, database etc.
·         Exercises for job development (hands on training)
·         How to Capture information about job execution
·         How to create a table to store monitoring information
·         Detailed Log Generation:
·         Number of records inserted/ updated/ rejected. In case of failure, details of the failed records.
·         Exercises for job development (hands on training)
·         Optimization for running the job

Talend Context and Variables
·         Context Variables - Overview
·         Creation and Usage of Context Variables and Context Groups
·         Dynamic Job designs using Context variables
·         Making database connectivity parameters as job contexts
·         Configuring the job to run it on remote machine
·         Bulk Insert/ Update
·         How to execute the jobs remotely
·         Command line utility and basic and advanced commands
·         How to clean enhance data and reference data
·         How to migrate jobs from DEV to TESTING to PROD environment
·         Case study and hands on development training on the required jobs

Metadata in Talend
·         Built-in Connections
·         Shared Connections
·         Source and Destination Connections
·         Database Connections
·         Usage of connections in Job

Logs & Error
·         Logs and execution statistics in Talend
·         Error Handling in Talend
·         Logs & Error Handling Components
·         Email notification generation in case of failure
·         Optimization for running the job

·         Real-time scenarios on all discussed components
·         Optimization for running the job
·         How to migrate jobs from DEV to TESTING to PROD environment
·         Case study and hands on development training on the required jobs
·         How to run jobs in a Linux environment
·         How to integrate Talend jobs with Java.
*Resume preparation is also provided.
*After this training you can handle any Data Integration (ETL) or Migration projects independently.
*I will also provide Talend job support.
*I will also provide a free demo session. Please call me on the number below.

Raja K
+91 9247360004