Tuesday 26 July 2016

FAQS of TALEND OPEN STUDIO/BIGDATA/ENTERPRISE CERTIFICATION

**tFileInputDelimited :

Header --Enter the number of rows to be skipped in the beginning of file.

**How to partition a data flow?(Partition,Departition,Repartition of a dataflow)?


The Parallelization vertical tab allows you to configure parameters for partitioning a data flow into multiple threads, so as to handle those threads in parallel for better performance. The options that appear in this tab vary depending on the sequence of the row connection in the flow. In addition, different icons will appear in the row connection according to your selection.

Parallelization--Parallelization tab is available only on the condition that you have subscribed to one of the Talend Platform solutions or Big Data solutions

Partition row --Select this option when you need to partition the input records into a specific number of threads.
              ---It is not available to the last row connection of the flow.

Departition row --Select this option when you need to regroup the outputs of the processed parallel threads.
                --It is not available to the first row connection of the flow.

Repartition row --Select this option when you need to partition the input threads into a specific number of threads and regroup the outputs of the processed parallel threads.
                --It is not available to the first or the last row connection of the flow.

 Merge sort partitions --Select this check box to implement the Mergesort algorithm to ensure the consistency of data.
                          This check box appears when you select the Departition row or Repartition row option.


Not every component supports the dynamic schema feature. You will get an error as shown below if you select the Dynamic type for a column in the schema of a component that does not support this feature.
You can find a list of components that support the Dynamic type by following these steps:
  1. Go to the <Talend Studio installation dir>/plugins/ directory and find the jar file org.talend.core.tis_x.x.x.rxxxxxx.jar (for example: org.talend.core.tis_5.6.1.20141207_1530.jar).
  2. Extract the the resources folder from the jar file,
  3. Then open the text file supportDynamic.txt in the folder. You will see a list of components that support the Dynamic type. Such as:
tFileInputDelimited ,tFileOutputDelimited ,tAccessInput ,tAS400Input ,tDBInput ,tDB2Input ,tEXAInput ,tFirebirdInput ,tGreenPlumInput ,tHSQLDBInput ,tIngresInput ,tInformixInput ,tJavaDBInput ,tJDBCInput ,tMaxDBInput ,tMysqlInput ,tMSSqlInput ,tNetezzaInput ,tOracleInput ,tPostgresqlInput ,tPostgresPlusInput ,tParAccelInput ,tSQLiteInput ,tSasInput ,tSybaseInput ,tVectorWiseInput ,tVerticaInput ,tVerticaOutput ,tTeradataInput ,tJava ,tJavaFlex ,tJavaRow ,tLogRow ,tMap ,tOracleOutput ,tMysqlOutput ,tMSSqlOutput ,tPostgresqlOutput ,tAS400Output ,tDB2Output ,tInformixOutput ,tSybaseOutput ,tTeradataOutput ,tAggregateRow ,tSortRow ,tFilterRow ,tWriteDynamicFields ,tExtractDynamicFields ,tUnite ,tUniqRow ,tRunJob ,tReplicate ,tAggregateSortedRow ,tFilterColumns ,tJoin ,tSampleRow ,tHashInput ,tHashOutput ,tFileInputPositional ,tFileOutputPositional ,tAmazonMysqlInput ,tAmazonMysqlOutput ,tAmazonOracleInput ,tAmazonOracleOutput ,tSAPHanaInput ,tSAPHanaOutput ,tLDAPInput ,INPUT ,OUTPUT

**RELATED TO CDC
TSUBSCRIBERS is the Built-in Table and all others are created with the corresponding
TABLE NAME.

--> TSUBSCRIBERS....TALEND_CDC_TABLE_TO_WATCH, TALEND_CDC_SUBSCRIBER_NAME,TALEND_CDC_CREATION_DATE fileds.
 TCDC_CUSTOMERS....CUSTOMERS is the TABLE NAME.
                ...>TALEND_CDC_SUBSCRIBER_NAME,TALEND_CDC_STATE,TALEND_CDC_TYPE,TALEND_CDC_CREATION_DATE,ID FIELDS
CDC_Foundation...TSUBSCRIBERS,TCDC_CUSTOMERS,TCDC_VIEW_CUSTOMERS.
TABLE SCHEMAS..CUSTOMERS...View all Changes...





**Activate AMC in your Talend Studio
To activate AMC in your Talend Studio you can either activate it in your project or activate it for one individual jobs only.

To activate it for the complete project please click on

File -> Edit project properties -> Job Settings -> Stats & Logs.

Select all checkboxes  so that Logs, Statistics and volemetrics can be catched.
Check "In Database" and configure the database connection that you want to use.

Configuration of the Talend Administrative Console

The Talend Administrative Console (TAC) is able to show just the same data the Talend STudio does. To be able to display it, you need to configure your database connection, first.

Click on Dashboard -> Connections and select "Add".Enter all database connection details , including the database tables.

If this doesn´t yet work, then please check if you have a "localhost" URL under Settings -> Configuration -> Dashboard.  If so, please replace it with your actual server URL or server name.


After doing so, you should now see the AMC in the TAC.




**The types of items that you can assign to a BUSINESS MODEL ARE :
Job designs--
Metadata--
Business Models--
Documentation--
Routines (Code)--


**The Assignment tab displays in a tabular form details of the Repository attributes you allocated to a shape or a connection.

To display any assignment information in the table, select a shape or a connection in the active model, then click the Assignment tab in the Business Model view.


You can also display the assignment list placing the mouse over the shape you assigned information to.


**Comparison of 2 jobs can be done thru COMPARE-RESULT-VIEW

**INNER JOIN REJECT-->

**MILLER,RED,OHRA --> Thru which components can you create that single record data
into 3 records 
like
INPUT : MILLER,RED,OHRA
OUTPUT :
 MILLER
RED
OHRA

a)tfileoutputDelemited--not possible
b)tNormalizer--Only Possible with this
c)tmap--Possible with 3 output flows
**

COURTS : CASES : LAWYERS : JUDGES : ::::::::: VICTIMS : ACCUSED

  *We have got so many SMART people in our COUNTRY. *we have got so many IIT completed SMART students in our COUNTRY. * we have got so many ...