Which big data formats are supported by Talend? Interview Question
Talend Big Data supports many file formats, whether the data lives in HDFS or in Hive. The file types (formats) available depend on the component and on the target language that will be generated. For example, tHDFSInput supports both the Text and Sequence file types, but not ORC.
At the time of writing (v5.4.1), Talend Big Data supports the following file types:
File format | Classic Data Integration Job: HDFS | Classic Data Integration Job: Pig | Classic Data Integration Job: Hive | Map / Reduce Job |
---|---|---|---|---|
Text File | X | X | X | Option in HDFS components |
Sequence File | X | X | X | Option in HDFS components |
RC | | X | X | |
ORC (since HDP 2.0 only) | | | X | |
Avro | | X | X | Specific Avro components available |
JSON | Get/Put only | Custom Loader | | Specific JSON components available |
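If you want to answer "can I use format X with target Y?" quickly, the table above can be held as a small lookup structure. The following is only a minimal sketch in Python, not part of Talend: the names `SUPPORT`, `supported_targets` and `is_fully_supported` are hypothetical. It also assumes that the X marks for RC, ORC and Avro belong to the Pig and Hive columns, which is a reading based on the statement that the HDFS components handle only the Text and Sequence file types.

```python
# Support matrix transcribed from the table above (Talend Big Data v5.4.1).
# Assumption: RC, ORC and Avro marks are placed under Pig/Hive, because the
# text says the HDFS components support only Text and Sequence files.
# An empty string means the table has no entry for that combination.
SUPPORT = {
    "Text File": {"HDFS": "X", "Pig": "X", "Hive": "X",
                  "Map/Reduce": "Option in HDFS components"},
    "Sequence File": {"HDFS": "X", "Pig": "X", "Hive": "X",
                      "Map/Reduce": "Option in HDFS components"},
    "RC": {"HDFS": "", "Pig": "X", "Hive": "X", "Map/Reduce": ""},
    "ORC": {"HDFS": "", "Pig": "", "Hive": "X", "Map/Reduce": ""},
    "Avro": {"HDFS": "", "Pig": "X", "Hive": "X",
             "Map/Reduce": "Specific Avro components available"},
    "JSON": {"HDFS": "Get/Put only", "Pig": "Custom Loader", "Hive": "",
             "Map/Reduce": "Specific JSON components available"},
}


def supported_targets(file_format):
    """Return the targets that have any entry (full or partial) for a format."""
    row = SUPPORT[file_format]
    return [target for target, note in row.items() if note]


def is_fully_supported(file_format, target):
    """True only when the table shows a plain 'X' for this combination."""
    return SUPPORT[file_format].get(target) == "X"


if __name__ == "__main__":
    for name in SUPPORT:
        print(name, "->", ", ".join(supported_targets(name)))
```

Entries such as "Get/Put only" or "Custom Loader" are kept as text instead of being collapsed to a yes/no flag, because they describe partial support that an interviewer may ask you to explain.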