I am getting a sink error when I try to fetch twitter tweets. I added the Twitter API configurations and created a directory in HDFS. I'm not sure what I am doing wrong. I am using Hadoop 2.0.0-cdh4.2.1
ERROR flume.SinkRunner: Unable to deliver event.
ERROR hdfs.HDFSEventSink: process failed
Exception follows.
java.lang.UnsupportedOperationException: This is supposed to be overridden by subclasses.
flume.conf
# Naming the components on the current agent.
TwitterAgent.sources = twitter
TwitterAgent.channels = memoryChannel
TwitterAgent.sinks = HDFS
# Describing/Configuring the source
TwitterAgent.sources.twitter.type = org.apache.flume.source.twitter.TwitterSource
TwitterAgent.sources.twitter.consumerKey = Xxx
TwitterAgent.sources.twitter.consumerSecret = Xxx
TwitterAgent.sources.twitter.accessToken = Xxx
TwitterAgent.sources.twitter.accessTokenSecret = Xxx
TwitterAgent.sources.twitter.maxBatchDurationMillis = 200
TwitterAgent.sources.twitter.channels = memoryChannel
TwitterAgent.sources.twitter.keywords = lsu
TwitterAgent.channels.memoryChannel.type = memory
TwitterAgent.channels.memoryChannel.capacity = 10000
TwitterAgent.channels.memoryChannel.transactionCapacity = 1000
TwitterAgent.sinks.HDFS.type = hdfs
TwitterAgent.sinks.HDFS.channel = memoryChannel
TwitterAgent.sinks.HDFS.hdfs.path = hdfs:/user/flume/tweets/
TwitterAgent.sinks.HDFS.hdfs.fileType = DataStream
TwitterAgent.sinks.HDFS.hdfs.writeFormat = Text
TwitterAgent.sinks.HDFS.hdfs.batchSize = 1000
TwitterAgent.sinks.HDFS.hdfs.rollSize = 0
TwitterAgent.sinks.HDFS.hdfs.rollCount = 10000
TwitterAgent.sinks.HDFS.hdfs.useLocalTimeStamp = true
Copyright Notice:Content Author:「pinkco12」,Reproduced under the CC 4.0 BY-SA copyright license with a link to the original source and this disclaimer.
Link to original article:https://stackoverflow.com/questions/49886162/hadoop-using-flume-to-pull-twitter-tweets