Load large csv in hadoop via Hue would only store a 64MB block

Im using the Cloudera quickstart vm 5.1.0-1 Im trying to load my 3GB csv in Hadoop via Hue and what I tried so far is: Load the csv into the HDFS and specifically into a folder called datasets...

Virtual machine "Cloudera quick start" not booting

I have recently download "QuickStart VM" on http://www.cloudera.com (precisely, the version of virtualbox) This virtual machine use centOS (and my computer is a macbook air) I can not fully start...

Why does start-all.sh from root cause "failed to launch org.apache.spark.deploy.master.Master: JAVA_HOME is not set"?

I am trying to execute a Spark application built through Scala IDE through my standalone Spark service running on cloudera quickstart VM 5.3.0. My cloudera account JAVA_HOME is...

Accessing Hue on Cloudera Docker QuickStart

I have installed the cloudera quickstart using docker based on the instructions given...

Impala Query Editor always shows AnalysisException

I am running a Quickstart VM Cloudera on a Windows 7 computer, with 8Go of RAM and 4Go dedicated to the VM. I loaded tables from a SQL database into Hive, using Sqoop (Cloudera VM tutorial...

Warning: /usr/lib/sqoop/../accumulo does not exist! Accumulo imports will fail.Please set $ACCUMULO_HOME to the root of your Accumulo installation

My VM details: Cloudera Quickstart VM 5.5.0 VM = VM workstation 12 player Windows = Windows 10 / 64 bit Java = Java 1.8 when I run the "sqoop"command , I'm facing the error...

Hue configuration error -/etc/hue/conf.empty - Potential misconfiguration detected

Hi Experts, I'm newbie to Hadoop , linux environment and Cloudera. I installed cloudera vm 5.7 on my machine and imported mysql data to hdfs using SQOOP. I'm trying to execute to some queries...

Requesting executors because tasks are backlogged

I have a spark streaming application that was running absolutely fine until yesterday and all of a sudden running into these warnings. I have the same environment and using the same code. Here are...

ACID transactions on data added from Spark not working

I'm trying to use ACID transactions in Hive but I have a problem when the data are added with Spark. First, I created a table with the following statement : CREATE TABLE testdb.test(id string,...

Can't access Hbase through Hue

So, I'm trying to access HBase through Hue browser in Cloudera VM, but I'm running into a few problems. First when I open the Hue, I get this error : Potential misconfiguration detected. Fix and...

Unable to install spark 2.2 in Cloudera Quickstart VM (5.10)

I have followed the blog (Below mentioned) here and downloaded the parcel and put as per required. Please let me know if any one has installed and the...

Trying to connect to Hadoop Cluster using Talend

I have: Cloudera quickstart VM 5.8 Talend 5.4.1 I have entered the following credentials: Manager URI(with port) http://quickstart.cloudera:7180/ username cloudera password ...

Cannot update metadata - Kafka Producer for Cloudera Quickstart VM

I just installed Cloudera QuickStart VM and added Kafka service to it. After adding the Kafka service, I could easily create a producer/consumer and everything worked as expected. After a couple...

Unable to run HIVE queries

I have a table created using HIVE query in Cloudera VM, below is my DDL to create the table called incremental_tweets. CREATE EXTERNAL TABLE incremental_tweets ( id BIGINT, created_at...

virtual box installation causes error while launching

The virtual machine 'cloudera-quickstart-vm-5.12.0-0-virtualbox' has terminated unexpectedly during startup with exit code 1 (0x1). More details may be available in 'C:\Users\Sri...

Kerberos error while connection to cloudera impala environment

While connection to kerberized hadoop environment error: [Simba][ImpalaJDBCDriver](500169) Unable to connect to server: [Simba][ImpalaJDBCDriver](500591) Kerberos Authentication failed. I've...

What's the username and password for beeline

I am using cloudera VM and I want to connect to beeline but it's asking for username and password when i am leaving empty, it's not connecting. Can someone tell me the username and...

Hue service error: Could not connect to quickstart.cloudera:21050

I have installed cloudera-quickstart-vm-5.13.0-0-virtualbox in virtual box. Configuration Details: CPU: 3 & Memory: 9000MB Now when I launch cloudera express from terminal using command sudo...

Not able to access zeppelin 8080/8180 outside VM

This question might have answered but I am not able to solve. Need some help. Issues: Not able to see http://127.0.0.1:8080/ outside VM but able to see http://127.0.0.1:8088/ and other ports. I am...

How to set JAVA_HOME Cloudera quickstart for Kafka and Zookeeper

I have added Kafka service to my Cloudera cluster and when i try to start it it fails with the following error Exception in thread "main" java.lang.UnsupportedClassVersionError:...

yarn application accepted but not running cloudera despite resource allocation

I am using a Cloudera quickstart VM 5.13.0.0 to run Spark applications in yarn-client mode. I have allocated 10GB and 3 cores to my Cloudera VM. When I submit the application, the application is...

Cloudera manager on docker is not working

I am using cloudera quickstart vm on docker on Ubuntu 18.04LTS . While I launch the vm using run command : sudo docker run --hostname=quickstart.cloudera --privileged=true -t -i -p 8888:8888 -p...

cannot reach to http://localhost:7180 using Cloudera Quickstart VM

I am Installing Cloudera Quickstart VM through Docker Hub (on Mac) sudo /home/cloudera/cloudera-manager --force [[email protected] /]# sudo /home/cloudera/cloudera-manager --express...

How to download quickstart VM 5.x for virtual box for windows 10?

How to download quickstart VM 5.x for virtual box for windows 10? I have installed oracle virtual box. But for cloudera qickstart VM I am not getting any source. I have searched a lot in google...

Where is the hive-site.xml in Cloudera distribution?

I would like to know where the hive-site.xml file configuration is in a Cloudera distribution. Mainly because I would like to know where I can find out properties...

Not able to download Cloudera

I am trying to find a link to download cloudera zip file on VMWare , but unable to get any. Tried searching on google , on cloudera website , but in vain. Can somebody share some views on it.

Which Distribution CDH Vs HDP

I happened to work on CDH longtime back ( around 1 year) and am planning to start again.Now we had CDH , HDP and Hortonwork acquired by Cloudera . Is HDP being developed actively ? Or Is CDH...

How to load data from CSV into an external table in impala

I am following this solution for loading an external table into Impala as I get the same error if I load data by referring to the file. So, If I run: [quickstart.cloudera:21000] > create external...

How to get hortonworks data platform and cloudera distribution for hadoop latest version

I'm currently working on CDH5.13 (Cloudera Distribution Hadoop), and i have a couple of questions: 1- I want to get the latest version of CDH(6.3.3). When i try to download it, i have this message...

Installing Cloudera Quick start VM on M1 macOs

Currently I am learning Hadoop. Previously I used lab where I can access the Hadoop ecosystem. Recently I got M1 Mac and I want to run the same through Cloudera quick start VM. I do know that it...