Python Impyla fails after Kerberos install

I'm on a W7 machine, where I use Python (Anaconda distribution) to connect to Impala in our Hadoop cluster using the Impyla package. My company has recently added Kerberos and that ended up...

Connect to Impala using impyla client with Kerberos auth

I'm on a W8 machine, where I use Python (Anaconda distribution) to connect to Impala in our Hadoop cluster using the Impyla package. Our Hadoop cluster is secured via Kerberos. I have followed the...

Python: How do I find which pip package a library belongs to?

I received a script from someone else, and it imports a module. I'm wondering what the best way is to find out which pip package installed this library (other than...
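One possible way to approach this from inside Python is sketched below. It relies on `importlib.metadata.packages_distributions()`, which is only available on Python 3.10 and later. The helper name `distributions_for` and its optional `mapping` parameter are hypothetical and exist only for illustration.

```python
from importlib import metadata


def distributions_for(module_name, mapping=None):
    """Return the installed distribution names that provide a module.

    `mapping` is a hypothetical hook for supplying a precomputed
    {top-level module: [distribution names]} dict; when omitted, the
    mapping is read from the current environment (Python 3.10+ only).
    """
    if mapping is None:
        mapping = metadata.packages_distributions()
    # Only the top-level package name is recorded in the mapping.
    top_level = module_name.split(".")[0]
    return sorted(set(mapping.get(top_level, [])))
```

In an environment where impyla is installed, `distributions_for("impala.dbapi")` should list `impyla`, since that distribution ships the top-level `impala` package. An empty list means no installed distribution claims the module.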

Getting detailed Impyla error message

When I execute a SQL statement in Impala using Python/Impyla, I am just getting an exception with a generic error message like "Operation is in ERROR_STATE". How do I get more detailed...

Error importing Impyla library on Windows

I'm having trouble using the impyla library on Windows. I installed it with pip install impyla. An error occurred when I tried to import the impyla library in Python code: from impala.dbapi import...

impala connection via sqlalchemy

I'm new to Hadoop and Impala. I managed to connect to Impala by installing impyla and executing the following code. This connects via LDAP: from impala.dbapi import connect from impala.util...

Executing Hive Scripts in Impyla

The examples I've seen for Impyla are for executing command line queries, i.e. the equivalent of running hive -e 'select * from my_db.my_table'. Is there functionality in Impyla to be able to run...

impyla (0.14.0) ERROR - 'TSocket' object has no attribute 'isOpen'

I am getting the following error while trying to create a connection to HiveServer: Traceback (most recent call last): File...

impyla - as_pandas - empty dataframe

I have some simple impyla code, and I would like to create a pandas DataFrame from my cursor. My code runs, but my DataFrame is always empty. If I run my query directly on Impala,...

Impyla return code 1 from org.apache.hadoop.hive.ql.exec.mr.MapRedTask when querying HiveServer2

I am using Impyla to query some results from Hive, but I ran into this problem. From Impyla: impala.error.OperationalError: Error while processing statement: FAILED: Execution Error, return...

Python error when building Python package Docker Image

I have installed Docker on a RHEL7 server and it is running. I am trying to build my first Docker Image that I found on GitHub to build a python library docker image for use with Demisto....

How to connect to Apache Hadoop with Impyla and Kerberos

First of all, I also read this question (since it seems to be similar). My problem is that I am also trying to connect to our Apache Hadoop system, which is now secured by Kerberos. I use the impyla...

Using Python to connect to Impala database (thriftpy error)

What I'm trying to do is very basic: connect to an Impala db using Python: from impala.dbapi import connect conn = connect(host='impala', port=21050, auth_mechanism='PLAIN') I'm using Impyla...

create a dockerfile to run python and groovy app

I am working on a project which is using both python and groovy to scrape data from websites and do some engineering on that data. I want to create a dockerfile which should have a python(3.6.5)...

Trying to load Python dataframe into Hadoop (Impala) using `ibis`, getting "AttributeError: module 'ibis' has no attribute 'impala' "

I'm running the following block of Python commands in a Jupyter notebook to upload my dataframe, labeled df, to Impala: import hdfs from hdfs.ext.kerberos import KerberosClient import pandas as...

python - unable to connect to TLS1.2 enabled HiveServer2

I have HiveServer2 with SSL (minimum TLS1.2 enabled only) and LDAP enabled, no kerberos enabled. hive.server2.transport.mode = binary. Beeline connections work fine like: beeline -u...

How to connect to impala using impyla or to hive using pyhive?

I am trying to connect to impala using impyla with this code: from impala.dbapi import connect conn = connect(host='host_name.com', port=21050, user='usr', password='pass', use_ssl=True,...

Python error when running os.system("kinit") - sh: 1: kinit: not found

I am building a Python Docker image and am testing out the kinit capability. When I run `os.system('kinit')` I receive an error. FROM python:3.5.7-buster ADD krb5.conf...

Python - unable to read a large file

How do I read a large table from hdfs in jupyter-notebook as a pandas DataFrame? The script is launched through the docker...

How to understand zlib-compressed query profiles of Apache Impala

Impala currently saves query profile logs at /var/log/impala/profiles, one per line, in the format <Epoch-Timestamp> <QueryID> <zlib-compressed-data>. As mentioned in their documentation at...
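A minimal sketch of splitting one such line into its three fields and decompressing the last one is shown below. It assumes the compressed field is stored as base64 text, which the excerpt above does not state, and the function name `decode_profile_line` is hypothetical.

```python
import base64
import zlib


def decode_profile_line(line):
    """Split one profile log line and decompress its payload.

    Assumes the third field is base64-encoded, zlib-compressed data;
    the base64 layer is an assumption, not taken from the excerpt.
    """
    # The format is three space-separated fields; split at most twice
    # so the payload is kept in one piece.
    timestamp, query_id, payload = line.strip().split(" ", 2)
    raw = zlib.decompress(base64.b64decode(payload))
    return int(timestamp), query_id, raw
```

The returned bytes are most likely a binary serialization of the profile rather than readable text, so a further decoding step would still be needed to inspect the individual counters.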

how to use pyhive in lambda function?

I've written a function that uses pyhive to read from Hive. It works fine when run locally. However, when trying to use it in a Lambda function I got the error: "Could not start SASL: b'Error in...

Ibis create impala table with pandas dataframe and get [Error 61] Connection refused

After running a SQL statement with impyla, I convert the results into a pandas DataFrame. Now I want to automatically create a temporary table on Impala, using Apache Ibis to create the table and load a...

How to Connect Superset to Redis with Password?

I'm trying to set up Apache Superset in production mode, and all went well until the Redis connection. I installed Superset and Redis and made the connection config in superset_config.py. When...

Error while running query on Impala with Superset

I'm trying to connect Impala to Superset. When I test the connection, it prints "Seems OK!", and when I try to see the databases on Impala with the SQL Editor, the left side shows all databases...

How to Impersonate Impala queries on Superset

I'm setting up Superset (0.36.0) in production mode (with Gunicorn), and I would like to set up impersonation when running Impala queries on my Kerberized cluster, so that each Superset user has...

AWS Lambda Error: Unable to import module 'function_name': No module named 'module._module'

Please see the screenshots in particular after reading. I am deploying a python script on AWS Lambda which uses the package impyla which has a dependency on the package bitarray. from impala.dbapi...

How to get query id for impala queries executed via sqlalchemy

I am querying impala using sqlalchemy which internally uses impyla. from sqlalchemy import create_engine from sqlalchemy.orm import sessionmaker engine =...

Impyla connection. Cannot start SASL. No mechanism available

I am trying to connect to Impala using impyla, but each time I get this error: Could not start SASL: b'Error in sasl_client_start (-4) SASL(-4): no mechanism available: Unable to find a...

Conda 'Package conflicts for' errors?

Win 10, Python 2.7, Miniconda 4.8.3 conda env create -f environment.yml Package setuptools conflicts for: pip=19.0.3 -> setuptools ... Package openssl conflicts for: pyparsing=2.2.0 -> python ->...

impyla : how to setup mem_limit?

I'm using impyla==0.16.2 on Python 3.8.3. I tried to execute set mem_limit=1G, but the query still fails with the mem_limit error. That should be resolved, because if I follow the same...