
Home - Dataiku Community
Welcome to the Dataiku Community: a peer-to-peer community to discuss data preparation, analytics, machine learning and AI on the Dataiku platform.
Reading partitions one at a time from Python - Dataiku Community
Mar 5, 2020 · Hi, I am trying to read a partitioned dataset using Python. I got a list of partitions using the following code.
Save DataFrame to a managed folder — Dataiku Community
I am trying to save a pandas DataFrame to a managed folder in Dataiku. My code:
Refresh/Reload Dataset - Dataiku Community
Welcome to the Dataiku Community! To answer your question you can manually refresh the sample for a SQL dataset. In DSS when you add a SQL dataset it will fetch a sample by default first 10,000 …
Writing to partitions - Dataiku Community
Hello everyone, I'm working with datasets with partitions, I found a post showing how to read a partition from a dataset but I have not found a way to write that partition down on another dataframe in the …
Troubleshooting Issues and Getting Help With Dataiku
Dataiku revolutionizes how companies work with their data, enabling any user — from beginners with no programming knowledge to experienced data scientists with advanced knowledge and complex data …
Mount shared drive to DSS managed folder - Dataiku Community
Hi all, I want to access multiple files located in a folder on a shared G drive. I read few posts which tells about mounting local/shared drives to Dataiku managed folder for accessing these files on Dataiku. …
Handling Empty or Missing Datasets Dynamically in Dataiku Python ...
Jan 9, 2025 · Hi everyone, I’m working on a dynamic process in Dataiku where I pull data from Athena using SQL, then do athena unload, and use the resulting dataset as an input for a pyspark recipe.
Add/Remove Column in existing dataset — Dataiku Community
Feb 21, 2023 · Hi All, I have an existing dataset created with 2 columns. I now want to add in a new column to the existing data set. How do I add it?
Export datasets to folders — Dataiku Community
Hello ! I would like to export several datasets of my project to the same folder (each folder for each necessary date). Does someone know how to do that ?