Global web icon
stackoverflow.com
https://stackoverflow.com/questions/57820724/why-d…
python - Why does Dask perform so slower while multiprocessing perform ...
36 dask delayed 10.288054704666138s my cpu has 6 physical cores Question Why does Dask perform so slower while multiprocessing perform so much faster? Am I using Dask the wrong way? If yes, what is the right way? Note: Please discuss with this particular case or other specific and concrete cases. Please do NOT talk generally.
Global web icon
stackoverflow.com
https://stackoverflow.com/questions/44657631/strat…
python - Strategy for partitioning dask dataframes efficiently - Stack ...
The documentation for Dask talks about repartioning to reduce overhead here. They however seem to indicate you need some knowledge of what your dataframe will look like beforehand (ie that there w...
Global web icon
stackoverflow.com
https://stackoverflow.com/questions/72363756/readi…
python - Reading an SQL query into a Dask DataFrame - Stack Overflow
However, passing a meta attribute to read_sql_query and setting head_rows=0 is completely ok as long as there's an efficient way to retrieve/create while dask-sql might work for this case, using it is not an option, unfortunately How can I go about correctly reading an SQL query into dask dataframe?
Global web icon
stackoverflow.com
https://stackoverflow.com/questions/45545110/make-…
dask - Make Pandas DataFrame apply () use all cores? - Stack Overflow
As of August 2017, Pandas DataFame.apply() is unfortunately still limited to working with a single core, meaning that a multi-core machine will waste the majority of its compute-time when you run df.
Global web icon
stackoverflow.com
https://stackoverflow.com/questions/41453473/defau…
python - Default pip installation of Dask gives "ImportError: No module ...
Find the dask package suitable for your python version from the release history page of the pypi dask page. Go back to colab and remove dask completely !pip uninstall dask
Global web icon
stackoverflow.com
https://stackoverflow.com/questions/50355598/how-s…
python - How should I get the shape of a dask dataframe ... - Stack ...
How should I get the shape of a dask dataframe? Asked 7 years, 6 months ago Modified 1 year, 7 months ago Viewed 26k times
Global web icon
stackoverflow.com
https://stackoverflow.com/questions/53394935/what-…
What is the "right" way to close a Dask LocalCluster?
19 I am trying to use dask-distributed on my laptop using a LocalCluster, but I have still not found a way to let my application close without raising some warnings or triggering some strange iterations with matplotlib (I am using the tkAgg backend).
Global web icon
stackoverflow.com
https://stackoverflow.com/questions/68231492/impor…
python - importing large CSV file using Dask - Stack Overflow
I am importing a very large csv file ~680GB using Dask, however, the output is not what I expect. My aim is to select only some columns (6/50), and perhaps filter them (this I am unsure of because ...
Global web icon
stackoverflow.com
https://stackoverflow.com/questions/44432868/dask-…
python - dask dataframe apply meta - Stack Overflow
This is required because apply() is flexible enough that it can produce just about anything from a dataframe. As you can see, if you don't provide a meta, then dask actually computes part of the data, to see what the types should be - which is fine, but you should know it is happening.
Global web icon
stackoverflow.com
https://stackoverflow.com/questions/50569171/how-d…
python - How do I find the length of a dataframe in dask ... - Stack ...
Running len () or compute () on a dask dataframe with several million entries takes longer than the equivalent in pandas. I know I can find the number of partitions with df_dask.npartitions (which is very fast) but is there no attribute that stores the total length/ length of each partition?