Thanks for the amazing project: all efforts which aim at making Pandas faster are to be commended, and this one looks really good.
I see that Modin can use either Dask or Ray as an engine. Looking at https://github.com/modin-project/modin#api-coverage it also seems that the API coverage is similar. However, by looking at a few issues:
I got the impression that Modin on Ray is more mature (there are also some threads here, but as a new user I cannot add more than 2 links to a post). Am I correct? I’ll be running Modin in Docker, so I can install either Dask or Ray as an engine. Since I have some (limited) experience with Dask (and 0 w/ Ray), and I already have Dask as a dependence in some projects, I’d rather use it than introduce another dependence. However, if Modin on Ray is more stable, I’ll bite the bullet and just add the Ray dependency.