Code: Select all
pysmile.learning.BayesianSearch.learn(...)
For example, could we tell SMILE somewhere to use a specific number of threads? Or is there a valid way to divide-and-conquer the learning problem before the "...learn()" call?
When it comes to the later data fitting part, I could easily parallelize testing different mappings to the network nodes using app-level threading, just as described here: viewtopic.php?p=393#p393
For training on a data set with 30 nodes and around 35k records, it took my PC about 5 hours to learn the network (which is actually pretty nice).
So, to be a bit greedy, it would be even better if there were options to speed up the process and make it more scalable.