I've already run into some problems:
- Spatial data is not serializable
- Lack of good documentation from Parallel Python
- No built-in way to upload input data or download results if your function works with files
- Servers time out
- Tasks randomly restart
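On the first problem: pp pickles a job's arguments before sending them to a server, so it can help to check up front that what you pass is picklable, and to pass plain values (a path to a feature class, or WKT text) instead of live objects. Here is a minimal sketch using only the standard library. The helper name `is_picklable` and the sample path are made up for illustration and are not part of pp or arcpy, and a thread lock stands in for an object that cannot be serialized.

```python
import pickle
import threading


def is_picklable(obj):
    """Return True if obj can be serialized with pickle, False otherwise."""
    try:
        pickle.dumps(obj)
        return True
    except Exception:
        return False


# Plain values such as a path string or WKT text pickle without trouble.
# (Hypothetical path and geometry, for illustration only.)
job_args = (r"C:\data\parcels.gdb\lots", "POINT (1 2)")
print(is_picklable(job_args))

# Objects that wrap live handles do not; a lock is used here as a stand-in.
print(is_picklable(threading.Lock()))
```

If a check like this fails for one of your arguments, passing the dataset path and opening the data inside the function on the server is one way around it.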
Now that you know that, getting started is as easy as this:
import pp

# address:port of each remote ppserver
ppservers = ('xx.xxx.xx.xx:8080',)
job_server = pp.Server(ppservers=ppservers, ncpus=0)
You set ncpus = 0 in order to prevent any worker processes from being started locally, so all the work goes to the remote servers. To submit a job:
libs = ("arcpy",)
job_server.submit(function,      # function to run
                  (variables,),  # tuple of arguments passed to the function
                  (),            # functions that the function depends on (none here)
                  libs)          # modules the function needs imported on the server
job_server.wait() # waits for the job to complete
job_server.print_stats() # print some stats about the server and task
del job_server
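If you also want the function's return value back, here is a rough sketch of how that can look. In pp, submit() returns a job object, and calling that object blocks until the task finishes and hands back the result. The names `add` and `run_remote` are hypothetical helpers made up for this example, and the server address is the same placeholder as above.

```python
def add(a, b):
    # Hypothetical stand-in for the real work; the return value must be picklable.
    return a + b


def run_remote(ppservers, func, args, modules=()):
    """Submit func(*args) to the given ppservers and return its result."""
    import pp  # imported here so the sketch still loads where pp is not installed

    job_server = pp.Server(ncpus=0, ppservers=ppservers)
    job = job_server.submit(func, args, (), modules)
    result = job()  # blocks until the task is done, then returns func's return value
    job_server.print_stats()
    job_server.destroy()
    return result


# Usage, assuming a ppserver is listening at this placeholder address:
# total = run_remote(('xx.xxx.xx.xx:8080',), add, (2, 3))
```

Returning a small result this way avoids the missing download step for simple cases; large outputs are probably still better written to a shared location.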
It's that simple to run the task.
Enjoy