Memory optimization opportunity
Watching Rector in e.g. htop while running on a big project (e.g. Mautic), we can easily see that most of the memory used is within the worker processes.
These processes start small and grow over time: the more files a worker has processed, the more memory it holds.
Since we disabled the garbage collector, the memory keeps growing the whole time.
This means that at the end of the analysis the workers hold a lot of allocated memory, leaked from all the files on their way through the codebase and never freed because there is no GC.
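For intuition, here is a minimal Python sketch (an analogy, not Rector's actual PHP code) of the mechanism: with the cycle collector disabled, reference counting alone never reclaims cyclic garbage, so it accumulates until an explicit collection pass runs.

```python
import gc

class Node:
    def __init__(self):
        self.ref = None

def make_cycle():
    a, b = Node(), Node()
    a.ref, b.ref = b, a  # reference cycle: refcounting alone never frees it

gc.disable()             # automatic collection off, as Rector does for speed
for _ in range(1000):
    make_cycle()
# cyclic garbage has piled up; only an explicit pass can reclaim it
freed = gc.collect()     # returns the number of unreachable objects found
gc.enable()
print(freed)             # thousands of objects from the loop above
```

The same applies in PHP: with `gc_disable()`, cyclic structures created per file stay allocated until `gc_collect_cycles()` runs or the process exits, which is why worker memory only ever grows.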
Since we disabled the GC intentionally to improve performance, I think we should limit the number of files a worker processes.
If a worker died after X files and a new worker were spawned in its place (so the overall number of workers stays stable), the memory would be freed and Rector would need less memory overall.
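Python's multiprocessing pool exposes exactly this knob as `maxtasksperchild`, which could serve as a model for the behavior (again a Python analogy, not Rector's API; `process_file` is a hypothetical stand-in):

```python
import multiprocessing as mp
import os

def process_file(path):
    # stand-in for per-file analysis; report which worker handled it
    return os.getpid()

def run(num_files=10, workers=2, tasks_per_worker=1):
    # maxtasksperchild retires each worker after N tasks and spawns a
    # fresh one, so the pool size stays stable but leaked memory is freed
    with mp.Pool(processes=workers, maxtasksperchild=tasks_per_worker) as pool:
        return pool.map(process_file, range(num_files), chunksize=1)

if __name__ == "__main__":
    pids = run()
    # more distinct pids than pool slots shows workers were recycled
    print(len(set(pids)) > 2)
```

The key property is that recycling is invisible to the scheduler: it only ever sees a pool of constant size, while the OS reclaims each retired worker's memory wholesale.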
I had a look into the codebase but could not find the right place to implement the idea.
The parallel logic is a bit distributed across classes, which made it hard for me to start.
Maybe @TomasVotruba or @samsonasik have a better overview of the parallel infrastructure, so we could e.g. limit each worker's files to 10x the job size or similar. WDYT?
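To make the cost of the idea concrete, here is a back-of-the-envelope simulation (plain Python, all names hypothetical, round-robin assignment as a simplification) of how many worker spawns a given per-worker file limit would require:

```python
def count_spawns(num_files, pool_size, max_files_per_worker):
    """Simulate a pool where each worker exits after handling
    max_files_per_worker files and is immediately replaced,
    keeping the pool size constant the whole time."""
    spawned = pool_size                       # the initial workers
    budget = [max_files_per_worker] * pool_size
    for i in range(num_files):
        w = i % pool_size                     # round-robin (simplification)
        budget[w] -= 1
        if budget[w] == 0:                    # limit reached: recycle worker
            budget[w] = max_files_per_worker
            spawned += 1
    return spawned

# e.g. 1000 files, 4 workers, limit of 50 files per worker:
# each worker handles 250 files and is recycled 5 times
print(count_spawns(1000, 4, 50))  # -> 24
```

A limit around 10x the job size, as suggested above, would keep the spawn overhead to a handful of extra processes per run while capping how much leaked memory any single worker can accumulate.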