Greetings!
I'm a long-time linux user and frequent defacto admin.
I have a problem: a cron job is running that calls a bash script to change the directory and run a php script which, in turn, scrapes an old-and-ugly site which our office owns but does not control (meaning that doing something sensible like serving the data we now scrape via an API is not an option). With the size of the content currently being scraped, the run time is one or 2 seconds. The cron job runs, at peak, once a minute.
Naturally, I do some logging: The bash script writes out a timestamp to log when it runs, the PHP script writes 'locked' to a status file and 'working' and a timestamp to log. Any exceptions are captured and logged out with timestamp when they occur. End-of run is also logged, with timestamp and, finally, 'unlocked' is written to the status file.
The problem: I'm showing some instances when the cron job runs and the php job doesn't, at all (no status written, no 'working' log entry). It's not being caused by collision (recall the scrape runs in under 2 seconds and the cron interval is 1m and there's app-level logging-o-plenty to expose a collision); additionally, I've seen similar at longer increments (2, 5 mins). PHP error level is set to E_ALL and there is no output that corresponds to the missed runs.
My thinking is that I might be looking at a failure of one scripting environment to hand off to another.
Has anyone seen something like this? I'm having trouble finding system log activity that explains what I'm seeing.
Matt Murphy
Philadelphia Elections Commission