Restarting a job fails immediately with no reason why
richardmoore Jul 29, 2019 4:48 PMI have a job that has failed and when I restart it in the failed step the job fails again with no logging messages as to why.
Below is the jsl.
- copyDiffFileToGdgWeeklyDiff ends with COMPLETED and moves to the deliverDiffFile step.
- The deliverDiffFile fails as follows
2019-07-29 14:13:22.631 jberet-1 MoveFileBatchlet - INFO Move file: /awg/t1/app/data/tmp/GC/AWG_WRLDGCITMC.TXT
2019-07-29 14:13:22.631 jberet-1 MoveFileBatchlet - INFO Move to: /awg/t1/app/data/mft/inbound/retail/AWG_WRLDGCITMC.TXT
2019-07-29 14:13:22.631 jberet-1 MoveFileBatchlet - WARN File does not exist.
2019-07-29 14:13:22.631 jberet-1 MoveFileBatchlet - INFO Ending with a status of FILE_DOES_NOT_EXIST
2019-07-29 14:13:22.658 jberet-1 CawaManager - INFO CAWA job user status updated: Ending AIS_Weekly_UniversalItemChangesGot.deliverDiffFile = FILE_DOES_NOT_EXIST
2019-07-29 14:13:22.658 jberet-1 JobStepListener - INFO =======================================================================
2019-07-29 14:13:22.658 jberet-1 JobStepListener - INFO Ending step: AIS_Weekly_UniversalItemChangesGot.deliverDiffFile = FILE_DOES_NOT_EXIST
2019-07-29 14:13:22.658 jberet-1 JobStepListener - INFO =======================================================================
2019-07-29 14:13:23.485 jberet-1 JobStatsListener - INFO Ending job: AIS_Weekly_UniversalItemChangesGot = FAILED
...
2019-07-29 14:13:23.491 jberet-1 JobStatsListener - INFO Step: deliverDiffFile
2019-07-29 14:13:23.491 jberet-1 JobStatsListener - INFO Execution Id: 62801
2019-07-29 14:13:23.491 jberet-1 JobStatsListener - INFO Start Time: Mon Jul 29 14:13:22 CDT 2019
2019-07-29 14:13:23.491 jberet-1 JobStatsListener - INFO End Time: Mon Jul 29 14:13:22 CDT 2019
2019-07-29 14:13:23.491 jberet-1 JobStatsListener - INFO Exit Status: FILE_DOES_NOT_EXIST
2019-07-29 14:13:23.491 jberet-1 JobStatsListener - INFO Batch Status: COMPLETED
2019-07-29 14:13:23.491 jberet-1 JobStatsListener - INFO To restart in this step use: -DrestartId=43101 -DrestartStep=deliverDiffFile
2019-07-29 14:13:23.492 jberet-1 JobStatsListener - INFO =======================================================================================
2019-07-29 14:13:23.523 jberet-1 CawaManager - INFO CAWA job user status updated: Ending AIS_Weekly_UniversalItemChangesGot = FAILED
<step id="copyDiffFileToGdgWeeklyDiff">
&step-listeners;
<batchlet ref="copyFileToGdg">
<properties>
<property name="source" value="#{systemProperties['APP_DATA_TMP_FCLTY']}/#{jobProperties['diffFile']}"/>
<property name="target" value="#{jobProperties['gdgDirectoryWeeklyDiff']}"/>
<property name="processEmptyFile" value="true"/>
</properties>
</batchlet>
<next on="COMPLETED" to="deliverDiffFile"/>
<fail on="*" exit-status="FAILED"/>
</step>
<step id="deliverDiffFile">
&step-listeners;
<batchlet ref="moveFile">
<properties>
<property name="file" value="#{systemProperties['APP_DATA_TMP_FCLTY']}/#{jobProperties['diffFile']}"/>
<property name="destination" value="#{systemProperties['APP_DATA_MFT_INBOUND']}/retail/#{jobProperties['diffFile']}"/>
<property name="overwrite" value="true" />
</properties>
</batchlet>
<next on="FILE_MOVED" to="deleteDiffFile"/>
<fail on="*" exit-status="FAILED"/>
</step>
<step id="deleteDiffFile">
&step-listeners;
<batchlet ref="deleteFile">
<properties>
<property name="file" value="#{systemProperties['APP_DATA_TMP_FCLTY']}/#{jobProperties['diffFile']}"/>
</properties>
</batchlet>
<end on="FILE_DOES_NOT_EXIST"/>
<end on="FILE_DELETED"/>
<fail on="*" exit-status="FAILED"/>
</step>
I put the file out there and then restart it I get the following giving no reason as to why it failed -
2019-07-29 14:14:48.837 jberet-1 JobStatsListener - INFO Ending job: AIS_Weekly_UniversalItemChangesGot = FAILED
2019-07-29 14:14:48.837 jberet-1 JobStatsListener - INFO =======================================================================================
2019-07-29 14:14:48.837 jberet-1 JobStatsListener - INFO R U N S T A T S
2019-07-29 14:14:48.837 jberet-1 JobStatsListener - INFO =======================================================================================
2019-07-29 14:14:48.837 jberet-1 JobStatsListener - INFO Job: AIS_Weekly_UniversalItemChangesGot
2019-07-29 14:14:48.837 jberet-1 JobStatsListener - INFO Execution Id: 43121
2019-07-29 14:14:48.837 jberet-1 JobStatsListener - INFO Parm: restartStep = 'deliverDiffFile'
2019-07-29 14:14:48.838 jberet-1 JobStatsListener - INFO Parm: restartId = '43101'
2019-07-29 14:14:48.841 jberet-1 JobStatsListener - INFO Start Time: Mon Jul 29 14:14:47 CDT 2019
2019-07-29 14:14:48.842 jberet-1 JobStatsListener - INFO End Time: Mon Jul 29 14:14:48 CDT 2019
2019-07-29 14:14:48.842 jberet-1 JobStatsListener - INFO Exit Status: FAILED
2019-07-29 14:14:49.251 jberet-1 JobStatsListener - INFO =======================================================================================
2019-07-29 14:14:49.278 jberet-1 CawaManager - INFO CAWA job user status updated: Ending AIS_Weekly_UniversalItemChangesGot = FAILED