UNDERSTANDING ERROR LOG EVENT SEQUENCE FOR FAILURE ANALYSIS

dc.contributor.authorNentawe Gurumdimma1 , Desmond Bala Bisandu
dc.date.accessioned2026-03-13T10:56:26Z
dc.date.issued2018
dc.description.abstractDue to the evolvement of large-scale parallel systems, they are mostly employed for mission critical applications. The anticipation and accommodation of failure occurrences is crucial to the design. A commonplace feature of these large-scale systems is failure, and they cannot be treated as exception. The system state is mostly captured through the logs. The need for proper understanding of these error logs for failure analysis is extremely important. This is because the logs contain the “health” information of the system. In this paper we design an approach that seeks to find similarities in patterns of these logs events that leads to failures. Our experiment shows that several root causes of soft lockup failures could be traced through the logs. We capture the behavior of failure inducing patterns and realized that the logs pattern of failure and non-failure patterns are dissimilar.
dc.identifier.issn1597-6343
dc.identifier.urihttps://irepos.unijos.edu.ng/handle/123456789/11459
dc.language.isoen_US
dc.publisherScience World Journal
dc.subjectCluster
dc.subjectHPC
dc.subjectsimilarity
dc.titleUNDERSTANDING ERROR LOG EVENT SEQUENCE FOR FAILURE ANALYSIS
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
18872-ArticleText-75965-1-10-20181224.pdf
Size:
688.48 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: