http://fta.inria.fr/apache2-default/pmwiki/pub/schema_fta.png
Platform → Node → Component → Event Trace.
2 | user present |
3 | CPU threshold exceeded |
2* | user present |
3* | CPU threshold exceeded |
(*) These codes are specifically for the Notre Dame trace.
trace_start | when the trace event first appeared (epoch time) |
trace_end | when the trace event last appeared (epoch time) |
trace_start | when the trace event first appeared (epoch time). Can be NULL if component was always available. |
trace_end | when the trace event last appeared (epoch time). Can be NULL if component was always available. |
A description of the table attributes appears below. The MySQL schema is defined in fta.src.
2029 | IO: SCSI Back Plane |
2030 | IO: HSV |
2031 | IO: IO Controller |
3022 | Network: ELAN |
4030 | Software: Applications |
This document describes the Failure Trace Archive format.
1-999 | infrastructure |
1-999 | infrastructure |
component_type | type of this component trace (for example, host, CPU, or network); see below for component_type codes |
event_type | type of event; see below for event_type codes |
event_end_reason | reason the event type or state changed at the end of this trace (for example, the reason that the CPU became unavailable); see below for event_end_reason codes |
4001 | Software: "Disk IO, firmware and storage" |
4002 | Software: "MPI, PVM, Array services" |
4001 | Software: Disk IO, firmware and storage |
4002 | Software: MPI, PVM, Array services |
4019 | Software: Other Software |
4020 | Software: Parallel File System |
4021 | Software: Patch Install |
4022 | Software: Resource Mgmt System |
4023 | Software: Scheduler Software |
4024 | Software: Scratch Drive |
4025 | Software: Scratch FS |
4026 | Software: Security Software |
4027 | Software: Upgrade/Install 3rd Party Sftw |
4028 | Software: Upgrade/Install OS sftw |
4029 | Software: User code |
4030 | Software: Vizscratch FS |
4019 | Software: Parallel File System |
4020 | Software: Patch Install |
4021 | Software: Resource Mgmt System |
4022 | Software: Scheduler Software |
4023 | Software: Scratch Drive |
4024 | Software: Scratch FS |
4025 | Software: Security Software |
4026 | Software: Upgrade/Install 3rd Party Sftw |
4027 | Software: Upgrade/Install OS sftw |
4028 | Software: User code |
4029 | Software: Vizscratch FS |
2013 | IO: Fibre Raid Midplane |
2014 | IO: IDE Cable |
2015 | IO: IOS Disk Logic |
2016 | IO: IOS Disk Logic |
2017 | IO: PCI Fibre Channel Adapter |
2018 | IO: PCI IO Module |
2019 | IO: PCI SCSI Controller |
2020 | IO: PCI Shoebox |
2021 | IO: SAN Appliance |
2022 | IO: SAN Controller |
2023 | IO: SAN Disk Drive |
2024 | IO: SAN Fiber Cable |
2025 | IO: SAN GBIC |
2026 | IO: SAN Shelf |
2027 | IO: SAN Switch |
2028 | IO: SCSI Adapter Card |
2029 | IO: SCSI Controller |
2030 | IO: SCSI Drive |
2031 | IO: SAN Appliance |
2013 | IO: IDE Cable |
2014 | IO: IOS Disk Logic |
2015 | IO: PCI Fibre Channel Adapter |
2016 | IO: PCI IO Module |
2017 | IO: PCI SCSI Controller |
2018 | IO: PCI Shoebox |
2019 | IO: SAN Appliance |
2020 | IO: SAN Controller |
2021 | IO: SAN Disk Drive |
2022 | IO: SAN Fiber Cable |
2023 | IO: SAN GBIC |
2024 | IO: SAN Shelf |
2025 | IO: SAN Switch |
2026 | IO: SCSI Adapter Card |
2027 | IO: SCSI Controller |
2028 | IO: SCSI Drive |
2029 | IO: SAN Appliance |
3016 | Network: Interconnect |
3017 | Network: PCI Ethernet Board |
3018 | Network: PCI GBIT Ethernet Board |
3019 | Network: Router Board |
3020 | Network: PCI GBIT Ethernet Board |
3021 | Network: Site Ethernet Switch |
3022 | Network: Site Network Interface |
3023 | Network: PCI GBIT Ethernet Board |
3024 | Network: PCI GBIT Ethernet Board |
3016 | Network: PCI Ethernet Board |
3017 | Network: PCI GBIT Ethernet Board |
3018 | Network: Router Board |
3019 | Network: Site Ethernet Switch |
3020 | Network: Site Network Interface |
2004 | IO: Fibre Cable |
2005 | IO: Fibre Channel Port Adapter |
2006 | IO: Fibre Drive |
2007 | IO: Fibre HBA |
2008 | IO: Fibre Raid Controller |
2009 | IO: Fibre Raid LCC card |
2010 | IO: Fibre Raid Midplane |
2011 | IO: Fibre Raid Power Supply |
2012 | IO: Fibre Raid Midplane |
2013 | IO: IDE Cable |
2014 | IO: IOS Disk Logic |
2005 | IO: Fibre Cable |
2006 | IO: Fibre Channel Port Adapter |
2007 | IO: Fibre Drive |
2008 | IO: Fibre HBA |
2009 | IO: Fibre Raid Controller |
2010 | IO: Fibre Raid LCC card |
2011 | IO: Fibre Raid Midplane |
2012 | IO: Fibre Raid Power Supply |
2013 | IO: Fibre Raid Midplane |
2014 | IO: IDE Cable |
2016 | IO: PCI Fibre Channel Adapter |
2017 | IO: PCI IO Module |
2018 | IO: PCI SCSI Controller |
2019 | IO: PCI Shoebox |
2020 | IO: SAN Appliance |
2021 | IO: SAN Controller |
2022 | IO: SAN Disk Drive |
2023 | IO: SAN Fiber Cable |
2024 | IO: SAN GBIC |
2025 | IO: SAN Shelf |
2026 | IO: SAN Switch |
2027 | IO: SCSI Adapter Card |
2028 | IO: SCSI Controller |
2029 | IO: SCSI Drive |
2030 | IO: SAN Appliance |
2016 | IO: IOS Disk Logic |
2017 | IO: PCI Fibre Channel Adapter |
2018 | IO: PCI IO Module |
2019 | IO: PCI SCSI Controller |
2020 | IO: PCI Shoebox |
2021 | IO: SAN Appliance |
2022 | IO: SAN Controller |
2023 | IO: SAN Disk Drive |
2024 | IO: SAN Fiber Cable |
2025 | IO: SAN GBIC |
2026 | IO: SAN Shelf |
2027 | IO: SAN Switch |
2028 | IO: SCSI Adapter Card |
2029 | IO: SCSI Controller |
2030 | IO: SCSI Drive |
2031 | IO: SAN Appliance |
1040 | Hardware: WACS Logic |
1040 | Hardware: Wire Harness |
1040 | Hardware: Xtown Board |
1041 | Hardware: WACS Logic |
1042 | Hardware: Wire Harness |
1043 | Hardware: Xtown Board |
4001 | Software: "Disk IO, firmware and storage" |
4002 | Software: "MPI, PVM, Array services" |
4003 | Software: Cluster File System |
4004 | Software: Cluster Software |
4005 | Software: Compilers and libraries |
4006 | Software: DST |
4007 | Software: DST-Scan for errors /scratch |
4008 | Software: DST-Upgrd/Install OS sftw |
4009 | Software: DST-Upgrd/Instl 3rdParty Sftw |
4010 | Software: DST-Upgrd/Instl OS Sftw |
4011 | Software: Interconnect |
4012 | Software: IOS Software |
4013 | Software: Kernel software |
4014 | Software: Modify kernel parameters |
4015 | Software: Modify system config files |
4016 | Software: Network |
4017 | Software: NFS |
4018 | Software: OS |
4019 | Software: Other Software |
4020 | Software: Parallel File System |
4021 | Software: Patch Install |
4022 | Software: Resource Mgmt System |
4023 | Software: Scheduler Software |
4024 | Software: Scratch Drive |
4025 | Software: Scratch FS |
4026 | Software: Security Software |
4027 | Software: Upgrade/Install 3rd Party Sftw |
4028 | Software: Upgrade/Install OS sftw |
4029 | Software: User code |
4001 | Software: "Disk IO, firmware and storage" |
4002 | Software: "MPI, PVM, Array services" |
4003 | Software: Cluster File System |
4004 | Software: Cluster Software |
4005 | Software: Compilers and libraries |
4006 | Software: DST |
4007 | Software: DST-Scan for errors /scratch |
4008 | Software: DST-Upgrd/Install OS sftw |
4009 | Software: DST-Upgrd/Instl 3rdParty Sftw |
4010 | Software: DST-Upgrd/Instl OS Sftw |
4011 | Software: Interconnect |
4012 | Software: IOS Software |
4013 | Software: Kernel software |
4014 | Software: Modify kernel parameters |
4015 | Software: Modify system config files |
4016 | Software: Network |
4017 | Software: NFS |
4018 | Software: OS |
4019 | Software: Other Software |
4020 | Software: Parallel File System |
4021 | Software: Patch Install |
4022 | Software: Resource Mgmt System |
4023 | Software: Scheduler Software |
4024 | Software: Scratch Drive |
4025 | Software: Scratch FS |
4026 | Software: Security Software |
4027 | Software: Upgrade/Install 3rd Party Sftw |
4028 | Software: Upgrade/Install OS sftw |
4029 | Software: User code |
5000 | Human Error: Human Error |
5000 | Human Error: Human Error |
1002 | Hardware: Power Supply |
1003 | Hardware: Rack Power Distribution Unit |
1004 | Hardware: Interconnect Switch |
1005 | Hardware: Ktown Board |
1006 | Hardware: Graphics Accel Hdwr |
1007 | Hardware: Ethernet Switch |
1008 | Hardware: Site Ethernet Switch |
1009 | Hardware: SAN Switch |
1010 | Hardware: Fibre Raid Power Supply |
1011 | Hardware: Gig E Switch |
1012 | Hardware: Xtown Board |
1013 | Hardware: GFX Power Supply |
1014 | Hardware: Site Network Interface |
1015 | Hardware: Power cord |
1004 | Hardware: Console Interface Module |
1005 | Hardware: CPU |
1006 | Hardware: DG Board |
1007 | Hardware: Fan |
1008 | Hardware: Fan Assembly |
1009 | Hardware: GFX Power Supply |
1010 | Hardware: Graphics Accel Hdwr |
1011 | Hardware: Graphics Video Card |
1012 | Hardware: Heatsink bracket |
1013 | Hardware: IO6 |
1014 | Hardware: IOS Buffer Memory |
1015 | Hardware: IOS CPU |
1016 | Hardware: KGPSA |
1017 | Hardware: Ktown Board |
1018 | Hardware: Maintenance |
1019 | Hardware: Memory Dimm |
1020 | Hardware: Memory Dimm/CPU |
1021 | Hardware: Memory Module |
1022 | Hardware: MIA |
1023 | Hardware: Mid-plane |
1024 | Hardware: MMB |
1025 | Hardware: Module Assembly |
1026 | Hardware: MSC Board |
1027 | Hardware: OCP |
1028 | Hardware: PCI Back Plane |
1029 | Hardware: Power cord |
1030 | Hardware: Power Supply |
1031 | Hardware: Rack Power Distribution Unit |
1032 | Hardware: Riser Card |
1033 | Hardware: RM Board |
1034 | Hardware: SSD Logic |
1035 | Hardware: SSD Memory Module |
1036 | Hardware: System Board |
1037 | Hardware: System Controller |
1038 | Hardware: Temp Probe |
1039 | Hardware: Term Server |
1040 | Hardware: Vhisp |
1040 | Hardware: WACS Logic |
1040 | Hardware: Wire Harness |
1040 | Hardware: Xtown Board |
2000 | IO: IO |
2001 | IO: Disk Cabinet |
2002 | IO: Disk Drive |
2003 | IO: Drive Cage |
2004 | IO: Fddi |
2004 | IO: Fibre Cable |
2005 | IO: Fibre Channel Port Adapter |
2006 | IO: Fibre Drive |
2007 | IO: Fibre HBA |
2008 | IO: Fibre Raid Controller |
2009 | IO: Fibre Raid LCC card |
2010 | IO: Fibre Raid Midplane |
2011 | IO: Fibre Raid Power Supply |
2012 | IO: Fibre Raid Midplane |
2013 | IO: IDE Cable |
2014 | IO: IOS Disk Logic |
2015 | IO: IOS Disk Logic |
2016 | IO: PCI Fibre Channel Adapter |
2017 | IO: PCI IO Module |
2018 | IO: PCI SCSI Controller |
2019 | IO: PCI Shoebox |
2020 | IO: SAN Appliance |
2021 | IO: SAN Controller |
2022 | IO: SAN Disk Drive |
2023 | IO: SAN Fiber Cable |
2024 | IO: SAN GBIC |
2025 | IO: SAN Shelf |
2026 | IO: SAN Switch |
2027 | IO: SCSI Adapter Card |
2028 | IO: SCSI Controller |
2029 | IO: SCSI Drive |
2030 | IO: SAN Appliance |
3003 | Network: Network |
4000 | Software: Scheduler Software |
4001 | Software: Network |
4002 | Software: Upgrade/Install OS sftw |
4003 | Software: Kernel software |
4004 | Software: Other Software |
4005 | Software: Upgrade/Install 3rd Party Sftw |
4006 | Software: DST-Upgrd/Instl OS Sftw |
4007 | Software: IOS Software |
4008 | Software: Cluster Software |
4009 | Software: DST-Upgrd/Instl 3rdParty Sftw |
4010 | Software: DST-Upgrd/Install OS sftw |
4011 | Software: Security Software |
3003 | Network: Ethernet Cable |
3004 | Network: Ethernet Copper Cable |
3005 | Network: Ethernet Fiber Cable |
3006 | Network: Ethernet Switch |
3007 | Network: GE Board |
3008 | Network: Gig E Connection |
3009 | Network: Gig E Switch |
3010 | Network: Interconnect |
3011 | Network: Interconnect Cable |
3012 | Network: Interconnect Interface |
3013 | Network: Interconnect Misc |
3014 | Network: Interconnect Soft Error |
3015 | Network: Interconnect Switch |
3016 | Network: Interconnect |
3017 | Network: PCI Ethernet Board |
3018 | Network: PCI GBIT Ethernet Board |
3019 | Network: Router Board |
3020 | Network: PCI GBIT Ethernet Board |
3021 | Network: Site Ethernet Switch |
3022 | Network: Site Network Interface |
3023 | Network: PCI GBIT Ethernet Board |
3024 | Network: PCI GBIT Ethernet Board |
4000 | Software: Software |
4001 | Software: "Disk IO, firmware and storage" |
4002 | Software: "MPI, PVM, Array services" |
4003 | Software: Cluster File System |
4004 | Software: Cluster Software |
4005 | Software: Compilers and libraries |
4006 | Software: DST |
4007 | Software: DST-Scan for errors /scratch |
4008 | Software: DST-Upgrd/Install OS sftw |
4009 | Software: DST-Upgrd/Instl 3rdParty Sftw |
4010 | Software: DST-Upgrd/Instl OS Sftw |
4011 | Software: Interconnect |
4012 | Software: IOS Software |
4013 | Software: Kernel software |
4014 | Software: Modify kernel parameters |
4015 | Software: Modify system config files |
4016 | Software: Network |
4017 | Software: NFS |
4018 | Software: OS |
4019 | Software: Other Software |
4020 | Software: Parallel File System |
4021 | Software: Patch Install |
4022 | Software: Resource Mgmt System |
4023 | Software: Scheduler Software |
4024 | Software: Scratch Drive |
4025 | Software: Scratch FS |
4026 | Software: Security Software |
4027 | Software: Upgrade/Install 3rd Party Sftw |
4028 | Software: Upgrade/Install OS sftw |
4029 | Software: User code |
4030 | Software: Vizscratch FS |
5000 | Human Error: Human Error |
6004 | User: User |
1-999 | infrastructure (e.g. power outage) |
1-999 | infrastructure |
4 | Infrastructure: Chillers |
5 | Infrastructure: Environment |
6 | Infrastructure: UPS |
1001 | Hardware: Console Network Device |
1001 | Hardware: 40MM Cooling Fan |
1002 | Hardware: Back Plane Assembly |
1003 | Hardware: CD ROM |
3001 | Network: 1 GBit Ethernet Card |
3002 | Network: 100 MBit Ethernet Card |
3003 | Network: Network |
0 | undetermined |
NULL | not reported |
0 | reported as undetermined |
0 | undetermined |
NULL | not reported |
0 | reported as undetermined |
component_type | type of this component trace. see below for codes |
component_type | type of this component trace (for example, host, CPU, or network). see below for codes |
component_type | type of this component trace (0 -> host availability, network, CPU, client, memory, etc) |
component_type | type of this component trace. see below for codes |
1 | Infrastructure: Power Outage |
2 | Infrastructure: Power Spike |
2 | Infrastructure: Power Outage |
3 | Infrastructure: Power Spike |
!code | !definition |
code | definition |
---|
!code range | !category |
code range | category |
---|
!code | !definition |
code | definition |
---|
code | definition |
---|
!code | !definition |
code range | category |
---|
!code range | !category |
code | definition |
---|
!code | !definition |
event_end_reason | reason the event type or state changed at the end of this trace (for example, reason that CPU became unavailable. see below for end_reason codes) |
event_end_reason | reason the event type or state changed at the end of this trace (for example, reason that CPU became unavailable. see below for event_end_reason codes) |
event_type | type of event.see below for event_type codes |
event_type | type of event. see below for event_type codes |
event_end_reason | reason the event type or state changed at the end of this trace (for example, reason that CPU became unavailable. |
event_end_reason | reason the event type or state changed at the end of this trace (for example, reason that CPU became unavailable. see below for end_reason codes) |
event_type | type of event |
event_type | type of event.see below for event_type codes |
event_end_reason | reason the event type or state changed at the end of this trace (for example, reason that CPU became unavailable. see below for end_reason codes ) |
event_end_reason | reason the event type or state changed at the end of this trace (for example, reason that CPU became unavailable. |
code | definition |
code | definition |
---|
code range | category |
code range | category |
---|
code | definition |
code | definition |
---|
2000-2999 | human error |
2000-2999 | IO |
5000-5999 | user |
5000-5999 | human error |
6000-6999 | user |
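The code ranges can be applied mechanically when reading event_end_reason values. The following is a minimal sketch, assuming the ranges from the most recent revision shown in this history (1-999 infrastructure, 1000-1999 hardware, 2000-2999 IO, 3000-3999 network, 4000-4999 software, 5000-5999 human error, 6000-6999 user), with NULL meaning "not reported" and 0 meaning "reported as undetermined". The names END_REASON_RANGES and end_reason_category are hypothetical and are not part of the archive.

```python
from typing import Optional

# Code ranges as listed in this document; the table name is illustrative only.
END_REASON_RANGES = [
    (1, 999, "infrastructure"),
    (1000, 1999, "hardware"),
    (2000, 2999, "IO"),
    (3000, 3999, "network"),
    (4000, 4999, "software"),
    (5000, 5999, "human error"),
    (6000, 6999, "user"),
]


def end_reason_category(code: Optional[int]) -> str:
    """Map an event_end_reason code to its category name."""
    if code is None:
        # A NULL in the database means the reason was not reported.
        return "not reported"
    if code == 0:
        # 0 means the reason was reported as undetermined.
        return "undetermined"
    for low, high, category in END_REASON_RANGES:
        if low <= code <= high:
            return category
    # Codes outside every documented range are left unclassified.
    return "unknown"
```

For example, end_reason_category(2002) returns "IO" (Disk Drive falls in the 2000-2999 range), and end_reason_category(None) returns "not reported".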
5000 | User: User |
5001 | User: Mouse Activity |
5000 | User: Keyboard Activity |
5000 | User: Shutdown |
5000 | User: User |
6000 | User: User |
6001 | User: Mouse Activity |
6002 | User: Keyboard Activity |
6003 | User: Shutdown |
6004 | User: User |
event_type | type of event (0 -> unavailability, 1 -> availability). Event IDs up to 10,000 are reserved; the rest can be user-defined |
event_type | type of event |
Below are code ranges and definitions for the fields above. The failure classification is based on the traces from LANL.
Below are code ranges and definitions for the fields above.
Event IDs up to 10,000 are reserved; the rest can be user-defined.
code | definition |
0 | unavailability |
1 | availability |
The failure classification below is based on the traces from LANL.
Below are code ranges and definitions for the fields above.
Below are code ranges and definitions for the fields above. The failure classification is based on the traces from LANL.
Below are code ranges and definitions for the fields above.
5000 | User: Shutdown |
5000 | User: User |
4=scheduled_downtime, 5=graceful_shutdown, 6=hard_shutdown
5000-5999 | user |
3000-3999 | network |
4000-4999 | software |
0=undefined, 1-999: Infrastructure 1000-1999: Hardware 1=miscellaneous, 2=mouse_activity, 3=keyboard_activity, 4=scheduled_downtime, 5=graceful_shutdown, 6=hard_shutdown
1002 | Hardware: Power Supply |
1003 | Hardware: Rack Power Distribution Unit |
1004 | Hardware: Interconnect Switch |
1005 | Hardware: Ktown Board |
1006 | Hardware: Graphics Accel Hdwr |
1007 | Hardware: Ethernet Switch |
1008 | Hardware: Site Ethernet Switch |
1009 | Hardware: SAN Switch |
1010 | Hardware: Fibre Raid Power Supply |
1011 | Hardware: Gig E Switch |
1012 | Hardware: Xtown Board |
1013 | Hardware: GFX Power Supply |
1014 | Hardware: Site Network Interface |
1015 | Hardware: Power cord |
3000 | Network: Network |
4000 | Software: Scheduler Software |
4001 | Software: Network |
4002 | Software: Upgrade/Install OS sftw |
4003 | Software: Kernel software |
4004 | Software: Other Software |
4005 | Software: Upgrade/Install 3rd Party Sftw |
4006 | Software: DST-Upgrd/Instl OS Sftw |
4007 | Software: IOS Software |
4008 | Software: Cluster Software |
4009 | Software: DST-Upgrd/Instl 3rdParty Sftw |
4010 | Software: DST-Upgrd/Install OS sftw |
4011 | Software: Security Software |
5000 | User: User |
5001 | User: Mouse Activity |
5000 | User: Keyboard Activity |
5000 | User: User |
5000 | User: User |
4=scheduled_downtime, 5=graceful_shutdown, 6=hard_shutdown
event_end_reason | reason the event type or state changed at the end of this trace (for example, reason that CPU became unavailable: 0=undefined, 1=miscellaneous, 2=mouse_activity, 3=keyboard_activity, 4=scheduled_downtime, 5=graceful_shutdown, 6=hard_shutdown) |
event_end_reason | reason the event type or state changed at the end of this trace (for example, reason that CPU became unavailable. see below for end_reason codes ) |
code range | category |
0 | undetermined |
1-999 | infrastructure (e.g. power outage) |
1000-1999 | hardware |
2000-2999 | human error |
3000-3999 | network |
4000-4999 | software |
code | definition |
0 | undetermined |
1 | Infrastructure: Infrastructure |
1 | Infrastructure: Power Outage |
2 | Infrastructure: Power Spike |
1000 | Hardware: Hardware |
1001 | Hardware: Console Network Device |
1002 | Hardware: Power Supply |
1003 | Hardware: Rack Power Distribution Unit |
1004 | Hardware: Interconnect Switch |
1005 | Hardware: Ktown Board |
1006 | Hardware: Graphics Accel Hdwr |
1007 | Hardware: Ethernet Switch |
1008 | Hardware: Site Ethernet Switch |
1009 | Hardware: SAN Switch |
1010 | Hardware: Fibre Raid Power Supply |
1011 | Hardware: Gig E Switch |
1012 | Hardware: Xtown Board |
1013 | Hardware: GFX Power Supply |
1014 | Hardware: Site Network Interface |
1015 | Hardware: Power cord |
3000-3999 | network |
4000-4999 | software |
0=undefined, 1-999: Infrastructure 1000-1999: Hardware 1=miscellaneous, 2=mouse_activity, 3=keyboard_activity, 4=scheduled_downtime, 5=graceful_shutdown, 6=hard_shutdown
misc_notes | miscellaneous notes |
notes | miscellaneous notes |
sfpop_speed | maximum single precision floating point speed (ops/sec) |
dfpop_speed | maximum double precision floating point speed (ops/sec) |
sfpop_speed | single precision floating point speed (ops/sec) |
dfpop_speed | double precision floating point speed (ops/sec) |
A description of the table attributes appears below. The MySQL schema is defined in fta.src.
This document describes the Failure Trace Archive format. The mysql schema is defined in fta.src.
This document describes the Failure Trace Archive format.
A description of the table attributes appears below.
A description of the table attributes appears below. The mysql schema is defined in fta.src.
This document describes the Failure Trace Archive format.
This document describes the Failure Trace Archive format. The mysql schema is defined in fta.src.
platform_id | A unique number identifying this platform. It allows one to differentiate pools of nodes. |
INSERT picture of schema.
---picture of schema.
A platform contains a set of nodes. Examples of platforms include SETI@home and the desktops at Microsoft.
A node contains a set of components, each of which is a software module or hardware resource of the node. Each node can have several components (e.g. CPU speed, available memory, client availability), each of which has a corresponding trace.
A component describes attributes of a software module or hardware resource of a node.
component_perf is the component performance, as measured, for example, through benchmarks.
A creator is the person responsible for the trace data set. This table stores details about citations and copyright.
An event_trace is the trace of an event, with all of the corresponding timing information.
event_state is the state corresponding to an event_trace. For example, for CPU availability, the event_state could be the idleness of the CPU. For host availability, it could be the monitoring information associated with the event.
This document describes the Failure Trace Archive format.
The trace format is organized hierarchically as follows:
Platform -> Node -> Component -> Event Trace.
INSERT picture of schema.
We summarize the meaning of each table below. Table names are shown in bold.
A platform contains a set of nodes. Examples of platforms include SETI@home and the desktops at Microsoft.
A node contains a set of components, each of which is a software module or hardware resource of the node. Each node can have several components (e.g. CPU speed, available memory, client availability), each of which has a corresponding trace.
A component describes attributes of a software module or hardware resource of a node.
component_perf is the component performance, as measured, for example, through benchmarks.
A creator is the person responsible for the trace data set. This table stores details about citations and copyright.
An event_trace is the trace of an event, with all of the corresponding timing information.
event_state is the state corresponding to an event_trace. For example, for CPU availability, the event_state could be the idleness of the CPU. For host availability, it could be the monitoring information associated with the event.
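The hierarchy Platform → Node → Component → Event Trace can be pictured as nested records. The following is a minimal in-memory sketch, assuming Python dataclasses and only a handful of the attributes described in this document. The class layout and the helper count_event_traces are hypothetical illustrations and do not reproduce the MySQL schema.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class EventTrace:
    event_id: int
    event_type: int            # 0 = unavailability, 1 = availability
    event_start_time: float    # UNIX epoch time
    event_end_time: float      # UNIX epoch time
    event_end_reason: Optional[int] = None  # None stands in for NULL


@dataclass
class Component:
    component_id: int
    component_type: int
    event_traces: List[EventTrace] = field(default_factory=list)


@dataclass
class Node:
    node_id: int
    node_name: str
    components: List[Component] = field(default_factory=list)


@dataclass
class Platform:
    platform_id: int
    platform_name: str
    nodes: List[Node] = field(default_factory=list)


def count_event_traces(platform: Platform) -> int:
    """Walk the hierarchy and count every event trace under a platform."""
    return sum(
        len(component.event_traces)
        for node in platform.nodes
        for component in node.components
    )
```

In the archive itself these relationships are expressed through the platform_id, node_id, and component_id columns rather than through nesting; the nested form is used here only to make the containment explicit.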
A description of the table attributes appears below.
platform_id | A unique number identifying this platform. It allows one to differentiate pools of nodes. |
platform_name | name of the platform (e.g. "Berkeley_NOW_Lab_Fall_1998") |
platform_location | location name of the platform source (e.g. "Berkeley NOW Lab - Soda Hall 2nd Floor, USA, Planet Earth") |
platform_type | type of the platform (cluster, multicluster, grid, desktop_grid, or volunteer_computing) |
misc_notes | miscellaneous notes |
node_id | unique ID for this node |
platform_id | id of the platform containing node |
node_name | name of node |
node_ip | IP address |
node_location | location of the node (e.g. country, geographic coordinates) |
timezone | time zone of the resource (offset from GMT, in seconds) |
proc_model | processor name, model, version number |
os_name | name and version of the resource OS |
cores_per_proc | number of cores per processor |
num_procs | number of processors for this node |
mem_size | number of bytes of memory |
disk_size | number of bytes of disk space |
up_bw | number of bytes/sec of upload speed |
down_bw | number of bytes/sec of download speed |
metric_id | unique ID for performance metric (e.g. benchmark) |
notes | other notes related to this resource |
metric_id | unique ID for performance metric (benchmark) |
component_id | unique ID for the component |
node_id | unique ID for this node |
platform_id | ID of platform containing node |
sfpop_speed | maximum single precision floating point speed (ops/sec) |
dfpop_speed | maximum double precision floating point speed (ops/sec) |
iop_speed | integer operation speed (ops/sec) |
i_val | integer |
f_val | float |
s_val | string |
component_id | unique ID for this component |
node_id | ID of the node containing this component |
platform_id | ID of platform containing this node |
node_name | Name of the node |
component_type | type of this component trace (0 -> host availability, network, CPU, client, memory, etc) |
trace_start | when the trace event first appeared (epoch time) |
trace_end | when the trace event last appeared (epoch time) |
resolution | resolution of the traces in seconds |
creator_id | ID of creator of this component trace data |
component_id | unique ID for this component trace data |
node_id | ID of the node corresponding to this trace |
platform_id | ID of platform containing node |
creator | name(s) of the person(s) who recorded the event traces |
cite | citation (bibtex, etc) for using the data from the event traces |
copyright | details of the copyright and rights reserved |
event_id | unique ID of event state |
component_id | unique ID for this component trace data |
node_id | unique ID for this node |
platform_id | ID of platform that is the node parent |
node_name | name of node |
event_type | type of event (0 -> unavailability, 1 -> availability). Event IDs up to 10,000 are reserved; the rest can be user-defined |
event_start_time | start of this event (UNIX epoch time) |
event_end_time | end of this event (UNIX epoch time) |
event_end_reason | reason the event type or state changed at the end of this trace (for example, reason that CPU became unavailable: 0=undefined, 1=miscellaneous, 2=mouse_activity, 3=keyboard_activity, 4=scheduled_downtime, 5=graceful_shutdown, 6=hard_shutdown) |
event_id | unique ID of event state |
component_id | unique ID for this component trace data |
node_id | unique ID for this node |
platform_id | ID of platform that is the node parent |
i_val | integer |
f_val | float (for example, 0% - 100% for CPU availability) |
s_val | string |
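As an illustration of how the event_trace attributes fit together, the fraction of time a component was available can be computed from event_type, event_start_time, and event_end_time. The following is a minimal sketch, assuming rows are already loaded as (event_type, event_start_time, event_end_time) tuples, that 0 means unavailability and 1 means availability, and that events do not overlap. The function name availability_fraction is hypothetical.

```python
from typing import Iterable, Tuple

# One row: (event_type, event_start_time, event_end_time), times in epoch seconds.
EventRow = Tuple[int, float, float]


def availability_fraction(events: Iterable[EventRow]) -> float:
    """Return available time divided by total observed time."""
    available = 0.0
    observed = 0.0
    for event_type, start, end in events:
        duration = end - start
        if duration < 0:
            raise ValueError("event_end_time precedes event_start_time")
        if event_type == 1:
            available += duration
        if event_type in (0, 1):
            # Only availability and unavailability events count as observed
            # time; user-defined event types are ignored in this sketch.
            observed += duration
    if observed == 0:
        raise ValueError("no availability or unavailability events")
    return available / observed
```

For example, one availability event lasting 300 seconds followed by one unavailability event lasting 100 seconds gives a fraction of 0.75.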