<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Blog O' Matty &#187; Solaris Fault Management</title>
	<atom:link href="http://prefetch.net/blog/index.php/category/fault-management/feed/" rel="self" type="application/rss+xml" />
	<link>http://prefetch.net/blog</link>
	<description>Blog O' Matty</description>
	<lastBuildDate>Sat, 13 Mar 2010 15:28:36 +0000</lastBuildDate>
	
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Debugging a Solaris fault manager fault</title>
		<link>http://prefetch.net/blog/index.php/2009/04/04/debugging-a-solaris-fault-manager-fault/</link>
		<comments>http://prefetch.net/blog/index.php/2009/04/04/debugging-a-solaris-fault-manager-fault/#comments</comments>
		<pubDate>Sat, 04 Apr 2009 13:59:06 +0000</pubDate>
		<dc:creator>matty</dc:creator>
				<category><![CDATA[Solaris Fault Management]]></category>

		<guid isPermaLink="false">http://prefetch.net/blog/?p=1336</guid>
		<description><![CDATA[I recently debugged an issue where a host panicked with the following message:
Apr  3 04:41:44 pluto.prefetch.com genunix: [ID 663943 kern.notice] Unrecoverable Machine-Check Exception
These errors are typically generated due to CPU or memory faults, but on this specific machine nothing was being displayed when I checked the fault and errors logs. Upon closer inspection, it [...]]]></description>
		<wfw:commentRss>http://prefetch.net/blog/index.php/2009/04/04/debugging-a-solaris-fault-manager-fault/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SCSI Enclosure Services</title>
		<link>http://prefetch.net/blog/index.php/2008/07/15/scsi-enclosure-services/</link>
		<comments>http://prefetch.net/blog/index.php/2008/07/15/scsi-enclosure-services/#comments</comments>
		<pubDate>Tue, 15 Jul 2008 15:18:08 +0000</pubDate>
		<dc:creator>mike</dc:creator>
				<category><![CDATA[Solaris Fault Management]]></category>
		<category><![CDATA[Solaris Storage]]></category>

		<guid isPermaLink="false">http://prefetch.net/blog/?p=876</guid>
		<description><![CDATA[Eric Schrock has done some really cool work with integrating disk (SMART) /platform monitoring (IPMI)  information into Opensolaris.   Just recently, he has extended FMA with a new technology called SES (SCSI Enclosure Services) into build 93 of OpenSolaris.
This looks like some really cool stuff.  The following was taken directly from his blog  on [...]]]></description>
		<wfw:commentRss>http://prefetch.net/blog/index.php/2008/07/15/scsi-enclosure-services/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Monitoring the IPMI system event log</title>
		<link>http://prefetch.net/blog/index.php/2007/12/30/monitoring-bmc-system-event-logs/</link>
		<comments>http://prefetch.net/blog/index.php/2007/12/30/monitoring-bmc-system-event-logs/#comments</comments>
		<pubDate>Sun, 30 Dec 2007 19:42:00 +0000</pubDate>
		<dc:creator>matty</dc:creator>
				<category><![CDATA[Solaris Fault Management]]></category>
		<category><![CDATA[Solaris Utilities]]></category>

		<guid isPermaLink="false">http://prefetch.net/blog/index.php/2007/12/30/monitoring-bmc-system-event-logs/</guid>
		<description><![CDATA[If you have a relatively recent server, your machine most likely supports IPMI. One technology that makes IPMI extremely useful is the baseboard management controller (BMC), which is an out-of-band controller that monitors the health of your server platform. Health monitoring is accomplished by distributing sensors throughout the server, and feeding the data these sensors [...]]]></description>
		<wfw:commentRss>http://prefetch.net/blog/index.php/2007/12/30/monitoring-bmc-system-event-logs/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Getting notified when hardware breaks</title>
		<link>http://prefetch.net/blog/index.php/2007/09/03/getting-notified-when-hardware-breaks/</link>
		<comments>http://prefetch.net/blog/index.php/2007/09/03/getting-notified-when-hardware-breaks/#comments</comments>
		<pubDate>Mon, 03 Sep 2007 12:49:20 +0000</pubDate>
		<dc:creator>matty</dc:creator>
				<category><![CDATA[Solaris Fault Management]]></category>

		<guid isPermaLink="false">http://prefetch.net/blog/index.php/2007/09/03/getting-notified-when-hardware-breaks/</guid>
		<description><![CDATA[With the introduction of Solaris 10, the Solaris kernel was modified and userland tools were added to detect and report on hardware faults. The fault analysis is handled by the Solaris fault manager, which currently detects and responds (the kernel can retire memory pages,  CPUs, etc. when it detects faulty hardware) to failures in [...]]]></description>
		<wfw:commentRss>http://prefetch.net/blog/index.php/2007/09/03/getting-notified-when-hardware-breaks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Solaris SMART support is finally becoming a reality!!</title>
		<link>http://prefetch.net/blog/index.php/2007/06/06/solaris-smart-support-is-finally-becoming-a-reality/</link>
		<comments>http://prefetch.net/blog/index.php/2007/06/06/solaris-smart-support-is-finally-becoming-a-reality/#comments</comments>
		<pubDate>Thu, 07 Jun 2007 00:05:18 +0000</pubDate>
		<dc:creator>matty</dc:creator>
				<category><![CDATA[Solaris Fault Management]]></category>

		<guid isPermaLink="false">http://prefetch.net/blog/index.php/2007/06/06/solaris-fma-disk-transport-diagnosis-engine/</guid>
		<description><![CDATA[A while back I wrote a blog entry about the lack of SMART support in Solaris. Just recently, Eric Schrock added a FMA disk-transport diagnosis engine, which provides generic SMART monitoring as part of the base operating system. The disk-transport diagnosis engine currently only supports SATA disk drives, but SCSI support is right around the [...]]]></description>
		<wfw:commentRss>http://prefetch.net/blog/index.php/2007/06/06/solaris-smart-support-is-finally-becoming-a-reality/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>FMA support for AMD Opteron processors</title>
		<link>http://prefetch.net/blog/index.php/2006/06/11/fma-support-for-amd-opteron-processors/</link>
		<comments>http://prefetch.net/blog/index.php/2006/06/11/fma-support-for-amd-opteron-processors/#comments</comments>
		<pubDate>Mon, 12 Jun 2006 01:15:49 +0000</pubDate>
		<dc:creator>matty</dc:creator>
				<category><![CDATA[Solaris Fault Management]]></category>

		<guid isPermaLink="false">http://daemons.net/~matty/blog/?p=446</guid>
		<description><![CDATA[Gavin Maltby has an awesome blog entry about the FMA support that is presently in Nevada, and soon to be in Solaris 10 update 2:
http://blogs.sun.com/roller/page/gavinm/20060315
I have written about FMA before, and still think it&#8217;s my favorite Solaris 10 feature.
]]></description>
		<wfw:commentRss>http://prefetch.net/blog/index.php/2006/06/11/fma-support-for-amd-opteron-processors/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Solaris fault manager overview</title>
		<link>http://prefetch.net/blog/index.php/2005/09/29/solaris-fault-manager-fmd/</link>
		<comments>http://prefetch.net/blog/index.php/2005/09/29/solaris-fault-manager-fmd/#comments</comments>
		<pubDate>Thu, 29 Sep 2005 18:06:35 +0000</pubDate>
		<dc:creator>matty</dc:creator>
				<category><![CDATA[Solaris Fault Management]]></category>

		<guid isPermaLink="false">http://daemons.net/~matty/blog/?p=169</guid>
		<description><![CDATA[One of the coolest features in Solaris 10 in the fault management service. Fault management allows system software to send telemetry data to the fmd(1m) daemon, which then diagnoses the problem, and takes action (e.g., offlining a faulty components and logging an error with FMRI/UUID information to syslog) based on the type of event received. [...]]]></description>
		<wfw:commentRss>http://prefetch.net/blog/index.php/2005/09/29/solaris-fault-manager-fmd/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
