ZFS: notes from a couple of years in production

So, back in 2009, we decided to go the ZFS route for our storage needs (we have around 500 VMs on a mix of VMware, Virtuozzo and now OnApp platforms). Since then, quite a bit of experience has accumulated, and I’ll try to get some of it written down over a few posts in the coming weeks.

Let’s start at the end. What we have running now is:

Primary systems:

Controller nodes
Supermicro 2U (don’t have the model number here)
Dual E5645 2.4GHz SixCore CPUs
48GB RAM
2 LSI SAS 9200-8e (dual port 6G SAS HBAs)
Intel 310 series SSDs for caching
Intel quad 1G networking (will be going for 10G “real soon now”)
Running Nexenta 3.05 (at present), most likely moving to OpenIndiana (OI) or Solaris soon

Storage nodes (1 per controller at present)
Supermicro SC847E26-RJBOD1
42 Seagate Constellation ES 1TB SAS drives, set up in mirrored vdevs (20 mirrored pairs, 2 spares)
Intel 311 series SSDs for logs
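
To make the pool layout above a bit more concrete, here is a rough sketch that assembles the corresponding `zpool create` arguments for 20 two-way mirrors, 2 hot spares and a log device. The pool name `tank`, the device names and the single log SSD are all made up for illustration (the number of log SSDs isn't stated above), and the helper `build_zpool_create` is hypothetical; it only builds the command and never runs it.

```python
from typing import List, Sequence


def build_zpool_create(pool: str,
                       data_disks: Sequence[str],
                       spares: Sequence[str] = (),
                       log_devices: Sequence[str] = ()) -> List[str]:
    """Return a `zpool create` argument list using two-way mirrored vdevs."""
    if len(data_disks) % 2 != 0:
        raise ValueError("mirrored pairs need an even number of data disks")
    cmd = ["zpool", "create", pool]
    # Pair the disks up in order: (0, 1), (2, 3), ...
    for i in range(0, len(data_disks), 2):
        cmd += ["mirror", data_disks[i], data_disks[i + 1]]
    if spares:
        cmd += ["spare", *spares]
    if log_devices:
        cmd += ["log", *log_devices]
    return cmd


# Hypothetical device names: 40 data disks, 2 spares, 1 log SSD.
disks = [f"c1t{n}d0" for n in range(42)]
cmd = build_zpool_create("tank", disks[:40], spares=disks[40:],
                         log_devices=["c2t0d0"])
print(" ".join(cmd))
```

On a real box you would probably want to choose the pairs so that the two halves of each mirror sit on different HBA ports or backplanes rather than simply in device order; the sketch ignores that.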

Secondary systems:

Supermicro SC848A
Dual E5645 2.4GHz SixCore CPUs
48GB RAM
3 LSI 6G controllers (no SAS expander on backplane, so we’re using fanout cables instead)
22 Seagate Constellation ES 3TB drives (10 mirrored pairs, 2 spares)
Space left over for two cache SSDs from the main nodes if needed

After toying around with a lot of HA setups, we’ve decided to go for standalone boxes instead. We can schedule maintenance windows with our customers if needed, and we had far more problems with the HA software than we’ve ever had with hardware. We therefore went the KISS route: bet on more than one horse, with good replication and spare-part policies. We will be going for SC848A chassis for the next primary nodes as well (to keep it nice and simple).

The main nodes each export a number of iSCSI volumes (and a few NFS shares) to the client systems and take snapshots every hour. The snapshots are then replicated to the secondary systems, which are hosted in a secondary datacenter and act as DR nodes. In the event of a catastrophic failure on the primary nodes, we can recreate the iSCSI setup on the secondary and get back up and running quickly.
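
As a minimal sketch of what shipping a snapshot to a DR node can look like, the snippet below builds a `zfs send | ssh ... zfs receive` pipeline, incremental when a previous snapshot exists. This is not our actual script: the dataset names, the host name `dr-node`, the snapshot naming and the helper `replication_pipeline` are all assumptions for illustration, and the function only returns the command string.

```python
import shlex
from typing import Optional


def replication_pipeline(dataset: str, new_snap: str,
                         prev_snap: Optional[str],
                         remote_host: str, remote_dataset: str) -> str:
    """Build a shell pipeline that sends a snapshot to a remote pool."""
    send = ["zfs", "send"]
    if prev_snap is not None:
        # Incremental stream: only the changes since prev_snap.
        send += ["-i", f"{dataset}@{prev_snap}"]
    send.append(f"{dataset}@{new_snap}")
    # -F rolls the target back to its latest snapshot before receiving.
    recv = ["ssh", remote_host, "zfs", "receive", "-F", remote_dataset]
    quote = lambda args: " ".join(shlex.quote(a) for a in args)
    return quote(send) + " | " + quote(recv)


# Hypothetical names throughout.
pipeline = replication_pipeline("tank/vm01", "hourly-1300", "hourly-1200",
                                "dr-node", "backup/vm01")
print(pipeline)
```

The first transfer of a dataset has no previous snapshot, so passing `prev_snap=None` yields a full send; every later run can be incremental against the last snapshot both sides have in common.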

We do snapshots on hourly (72), daily (14), weekly (12) and monthly (12) schedules. I’ll post some scripts later.
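
Until the real scripts are posted, here is a small sketch of the pruning side, assuming the numbers in parentheses are how many snapshots of each schedule to keep. It also assumes a made-up naming convention of `dataset@schedule-YYYYMMDD-HHMM`, so that names sort chronologically; the helper `snapshots_to_destroy` is hypothetical and only returns the names that would be candidates for `zfs destroy`.

```python
from collections import defaultdict
from typing import Iterable, List, Mapping

# Assumed meaning of the numbers above: snapshots kept per schedule.
RETENTION = {"hourly": 72, "daily": 14, "weekly": 12, "monthly": 12}


def snapshots_to_destroy(names: Iterable[str],
                         retention: Mapping[str, int] = RETENTION) -> List[str]:
    """Return snapshots that fall outside the retention counts."""
    groups = defaultdict(list)
    for name in names:
        dataset, _, snap = name.partition("@")
        schedule = snap.split("-", 1)[0]
        if schedule in retention:  # ignore snapshots we did not create
            groups[(dataset, schedule)].append(name)
    doomed = []
    for (_, schedule), snaps in groups.items():
        # Newest first; the timestamp format sorts lexicographically.
        snaps.sort(reverse=True)
        doomed.extend(snaps[retention[schedule]:])
    return sorted(doomed)


# Four days of hourly snapshots (96) plus one daily, hypothetical names.
names = [f"tank/vm01@hourly-201110{d:02d}-{h:02d}00"
         for d in range(1, 5) for h in range(24)]
names.append("tank/vm01@daily-20111001-0000")
doomed = snapshots_to_destroy(names)
print(len(doomed), doomed[0], doomed[-1])
```

Grouping by dataset and schedule means each volume keeps its own 72 hourlies, 14 dailies and so on, independent of how many other volumes live in the pool.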

We’ve actually got a tertiary system as well, at the other end of the country, running an SC848 chassis with RAIDZ2’ed disks.

Buggy DM5

A few pictures from today’s (and the year’s last) round of 1:8 Offroad, held at HRCR in pouring rain.

There’s a small selection here

…And the whole lot here

 

DSC_4563

DSC_4698

DSC_4558

DSC_4647

Buggy DM4

On Sunday, GRCC hosted Buggy DM4. Lovely weather and lovely racing :)

DSC_4160
DSC_4174
DSC_4190
DSC_4196

Selected pictures
The whole lot
…And my camera acted up a bit during Semi B, so I sulked like a child, sat down up on the embankment, and shot a bit of video instead:
http://www.youtube.com/watch?v=pzwzX2Vjg_k&fmt=18

Truggy DM4

Stopped by GRCC today to grab a few pictures and a bit of video. :)

As usual, there is:

A stack of selected pictures

The whole lot

…And now also with video (of dubious quality)

 
DSC_3815
DSC_3984
DSC_3997
DSC_3855

Summer holiday in Skagen

We spent Saturday through Friday in a cosy holiday home at the top of Denmark.

Sunset

Personally, I had never really been north of Aalborg before, so there was a fair bit to see. We watched the very impressive show at Ørnereservatet (the eagle sanctuary), went up to Grenen to see the seas meet, wandered around Råbjerg Mile with the dog, ate a couple of servings of North Jutland’s best ice cream in Tversted, and watched the sunset from the west coast.

I’ve put a few pictures here:

Ørnereservatet: https://picasaweb.google.com/tommy.eriksen/Eagleworld2011

The summer holiday: https://picasaweb.google.com/tommy.eriksen/Skagen2011

…And the sunset: https://picasaweb.google.com/tommy.eriksen/Skagen2011Solnedgang

 

:)

DASU DM2 – SOS

So it was time for DM2, and time to give the camera a bit of an airing again :)
The day got off to a slightly lame start with a delay of about 3 hours due to rain, but after that things got going, and there was great racing en masse.

As always, I’ve picked out a few and put them here

…And the whole lot (around 1000, once they’ve finished uploading) can be seen here

DSC_1116
DSC_1716
DSC_1764
DSC_1150