HOME FORUMS BROKERS SOFTWARE BOOKS CONTACT US
Elite Trader Your Account  •  Become a Member  •  Help  •  Search    
    Forums ›› Tools of the Trade ›› Hardware ›› How to build a 16TB box for less than $5000  


Post A Reply
    Page 2 of 4:   1   2  3  4  
Landis82
 

Registered: Jul 2006
Posts: 8580

 

07-22-12 03:22 AM

I LOVE these kinds of threads on ET.

This is the initial reason why I came to Elitetrader in the first place years ago... The HARDWARE Section has always been filled with tons of VALUE over the years and it still keeps me coming back!

Very cool stuff.

    Edit/Delete Quote Complain
377OHMS
 

Registered: Feb 2005
Posts: 6497

 

07-22-12 03:48 AM


Quote from Landis82:

I LOVE these kinds of threads on ET.

This is the initial reason why I came to Elitetrader in the first place years ago... The HARDWARE Section has always been filled with tons of VALUE over the years and it still keeps me coming back!

Very cool stuff.



Agree, great thread. The hardware forum has been valuable to me over the years too. From smartphones to desktops there has been some great information offered and the OP has done a nice job here.

    Edit/Delete Quote Complain
WinstonTJ
 

Registered: Jan 2009
Posts: 1947

 

07-22-12 04:11 AM


Quote from FCXoptions:What in the world do you need 16TB for? That is a lot of videos or information


Normally my answer would be PORN...

But this particular box was requested as a desktop/tower so that it can be physically located on the trading floor of a very small hedge fund so that the guys can be directly connected to the machine. The setup is a bit quirky but think of it as a massive NAS that 5 guys are connected to via cross-connects.

They have one data server (in a rack and a climate control room) that is only 8TB that has their tick data on it. This thing is just for the programmers to push (junk, garbage, porn, files... I have no idea) over to where they can have essentially a workspace or workbench.

This machine needs to be fast enough that it acts the same as a mapped network drive on a local machine or LAN - but 16TB of mapped network drive.

    Edit/Delete Quote Complain
WinstonTJ
 

Registered: Jan 2009
Posts: 1947

 

07-23-12 05:52 PM


Quote from dyson:Re: How to build a 26TB box for less than $3000

I built a similar server back in August/September 2011 (before the Thailand flooding/hard drive shortage). I visited Newegg to compile a list based on the same specifications and current pricing. If you plan the build (and buy parts over a couple of months), you can also take advantage of coupons and rebates.

A few substitutions are needed because of deactivated items. Example: I am using an Intel Core i3-2100T (35W) CPU which is no longer available. Energy consumpsion was important since I am running the server 24/7 and did not want a spike in energy bills. Current energy usage is between 110W and 140W.

I am also using Hitachi 5K3000 (0F12117) 2TB drives. At the time they were going for $49 ($69 with $20 rebate). They are no longer available as Hitachi is now HGST (Western Digital).

I am using Linux (Ubuntu) as my OS with ZFS as the file system. I prefer the software RAID (zfs) vs hardware for compatibility reasons.

Here is the parts list with Newegg item numbers. Total cost is $2,968.77.



What are you using this for? I've got to say that's a great price for that much space however your example and mine are for two totally different purposes, use different types of hardware and (I would venture to guess) have two different intentions.

The biggest differences between your example and mine is the drives and raid cards. There is a major difference between 5400rpm drives and 7200rpm drives and an equally large difference between a RAID card/controller with battery back up and onboard RAM (so that write caching can be enabled) and a non-BBU card that is just being used for extra SATA ports.

Normally when I build a database such as my example the intent is to have many different processes as well as many different users accessing the array at the same time. This means it will be bombarded with constant high loads of reads and writes therefore the overall I/O of the disks and the RAID cards is very important to me. Using a 5400rpm green drive with 32mb of cache is not an option. Most of these things are being used for modeling or optimization or backtesting and with all of that going on at once the I/O to the array is paramount.

Overall, for $5,000 you get server-grade components (motherboard, NICs, CPUs, RAM (fully buffered, ECC), RAID card, etc.) with hardware RAID which is going to outperform most other retail grade options and be more reliable for 24/7 sustained use.

One more question, how do you get to 26TB with 16x drives? Is that just the usable space you get from 16 2TB drives?


On another note, I finished building the thing and it's burning in so I have more pictures and the rest of the build & configuration as well as benchmarks to post shortly.

    Edit/Delete Quote Complain
dyson
 

Registered: Jul 2010
Posts: 29

 

07-23-12 08:03 PM


Quote from WinstonTJ:

What are you using this for? I've got to say that's a great price for that much space however your example and mine are for two totally different purposes, use different types of hardware and (I would venture to guess) have two different intentions.



My server is split into 3 zfs pools: work ( 6 x 2 ), media ( 8 x 2 ), and temp ( 2 x 2 ).

The work pool contains my historical data and support files. I have been adding to my tick collection over time (now around 2.5 TB). Unless you are using tick level data, daily and minute take up only minimal space ( < 200 GB).

The media pool contains my DVD (mkv/x264) and CD (mp3 & flac) collections. The media pool is only for streaming to other devices; no encoding. All transcoding was done over the years on other (faster) machines. I would not recommend using a i3-21xxT for encoding.

The temp pool contains my nightly backups, etc. This pool is for everyday use.

Most home-users may have a simple home NAS of some kind (drobo, etc). I started out with a 4 bay NAS which I quickly grew out of.


The biggest differences between your example and mine is the drives and raid cards. There is a major difference between 5400rpm drives and 7200rpm drives and an equally large difference between a RAID card/controller with battery back up and onboard RAM (so that write caching can be enabled) and a non-BBU card that is just being used for extra SATA ports.

Normally when I build a database such as my example the intent is to have many different processes as well as many different users accessing the array at the same time. This means it will be bombarded with constant high loads of reads and writes therefore the overall I/O of the disks and the RAID cards is very important to me. Using a 5400rpm green drive with 32mb of cache is not an option. Most of these things are being used for modeling or optimization or backtesting and with all of that going on at once the I/O to the array is paramount.



Do you run your simulations locally on the server or over the network on workstations?

If over the network, do you experience any network saturation? Even with gigabit, I think network bandwidth will be more of a factor than access speed (read/write) depending on the number of processes trying to access the server simultaneously.

When I planned my design, one key decision was to make a server for storage i/o only. No user applications are run directly on the server. All CPU intensive tasks are run on a desktop machine.


Overall, for $5,000 you get server-grade components (motherboard, NICs, CPUs, RAM (fully buffered, ECC), RAID card, etc.) with hardware RAID which is going to outperform most other retail grade options and be more reliable for 24/7 sustained use.


I perfer software RAID (in my case zfs) vs hardware. Not all hardware RAID cards are compatible. If you don't have a backup of the same card (make/model), you may run into difficulty restoring a RAID 5/6 partition if you have a hardware failure.


One more question, how do you get to 26TB with 16x drives? Is that just the usable space you get from 16 2TB drives?


I am running zfs raidz (RAID 5 equivalent). When factoring raidz overhead, usable space is closer to 24 TB total. 26 TB is the free space reported by linux; although different linux tools/utils will give different results when querying free space.

    Edit/Delete Quote Complain
TIKITRADER
 

Registered: Nov 2006
Posts: 3145

 

07-23-12 09:16 PM

Awesome thread WinstonTJ... Nice posts

    Edit/Delete Quote Complain
    Page 2 of 4:   1   2  3  4  
Post A Reply


Receive an email whenever a new post is added to this thread by subscribing to it.
 
Rate This Thread:

Forum Jump:
 

 

   Conduct Rules  -  Privacy Policy  -  Day Trader -  Day Trader Forum -  Best Trading Software -  Sitemap Copyright © 2013, Elite Trader. All rights reserved.    
 
WHILE YOU'RE HERE, TAKE A MINUTE TO VISIT SOME OF OUR SPONSORS:
Advantage Futures
Futures Brokerage & Clearing
AMP Global Clearing
Futures and FX Trading
Bright Trading
Professional Equities Trading
CTS
Futures Trading Software
DaytradingBias.com
Professional Trading Analytics
ECHOtrade
Professional Trading Firm
eSignal
Trading Software Provider
FXCM
Forex Trading Services
Global Futures
Futures, Options & FX Trading
Interactive Brokers
Pro Gateway to World Markets
JC Trading Group
Direct Access Trading
MB Trading
Direct Access Trading
MultiCharts
Trading Software Provider
NinjaTrader
Trading Software Provider
OANDA
Currency Trading
optionshouse
Option Trading & Education
Rithmic
Futures Trade Execution Platform
SpeedTrader
Direct Access Trading
SpreadProfessor
Spread Trading Instruction
thinkorswim by TD Ameritrade
Direct Access TradingAdvertisement
TradersStudio
System Building & Backtesting
Trading Technologies
Trading Software Provider
Trend Following
Trading Systems Provider