Unix Technical Forum

most bang for buck with ~ $20,000

This is a discussion on most bang for buck with ~ $20,000 within the Pgsql Performance forums, part of the PostgreSQL category; --> I've asked for some help here a few months ago and got some really helpfull answers regarding RAID controllers ...


Go Back   Unix Technical Forum > Database Server Software > PostgreSQL > Pgsql Performance

FAQ Members List Calendar Search Today's Posts Mark Forums Read
  #1 (permalink)  
Old 04-19-2008, 09:13 AM
Kenji Morishige
 
Posts: n/a
Default most bang for buck with ~ $20,000

I've asked for some help here a few months ago and got some really helpfull
answers regarding RAID controllers and server configuration. Up until
recently I've been running PostgreSQL on a two year old Dual Xeon 3.06Ghz
machine with a single channel RAID controller (previously Adaptec 2200S, but
now changed to LSI MegaRAID). The 2U unit is from a generic vendor using what
I believe is a SuperMicro motherboard. In the last week after upgrading the
RAID controller, the machine has had disk failure and some other issues. I
would like to build a very reliable dedicated postgreSQL server that has the
ultimate possible performance and reliabily for around $20,000. The data set
size is only currently about 4GB, but is increasing by approximately 50MB
daily. The server also requires about 500 connections and I have been
monitoring about 100-200 queries per second at the moment. I am planning to
run FreeBSD 6.1 if possible, but I am open to any other suggestions if it
improves performance.

I am considering a setup such as this:
- At least dual cpu (possibly with 2 cores each)
- 4GB of RAM
- 2 disk RAID 1 array for root disk
- 4 disk RAID 1+0 array for PGDATA
- 2 disk RAID 1 array for pg_xlog

Does anyone know a vendor that might be able provide such setup? Any
critique in this design? I'm thinking having a 2 channel RAID controller to
seperate the PGDATA, root and pg_xlog.

Sincerely,
Kenji

---------------------------(end of broadcast)---------------------------
TIP 2: Don't 'kill -9' the postmaster

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #2 (permalink)  
Old 04-19-2008, 09:13 AM
Joshua D. Drake
 
Posts: n/a
Default Re: most bang for buck with ~ $20,000

>
> I am considering a setup such as this:
> - At least dual cpu (possibly with 2 cores each)
> - 4GB of RAM
> - 2 disk RAID 1 array for root disk
> - 4 disk RAID 1+0 array for PGDATA
> - 2 disk RAID 1 array for pg_xlog
>
> Does anyone know a vendor that might be able provide such setup?


I would look at the HP DL 385 or 585. The 385 is going to max a (2) dual
core cpus. The 585 is (4) dual core cpus.

For 20k you should be able to go with a bit more then what you have
above, specifically in the RAM department.

Joshua D. Drake




Any
> critique in this design? I'm thinking having a 2 channel RAID controller to
> seperate the PGDATA, root and pg_xlog.
>
> Sincerely,
> Kenji
>
> ---------------------------(end of broadcast)---------------------------
> TIP 2: Don't 'kill -9' the postmaster
>



--

=== The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive PostgreSQL solutions since 1997
http://www.commandprompt.com/



---------------------------(end of broadcast)---------------------------
TIP 1: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to majordomo@postgresql.org so that your
message can get through to the mailing list cleanly

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #3 (permalink)  
Old 04-19-2008, 09:13 AM
Kenji Morishige
 
Posts: n/a
Default Re: most bang for buck with ~ $20,000

The 1+0 on the WAL is better than on PGDATA? I guess I'm confused about the
write sequence of the data. I will research more, thank you!
-Kenji

On Tue, Aug 08, 2006 at 04:59:09PM -0500, Thomas F. O'Connell wrote:
>
> On Aug 8, 2006, at 4:49 PM, Joshua D. Drake wrote:
>
> >>I am considering a setup such as this:
> >> - At least dual cpu (possibly with 2 cores each)
> >> - 4GB of RAM
> >> - 2 disk RAID 1 array for root disk
> >> - 4 disk RAID 1+0 array for PGDATA
> >> - 2 disk RAID 1 array for pg_xlog
> >>Does anyone know a vendor that might be able provide such setup?

>
> Wouldn't it be preferable to put WAL on a multi-disk RAID 10 if you
> had the opportunity? This gives you the redundancy of RAID 1 but
> approaches the performance of RAID 0, especially as you add disks to
> the array. In benchmarking, I've seen consistent success with this
> approach.
>
> --
> Thomas F. O'Connell
> Sitening, LLC
>
> http://www.sitening.com/
> 3004B Poston Avenue
> Nashville, TN 37203-1314
> 615-469-5150 x802
> 615-469-5151 (fax)


---------------------------(end of broadcast)---------------------------
TIP 1: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to majordomo@postgresql.org so that your
message can get through to the mailing list cleanly

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #4 (permalink)  
Old 04-19-2008, 09:13 AM
Scott Marlowe
 
Posts: n/a
Default Re: most bang for buck with ~ $20,000

On Tue, 2006-08-08 at 15:43, Kenji Morishige wrote:
> I've asked for some help here a few months ago and got some really helpfull
> answers regarding RAID controllers and server configuration. Up until
> recently I've been running PostgreSQL on a two year old Dual Xeon 3.06Ghz
> machine with a single channel RAID controller (previously Adaptec 2200S, but
> now changed to LSI MegaRAID). The 2U unit is from a generic vendor using what
> I believe is a SuperMicro motherboard. In the last week after upgrading the
> RAID controller, the machine has had disk failure and some other issues. I
> would like to build a very reliable dedicated postgreSQL server that has the
> ultimate possible performance and reliabily for around $20,000. The data set
> size is only currently about 4GB, but is increasing by approximately 50MB
> daily. The server also requires about 500 connections and I have been
> monitoring about 100-200 queries per second at the moment. I am planning to
> run FreeBSD 6.1 if possible, but I am open to any other suggestions if it
> improves performance.


This really depends on your usage patterns.

OLAP or OLTP workloads? Do you need 24/7 reliability and therefore a
two machine setup? There's a lot of variety in load.

Generally, you spend your money on disks, then memory, then CPU, in that
order.

Look at the Areca cards, they've come highly recommended here. Look at
LOTS of drives. Given the size of your db, you can go with LOTS of
smaller drives and get good performance. If you can find a good box to
hold 12 to 16 drives and fill it with 37 gig 15k RPM drives, you'll have
lots of storage, even in RAID 1+0 config. That's aiming at
transactional throughput.

Toss as much memory as is reasonably affordable at it. That's normally
in the 4 to 8 gig range. After that things start to get expensive fast.

Multiple - dual core CPUs are a good idea. Opterons seem to be better
"data pumps" with large memory and >2 CPUs than Intels right now.
Better to have a 2xdual core opteron with slower processors than a
single dual core or dual single core CPU(s) with a faster clock speed.
As long as the memory access is equivalent, the more CPUs the better in
Opterons, where their interconnect speed increases as you increase the
number of CPUs. Intel Xeons are the opposite. Better with fewer faster
CPUs / cores.

I just ran through a configurator on a site selling quad dual core
opteron servers. 8 Seagate cheetah 15k rpm drives, 8 gig ram, and the
slowest (1.8 GHz) AMD dual core CPUs (4 of them) for 8 cores, came out
to $13,500 or so.

I'd take the other $7.5 grand and buy a backup server that can old as
much but isn't quite as beefy and set up slony to have a live hot spare
sitting ready. Oh, and maybe to buy some spare parts to sit in the desk
drawer in case things break.

---------------------------(end of broadcast)---------------------------
TIP 1: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to majordomo@postgresql.org so that your
message can get through to the mailing list cleanly

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #5 (permalink)  
Old 04-19-2008, 09:14 AM
Kenji Morishige
 
Posts: n/a
Default Re: most bang for buck with ~ $20,000

Great info, which vendor were you looking at for these Opterons? I am goign
to be purchasing 2 of these. I do need 24/7 reliability.

On Tue, Aug 08, 2006 at 05:08:29PM -0500, Scott Marlowe wrote:
> On Tue, 2006-08-08 at 15:43, Kenji Morishige wrote:
> > I've asked for some help here a few months ago and got some really helpfull
> > answers regarding RAID controllers and server configuration. Up until
> > recently I've been running PostgreSQL on a two year old Dual Xeon 3.06Ghz
> > machine with a single channel RAID controller (previously Adaptec 2200S, but
> > now changed to LSI MegaRAID). The 2U unit is from a generic vendor using what
> > I believe is a SuperMicro motherboard. In the last week after upgrading the
> > RAID controller, the machine has had disk failure and some other issues. I
> > would like to build a very reliable dedicated postgreSQL server that has the
> > ultimate possible performance and reliabily for around $20,000. The data set
> > size is only currently about 4GB, but is increasing by approximately 50MB
> > daily. The server also requires about 500 connections and I have been
> > monitoring about 100-200 queries per second at the moment. I am planning to
> > run FreeBSD 6.1 if possible, but I am open to any other suggestions if it
> > improves performance.

>
> This really depends on your usage patterns.
>
> OLAP or OLTP workloads? Do you need 24/7 reliability and therefore a
> two machine setup? There's a lot of variety in load.
>
> Generally, you spend your money on disks, then memory, then CPU, in that
> order.
>
> Look at the Areca cards, they've come highly recommended here. Look at
> LOTS of drives. Given the size of your db, you can go with LOTS of
> smaller drives and get good performance. If you can find a good box to
> hold 12 to 16 drives and fill it with 37 gig 15k RPM drives, you'll have
> lots of storage, even in RAID 1+0 config. That's aiming at
> transactional throughput.
>
> Toss as much memory as is reasonably affordable at it. That's normally
> in the 4 to 8 gig range. After that things start to get expensive fast.
>
> Multiple - dual core CPUs are a good idea. Opterons seem to be better
> "data pumps" with large memory and >2 CPUs than Intels right now.
> Better to have a 2xdual core opteron with slower processors than a
> single dual core or dual single core CPU(s) with a faster clock speed.
> As long as the memory access is equivalent, the more CPUs the better in
> Opterons, where their interconnect speed increases as you increase the
> number of CPUs. Intel Xeons are the opposite. Better with fewer faster
> CPUs / cores.
>
> I just ran through a configurator on a site selling quad dual core
> opteron servers. 8 Seagate cheetah 15k rpm drives, 8 gig ram, and the
> slowest (1.8 GHz) AMD dual core CPUs (4 of them) for 8 cores, came out
> to $13,500 or so.
>
> I'd take the other $7.5 grand and buy a backup server that can old as
> much but isn't quite as beefy and set up slony to have a live hot spare
> sitting ready. Oh, and maybe to buy some spare parts to sit in the desk
> drawer in case things break.


---------------------------(end of broadcast)---------------------------
TIP 1: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to majordomo@postgresql.org so that your
message can get through to the mailing list cleanly

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #6 (permalink)  
Old 04-19-2008, 09:14 AM
Steve Atkins
 
Posts: n/a
Default Re: most bang for buck with ~ $20,000


On Aug 8, 2006, at 1:43 PM, Kenji Morishige wrote:

>
>
> I am considering a setup such as this:
> - At least dual cpu (possibly with 2 cores each)
> - 4GB of RAM
> - 2 disk RAID 1 array for root disk
> - 4 disk RAID 1+0 array for PGDATA
> - 2 disk RAID 1 array for pg_xlog
>
> Does anyone know a vendor that might be able provide such setup? Any
> critique in this design? I'm thinking having a 2 channel RAID
> controller to
> seperate the PGDATA, root and pg_xlog.


It's fairly similar to the system I'm using - I have 6 spindles for
PGDATA rather than 4. It's a 9550SX SATA based box with
26 bays (16 empty right now) dual Opterons and 4 gigs of
RAM. About $7k from asacomputers.com. Performance is
good, drastically better than the SCSI Dell server it replaced,
but I've not benchmarked it.

You mention 500 simultaneous connections. If that's really
500 simultaneous database connections (rather than, say,
500 simultaneous web users feeding into a pool of fewer
database connections) then that's a lot. I might go with a
bit more RAM than 4GB.

You should have a lot left over from
$20k, and spending some of it on more RAM would be a good
investment (as might a better RAID controller than the 9550SX,
some dual core CPUs and maybe some more PGDATA spindles).

Cheers,
Steve


---------------------------(end of broadcast)---------------------------
TIP 6: explain analyze is your friend

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #7 (permalink)  
Old 04-19-2008, 09:14 AM
Joshua D. Drake
 
Posts: n/a
Default Re: most bang for buck with ~ $20,000

Thomas F. O'Connell wrote:
>
> On Aug 8, 2006, at 4:49 PM, Joshua D. Drake wrote:
>
>>> I am considering a setup such as this:
>>> - At least dual cpu (possibly with 2 cores each)
>>> - 4GB of RAM
>>> - 2 disk RAID 1 array for root disk
>>> - 4 disk RAID 1+0 array for PGDATA
>>> - 2 disk RAID 1 array for pg_xlog
>>> Does anyone know a vendor that might be able provide such setup?

>
> Wouldn't it be preferable to put WAL on a multi-disk RAID 10 if you had
> the opportunity? This gives you the redundancy of RAID 1 but approaches
> the performance of RAID 0, especially as you add disks to the array. In
> benchmarking, I've seen consistent success with this approach.


WALL is written in order so RAID 1 is usually fine. We also don't need
journaling for WAL so the speed is even faster.

Joshua D. Drake

>
> --
> Thomas F. O'Connell
> Sitening, LLC
>
> http://www.sitening.com/
> 3004B Poston Avenue
> Nashville, TN 37203-1314
> 615-469-5150 x802
> 615-469-5151 (fax)
>
> ---------------------------(end of broadcast)---------------------------
> TIP 4: Have you searched our list archives?
>
> http://archives.postgresql.org
>



--

=== The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive PostgreSQL solutions since 1997
http://www.commandprompt.com/



---------------------------(end of broadcast)---------------------------
TIP 1: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to majordomo@postgresql.org so that your
message can get through to the mailing list cleanly

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #8 (permalink)  
Old 04-19-2008, 09:14 AM
Thomas F. O'Connell
 
Posts: n/a
Default Re: most bang for buck with ~ $20,000


On Aug 8, 2006, at 5:28 PM, Joshua D. Drake wrote:

> Thomas F. O'Connell wrote:
>> On Aug 8, 2006, at 4:49 PM, Joshua D. Drake wrote:
>>>> I am considering a setup such as this:
>>>> - At least dual cpu (possibly with 2 cores each)
>>>> - 4GB of RAM
>>>> - 2 disk RAID 1 array for root disk
>>>> - 4 disk RAID 1+0 array for PGDATA
>>>> - 2 disk RAID 1 array for pg_xlog
>>>> Does anyone know a vendor that might be able provide such setup?

>> Wouldn't it be preferable to put WAL on a multi-disk RAID 10 if
>> you had the opportunity? This gives you the redundancy of RAID 1
>> but approaches the performance of RAID 0, especially as you add
>> disks to the array. In benchmarking, I've seen consistent success
>> with this approach.

>
> WALL is written in order so RAID 1 is usually fine. We also don't
> need journaling for WAL so the speed is even faster.


In which case, which is theoretically better (since I don't have a
convenient test bed at the moment) for WAL in a write-heavy
environment? More disks in a RAID 10 (which should theoretically
improve write throughput in general, to a point) or a 2-disk RAID 1?
Does it become a price/performance question, or is there virtually no
benefit to throwing more disks at RAID 10 for WAL if you turn off
journaling on the filesystem?

--
Thomas F. O'Connell
Sitening, LLC

http://www.sitening.com/
3004B Poston Avenue
Nashville, TN 37203-1314
615-469-5150 x802
615-469-5151 (fax)

---------------------------(end of broadcast)---------------------------
TIP 3: Have you checked our extensive FAQ?

http://www.postgresql.org/docs/faq

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #9 (permalink)  
Old 04-19-2008, 09:14 AM
Joshua D. Drake
 
Posts: n/a
Default Re: most bang for buck with ~ $20,000

>
> In which case, which is theoretically better (since I don't have a
> convenient test bed at the moment) for WAL in a write-heavy environment?
> More disks in a RAID 10 (which should theoretically improve write
> throughput in general, to a point) or a 2-disk RAID 1? Does it become a
> price/performance question, or is there virtually no benefit to throwing
> more disks at RAID 10 for WAL if you turn off journaling on the filesystem?


Over 4 drives, I would gather that RAID 10 wouldn't gain you anything.
Possibly over 6 or 8 however, it may be faster because you are writing
smaller chunks of data, even if two copies of each.

Sincerely,

Joshua D. Drake


--

=== The PostgreSQL Company: Command Prompt, Inc. ===
Sales/Support: +1.503.667.4564 || 24x7/Emergency: +1.800.492.2240
Providing the most comprehensive PostgreSQL solutions since 1997
http://www.commandprompt.com/



---------------------------(end of broadcast)---------------------------
TIP 2: Don't 'kill -9' the postmaster

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #10 (permalink)  
Old 04-19-2008, 09:14 AM
Thomas F. O'Connell
 
Posts: n/a
Default Re: most bang for buck with ~ $20,000


On Aug 8, 2006, at 6:24 PM, Joshua D. Drake wrote:

>> In which case, which is theoretically better (since I don't have a
>> convenient test bed at the moment) for WAL in a write-heavy
>> environment? More disks in a RAID 10 (which should theoretically
>> improve write throughput in general, to a point) or a 2-disk RAID
>> 1? Does it become a price/performance question, or is there
>> virtually no benefit to throwing more disks at RAID 10 for WAL if
>> you turn off journaling on the filesystem?

>
> Over 4 drives, I would gather that RAID 10 wouldn't gain you
> anything. Possibly over 6 or 8 however, it may be faster because
> you are writing smaller chunks of data, even if two copies of each.


Yeah, where I've seen the benefits in practice, the scenarios have
involved the availability of a minimum of 6 drives for a RAID 10 for
WAL. I really should do a comparison of a 2-disk RAID 1 with a
variety of multi-disk RAID 10 configurations at some point.

--
Thomas F. O'Connell
Sitening, LLC

http://www.sitening.com/
3004B Poston Avenue
Nashville, TN 37203-1314
615-469-5150 x802
615-469-5151 (fax)

---------------------------(end of broadcast)---------------------------
TIP 4: Have you searched our list archives?

http://archives.postgresql.org

Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Reply


Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On
Forum Jump


All times are GMT. The time now is 04:21 AM.


Powered by vBulletin® Version 3.6.5
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
Search Engine Optimization by vBSEO 3.2.0
www.UnixAdminTalk.com