Re: about

From: Willy Tarreau <w@1wt.eu>
Date: Mon, 22 Mar 2010 21:42:34 +0100


Hi Leon,

[ first, please use a meaningful mail subject next time; I was about to delete this one believing it was spam ]

On Mon, Mar 22, 2010 at 02:38:54PM +0800, Leon wrote:
> Hi all
> I have some questions about HAProxy, which we are testing. We want to
> load balance the disk cache of a huge image application. HAProxy
> provides a consistent hash for backends.
> As we know, HAProxy's backup servers cannot be configured one-to-one
> with their main servers.

This is something which is planned ("buddy server") but it's not as trivial to implement as it first seems, so maybe we'll see this in 1.5, maybe not.

> If an image cache server is down, consistent hashing still causes
> an image cache server to rebuild its cache.

But only that server; that's the purpose of consistent hashing.

> It is dangerous, because rebuilding the image cache is a heavy
> operation and the storage servers cannot take that load.

Well, then you don't even need consistent hashing. The very purpose of consistent hashing is to be able to add/remove servers in the pool with minimal redistribution, leading to minimal impact: losing one server out of N only remaps roughly 1/N of the URIs, while the other servers keep their share of the hash. If your servers can't even withstand that minimal redistribution, you should change the method and think about the problem differently.

> So could HAProxy provide a pool-based hash? Something like this:
> ===original===
> backend image-cache-server-end
> mode http
> balance uri
> hash-type consistent
> option allbackups
> server webA 127.0.0.1:8081 check inter 1000
> server webB 127.0.0.2:8081 check inter 1000
> server webC 127.0.0.3:8081 check inter 1000 backup
> server webD 127.0.0.4:8081 check inter 1000 backup
>
> ====my suggestion====
>
> serversdefine image-farm-pool1
> option allbackups # every backup server in the pool can provide service
> server 127.0.0.1:8081 check inter 1000
> server 127.0.0.2:8081 check inter 1000 backup
> server 127.0.0.3:8081 check inter 1000 backup
>
> serversdefine image-farm-pool2
> option allbackups
> server 127.0.0.4:8081 check inter 1000
> server 127.0.0.5:8081 check inter 1000 backup
> server 127.0.0.6:8081 check inter 1000 backup
>
> serversdefine image-farm-pool3
> option allbackups
> server 127.0.0.7:8081 check inter 1000
> server 127.0.0.8:8081 check inter 1000 backup
> server 127.0.0.9:8081 check inter 1000 backup
>
> backend image-cache-server-end
> mode http
> balance uri
> hash-type consistent # the consistent hash only applies to picking a pool
> option allbackups
> servers image-farm-pool1
> servers image-farm-pool2
> servers image-farm-pool3
>
> If HAProxy provided such a configuration, we could build a reliable
> disk cache for a huge image cache cluster.

Yes, that's doable. Just proceed as we did in the old days, when there was no frontend/backend distinction. You basically want two levels of load balancing, so use two frontend/backend levels :

First level: the backend connects to one of the second-level frontends :

    backend image-cache-server-end
        mode http
        balance uri
        # not needed here: hash-type consistent (the set of pools is static)
        server image-farm-pool1 127.0.0.1:60001
        server image-farm-pool2 127.0.0.1:60002
        server image-farm-pool3 127.0.0.1:60003

    listen pool1
        bind 127.0.0.1:60001
        mode http
        server webA 127.0.0.1:8081 check inter 1000
        server webB 127.0.0.2:8081 check inter 1000 backup
        server webC 127.0.0.3:8081 check inter 1000 backup

    listen pool2
        bind 127.0.0.1:60002
        mode http
        server webA 127.0.0.4:8081 check inter 1000
        server webB 127.0.0.5:8081 check inter 1000 backup
        server webC 127.0.0.6:8081 check inter 1000 backup

    listen pool3
        bind 127.0.0.1:60003
        mode http
        server webA 127.0.0.7:8081 check inter 1000
        server webB 127.0.0.8:8081 check inter 1000 backup
        server webC 127.0.0.9:8081 check inter 1000 backup

That way you have 3 servers, each one with two backups. If the disk caches are shared between all servers in a pool, you can even use other algorithms such as leastconn or roundrobin and remove the "backup" keywords. One could even imagine performing balancing on a limited uri depth/length at the first level and using the whole length at the second level.
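
For example, a pool whose servers share their cache could look like this
(only a sketch; the "depth 3" value and the server names/addresses are
arbitrary examples, not a recommendation):

    backend image-cache-server-end
        mode http
        # first level: hash on the first 3 URI components only
        balance uri depth 3
        server image-farm-pool1 127.0.0.1:60001
        server image-farm-pool2 127.0.0.1:60002

    listen pool1
        bind 127.0.0.1:60001
        mode http
        # shared cache: all servers active, no "backup" keyword
        balance leastconn
        server webA 127.0.0.1:8081 check inter 1000
        server webB 127.0.0.2:8081 check inter 1000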

If you're afraid of losing a complete pool, it's easy to add checks, for instance by setting a monitor-uri in each "listen" and enabling HTTP checks on the first-level backend, as sketched below.
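
Something along these lines (a sketch; the "/alive" path and the
"pool1_down" ACL name are made up for the example):

    backend image-cache-server-end
        mode http
        balance uri
        # probe each pool's monitor-uri instead of a plain TCP check
        option httpchk GET /alive
        server image-farm-pool1 127.0.0.1:60001 check
        server image-farm-pool2 127.0.0.1:60002 check
        server image-farm-pool3 127.0.0.1:60003 check

    listen pool1
        bind 127.0.0.1:60001
        mode http
        acl pool1_down nbsrv(pool1) lt 1
        # answer 200 on /alive while the pool has usable servers left,
        # 503 once they are all down, so the first-level check fails
        monitor-uri /alive
        monitor fail if pool1_down
        server webA 127.0.0.1:8081 check inter 1000
        server webB 127.0.0.2:8081 check inter 1000 backup
        server webC 127.0.0.3:8081 check inter 1000 backup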

You just need to be aware that proceeding like this will reduce performance, but if the images are huge and the connection rate is low, you could use TCP splicing on Linux and then the performance drop should be very small.
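
For the record, enabling splicing could look like this (assuming haproxy
was built with Linux splice() support and runs on a recent enough
kernel):

    listen pool1
        bind 127.0.0.1:60001
        mode http
        # forward response bodies in the kernel when possible, so the
        # large image payloads don't cross userspace twice
        option splice-response
        server webA 127.0.0.1:8081 check inter 1000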

Regards,
Willy