Trying to expand my knowledge a bit on clustered file systems. I've never had any experience working with them so I figured spinning up a couple test VMs would be a good starting point.
Anyway, one of the first question that popped into my head is how does RAID fit into the picture with Gluster? I understand that Gluster has three different types of architectures: Distributed (not-redundant), Replicated (redundant), and Distributed Replicated (an amalgamation of the two).
Am I understanding correctly that the Distributed type is similar to JBOD? Wouldn't RAID be necessitated under this type of architecture if one cared about resiliency? I'm trying to understand why one would use Distributed? Would its use case be for utilizing the maximum amount of underlying storage available similar to JBOD? Are Distributed Gluster deployments typically in Production?
Secondly, the Replicated architecture type seems to be quite interesting. Am I correct in assuming that RAID wouldn't be required in this scenario since we are essentially creating copies of the same data on every brick in the pool? To me this sounds essentially like a RAID mirror which seems pretty neat. Can the GlusterFS be built upon LVM? If so, adding storage to each node in order to expand the bricks at a later time would be pretty easy.
Finally, from doing some reading, it appears Gluster isn't known for its speed. Of course a lot of that depends on the network infrastructure its built on but I don't see how it could ever be as fast as local disks. What is the typically use case for something like Gluster? Massive storage requirements?