Optimizing Data Distribution

posted Oct 11, 2013, 8:19 AM by Sachchida Ojha
When you create a table in Greenplum Database, it is important to declare a distribution key that allows for even data distribution across all segments in the system. Because the segments work on a query in parallel, Greenplum Database will always be as fast as the slowest segment. If the data is unbalanced, the segments that have more data will return their results slower.
Comments