MarketBitonic sorter
Company Profile

Bitonic sorter

Bitonic mergesort is a parallel algorithm for sorting. It is also used as a construction method for building a sorting network. The algorithm was devised by Ken Batcher. The resulting sorting networks consist of comparators and have a delay of , where is the number of items to be sorted. This makes it a popular choice for sorting large numbers of elements on an architecture which itself contains a large number of parallel execution units running in lockstep, such as a typical GPU.

Bitonic Sorter
A bitonic sorter for k=1 (n=2) is simply a comparator. This is illustrated by the given box layout, in which X and Y represent the inputs, while H and L represent the higher and lower outputs, respectively. With the sorter for k=1, we can recursively create a sorter of higher order. For example, consider the following k=2 (n=4) bitonic sorter. The bitonic sorter consists of two layers: a recombination layer, which recombines the bitonic inputs into two new bitonic sequences that are each half as long as the original sequence, and a bitonic sort layer consisting of two bitonic sorters of order k-1, each of which sorts one of the two bitonic sequences produced by the previous layer. This structure may be extended recursively for higher values of k by ensuring that each comparator always accepts one input from each of the two halves of the bitonic sequence it is meant to help sort. The following illustration depicts these connections schematically. As you can see, elements of the first half of the input sequence are pairwise compared against corresponding elements of the last half of the input sequence. Comparing each element of the (green) subsequence with the element of the other (orange) subsequence at the respective index produces two bitonic subsequences. These two bitonic series (blue and red, respectively) can then be fed into the next lower-order bitonic sorter. This can be done because all elements in the red sequence are guaranteed to be higher than all elements in the blue series. Correctness of the bitonic sorter Ken Batcher provided some mathematical proof sketch in his paper. With out loss of generality the bitonic input sequence is assumed to be a_1 \leq a_2 \leq \dots \leq a_{j-1} \leq a_j \geq a_{j+1} \geq \dots \geq a_{2n} with 1 \leq j \leq 2n. With out loss of generality the sequence can be reversed therefore, we can assume n \leq j \leq 2n. Case 1: If a_n \leq a_{2n} then every element of the two subsequences are smaller. In this case d_i=a_i and e_i=a_{n+i} with 1 \leq i \leq n and therefore d_i and e_i are trivially bitonic. Case 2: Otherwise there exists a k such that the element a_k of the first sub-sequence is bigger than a_k of the second sub-sequence while it is the opposite for a_{k+1}. This means that a_k \leq a_{k+n} and a_{k+1} > a_{k+n+1} are true for a specific k. Therefore, we now know that: 1. For 1 \leq i \leq k the sequences are d_i=a_i and e_i=a_{n+i} 2. For k the sequences are defined as the opposite of 1, with d_i=a_{n+i} and e_i=a_i In the original paper he then claims the following inequalities result from those definitions: Following from 1: • For 1 \leq i \leq k that d_i \leq d_{i+1} • For j-n \leq i \leq k that e_i \geq e_{i+1} • For 1 \leq i \leq j-n that e_i \leq e_{i+1} Following from 2: • For k that d_i \geq d_{i+1} • For k that e_i \leq e_{i+1} From both: e_n \leq e_1 From the paper claims follows that the sequences d_i and e_i are in fact bitonic. == Bitonic Sorting Networks (Bitonic Merge Sort) ==
Bitonic Sorting Networks (Bitonic Merge Sort)
A bitonic sorting network is created by using several bitonic sorters. These bitonic sorters are recursively used to create two monotonic sequences, one decreasing and one increasing, which are then put into the next stage. This creates a bitonic series for the next stage, which can then use this bitonic series as a monotonic series for the next stage. Consider the following example for an n=4 bitonic sort network. The bitonic sorting network for k=2 can be created by using a k=2 bitonic sorter and two k=k_{prev}-1 sorters. The two sorters create a decreasingly or increasingly sorted sequence in order to create a bitonic input for the bitonic sorter. Bitonic sorting networks of a lower order are mostly used for the two pre-sorters; therefore, a recursive definition of a bitonic sorting network from bitonic sorters can be described. In the above example, the two bitonic sorting networks are k=1 networks; hence, they are just a comparator. The following figure shows the overall scheme. This overall scheme requires the sorter to have an input of sequence that is a power of two. There are, however, possibilities to mitigate this by, for example, using sentinel values. == Pseudocode ==
Pseudocode
The following pseudocode describes the sorting process. In the code, a is the array to be sorted, low is the index of the first item in the sub-array to be sorted, k and count is the number items in the sub-array that are being sorted in this function call. direction is a boolean value that determines whether the sub-array is being sorted into ascending / descending order. The function call bitonicSort(a, 0, n, 1) is used to sort a (ascending), where n is the number of items in a . function bitonicMerge(a, low, count, direction) is if count > 1 THEN k ← count / 2 // Compare and swap elements across the halves for i ← low to low + k do // determine if two elements of a are out of order in relation to the direction of sorting. if (direction == 1 AND a[i] > a[i + k]) OR (direction == 0 AND a[i] 1 THEN k ← count / 2 // Sort first/second half into ascending/descending order bitonicSort(a, low, k, 1) bitonicSort(a, low + k, k, 0) // Merge entire sequence in desired order bitonicMerge(a, low, count, direction) == Complexity ==
Complexity
In this section we assume that our sorter has n=2^k input elements as previously. Each recursion in a bitonic sorting network adds a sorter of order k_{next}=k_{prev}-1, which consists of bitonic k_{next} sorter and the next recursion. As both sub-sorters can be done in parallel, only one level is added for each level in both sub-sorters. Each bitonic sorter has, therefore, one recombination layer and a lower-order bitonic sorter for its recursion. This results in k levels per bitonic sorter. Therefore, we can describe this construction's levels as the following sum: \sum_{i=1}^{k} i. This sum can be reduced using the Gauss sum formula \sum_{i=1}^{k} i = \dfrac{1}{2}k(k+1) Therefore, the number of levels in which each comparison can be done in parallel is given by \dfrac{1}{2} k(k+1). Which gives us \mathcal{O}(k^2+k)=\mathcal{O}(k^2)=\mathcal{O}((\log_2 n)^2) assuming n comparisons can be performed in parallel. Although the absolute number of comparisons is typically higher than Batcher's odd-even sort, many of the consecutive operations in a bitonic sort retain a locality of reference, making implementations more cache-friendly and typically more efficient in practice. ==See also==
tickerdossier.comtickerdossier.substack.com