The NNUE used for the original 2018
computer shogi implementation consists of four weight layers: W1 (16-bit integers) and W2, W3 and W4 (8-bit). It has 4 fully-connected layers,
ReLU activation functions, and outputs a single number, which is the score of the board. NNUE starts with how it looks at the
board. Instead of passing a simple 8x8 grid, it uses a massive, sparse input layer, often called the HalfKP feature set. This means the network tracks the
position of every piece relative to the
king's location. Because only one piece moves at a time, the engine doesn't need to recalculate the entire network from the start for every move, it simply "updates" the changed
neurons. The first layer is massive, often containing tens of thousands of input features, but it stays fast because it is
sparse. == Performance ==