A Byte of Blockchain - Week 23 Block Mining

·

8 min read

A Byte of Blockchain - Week 23
Block Mining

Photo by Kanchanara on Unsplash

Recap

Last week we explained how blocks are "chained" to each other thus earning the name Blockchain.

We deep dived into block headers which contain the following fields :

  1. The previous block hash
  2. Fields related to mining
  3. Merkle Root Tree - a unique hash of all the transactions in the block combined.

BlockHash-1.jpg

We explained the previous block hash last week. Before we get into fields related to mining, let us explain what exactly is mining in a blockchain?

Mining

Mining is the process of appending new blocks to the blockchain data structure

It is the mechanism by which transactions are authorized and added to the decentralized ledger.

So, when a new block is added to the existing chain of blocks, we say the block is mined. Now, let us go back to transaction authorization in a bank. There are system & organizational controls when inputting and authorizing a transaction like :

  1. Only an authorized staff who has the relevant system privileges can input a transaction.
  2. A bank staff who has the system privileges to authorize a transaction only can do so and no one else.

Similarly, in a blockchain, only certain nodes can mine blocks. These nodes are called miner or mining nodes. Miner nodes VALIDATE transactions and ADD THEM to the decentralized ledger.

So, the next question is are there any controls when adding transactions to the decentralized ledger? Not in the traditional sense since the ledger is decentralized amongst nodes.

In the absence of controls like that in centralized systems, how is integrity of the decentralized ledger ensured? The integrity is ensured by increasing the complexity & cost of mining a block that acts as a deterrent to add a fraudulent or false transaction to the ledger. What does this mean & what happens if a miner adds fraudulent or false transactions in a block?

We know that miners can add blocks to the blockchain. But these blocks have to be validated by the other nodes in the network & agreed by all of them to get confirmed (called CONSENSUS - Refer weeks 9 & 10 for more details on consensus) . If there are fraudulent or double spending transactions, these will be rejected by the nodes thus rendering that block invalid.

Now, assume the miner spent significant resources like processing power and electricity to mine the block. Then, all that effort & cost of adding the block would be wasted. Also, the miner would lose incentives in the form of newly minted bitcoin & transaction fees (Refer Weeks 14 & 15 for more details on incentive mechanisms). Thus, there is a strong deterrence by way of high effort & cost to ensure miners add only valid transactions to the blocks as it would get rejected otherwise.

Proof of Work

A mining node can add blocks to a blockchain only after solving hard cryptographic puzzles. And these nodes compete with each other to add blocks to the blockchain. The puzzle is so difficult to solve that there are specialized hardware only to solve these puzzles. The cost factor to mine a block is the processing power and electricity expended for solving this problem.

The solution to the cryptographic problem is called PROOF OF WORK. What is this cryptographic problem and what is the proof of work solution?

Let us now explain the two fields related to mining in the Block Header. These are

  1. Target
  2. Nonce

During mining, the block header is hashed repeatedly, changing one parameter, till the resulting hash matches the specified target. The hash function SHA256 is used in the bitcoin mining process. Sounds a mouthful? Let us visualize it with an example below:

Go to this website sha256algorithm.com.

There is a place to input text on the top left where you can input any text and the SHA256 hash output will be displayed at the bottom. The below screenshots will help in visualizing the same :

SHA256-screenshot.jpg

Now, let us do an exercise to hash through a block header called "Blockchain" and find out the SHA256 hash outputs. If we hash "Blockchain", we will get the same hash. To get different hashes, we need to add a parameter which will change each time we run the hash. So we add two digit numbers from 00 to 10 as below :

Blockchain00

Blockchain01

Blockchain02

Blockchain03

Blockchain04

Blockchain05

Blockchain06

Blockchain07

Blockchain08

Blockchain09

Blockchain10

The SHA256 hash for each of the above items are:

SHA256.jpg

Nonce

Each of the above produces a completely different hash even though the only difference in the input is the last two digits which starts from 00 to 10. The last two digits which vary in the above list is called a NONCE. The nonce is used to iterate through a series of inputs to produce a different set of hash outputs.

  1. Why do we have to do all this hashing and iterations ? and

  2. Why do we need a Nonce?

Target & Block Time

The goal of all this hashing and iterations is to find a hash that is less than or equal to something called TARGET. In other words, the proof-of-work algorithm must produce a hash that is less than or equal to the target.

What is this target? Who fixes this target and how is it calculated?

If we take bitcoin blockchain, blocks are generated every 10 minutes on an average. This is called Block Time. It refers to the time it takes to mine a new block. In Ethereum, the average block time is between 12 to 14 seconds. This expected block time is fixed by the protocol itself and adjusts itself taking into consideration the increase in nodes and processing power over the years. This means

  1. If the number of nodes or the processing power of the nodes increase which can result in more blocks being mined, the protocol adjusts the difficulty level (target) to make it more difficult to mine the block - meaning more processing power & electricity or
  2. If the number of nodes decrease or there is a decrease in processing power, the protocol adjusts the difficulty level (target) to make it easier to mine the block

Thus, the 10 minute block time is kept constant across irrespective of the number of nodes or increase in processing power.

The below visualization makes it clear to understand

NodesIncrease.jpg

Now, what do we exactly mean by

  1. finding a hash that is less than or equal to the target
  2. How does the protocol make it more or less difficult

Game of Dice

Let us play a numbers game with a pair of dice

brett-jordan-4aB1nGtD_Sg-unsplash.jpg

Photo by Brett Jordan on Unsplash

We are given a target and we have to throw the pair of dice equal to or less than the target.

In the first round, our target is to get a number less than or equal to 10. Unless we throw 6+6, or 5+6, we won the first round and we don't have to throw much to arrive at the target.

In the second round, our target is to get a number less than or equal to 9. Unless we throw 6+6 or 5+6 or 5+5 or 4+6, we won the second round and we have to throw slightly more to arrive at 9 compared to 10

Assume after several rounds, the target is down to 3. Now, the number of throws required to get a number less than or equal to 3 will increase exponentially meaning it gets tougher to arrive at 3 and we will have to keep trying again and again till the pair of dice gives us a combination of 3 or less.

The below visualization makes it more easy :

Dicethrows.jpg

Using the above analogy, the target hash

  1. Is set at a lower threshold where the difficulty level should be higher and
  2. Is set at a higher level when the difficulty level should be lower

In place of throws, it is the number of hashing attempts required to arrive at the target.

For example, to make it more easier to mine a block, the target hash would be set at say, 10 and hence lesser number of hashing attempts would be required to get a hash equal to or less than 10. To make it more difficult to mine a block, the target hash would be set at say 3 and more number of hashing attempts would be required to get a hash equal to or less than 3.

As we explained above, the hashing attempts are done by iterating the nonce each time giving a different hash output. In our example above, let us assume that the target requires us to get a hash which starts with two zeroes. This is achieved in "Blockchain01" meaning it took one attempt for us to achieve that target. This is where a nonce changes like a counter each time incrementing by n to change the block header so that the required target hash is arrived say at X attempts.

Summary

To summarize :

  1. Mining is the mechanism by which transactions are authorized & added to the decentralized ledger.

  2. Mining can only be done by specialized nodes called MINER NODES who use specialized mining rigs.

  3. Miners spend resources like processing power & electricity to mine a block.

  4. The processing power & electricity is used to solve a problem whose solution is called PROOF OF WORK

  5. The objective of solving the problem is to arrive at a block hash which satisfies a target condition.

  6. The complexity of the problem is controlled by the protocol depending on the number of nodes & their processing power. More number of nodes mean higher difficulty threshold & lower number of nodes mean lower difficulty threshold so that the block time is maintained at 10 minutes (for Bitcoin Blockchain) or 12 to 14 seconds (for Ethereum Blockchain)

  7. The target field in the block header gives the target which the miner has to solve to add the block to the blockchain

  8. The Nonce field is used to iterate the block header to keep on hashing till the output hash satisfies the target conditions

What is Merkle Tree & now that the miner solved the proof of work, what next?